Cloning and expression of ACE2 genes from different bat species

ACE2 genes from seven bat species were amplified and cloned (Fig. 1, sFig. 1). Full-length genes were obtained from Rhinolophus ferrumequinum from Hubei province (Rf-HB), R. macrotis from Hubei province (Rm-HB), R. pearsoni from Guangxi (Rp-GX), R. pusillus from Hubei province (Rpu-HB), R. sinicus from Guangxi province (Rs-GX) and R. sinicus from Hubei province (Rs-HB). For the following bat species, amplification of the full-length coding region was not successful, and instead,the N-terminal region was cloned in frame with the C-terminal region of the human ACE2 gene to form a chimeric full-length ACE molecule: R. pearsoni from Guizhou province (Rp-GZ), Myotis daubentonii bat from Yunnan province (Md-YN) and Hipposideros pratti bat from Henan province (Hp-HN). The full-length sequences of bat ACE2 are identical in size to that of hACE2 (805 aa in total). Sequence comparison showed that bat ACE2s are closely related to ACE2s of other mammals and have an aa sequence identity of 80–82% to human and civet ACE2. The aa identity of ACE2 from different bat families ranges from 78 to 84%, and within the genus Rhinolophus, the sequence identity increases to 89–98%. The major sequence variation among bat ACE2s is located in the N-terminal region, which has been identified in structural studies as the SARS-CoV-binding region [6, 7]. A phylogenetic tree was constructed based on the sequences of bat ACE2 (sFig. 2) using the MEGA package [14].

Fig. 1 Sequence alignment of SARS-CoV binding regions of ACE2s from 9 bats, civet and human. The GenBank accession numbers of bat, civet and human ACE2 are as follows: human (NM021804), civet (AY881174), Rf-HB (GQ999931), Rm-HB (GQ999932), Rs-GX (GQ999933), Rp-GX (EF569964), Hp-HN (GQ999934), Rp-GZ (GQ999935), Rs-HB (GQ999936), Md-YN(GQ999937) and Rpu-HB (GQ999938). The alignment was generated using ClustalX v1.83. In black are single, fully conserved residues. In gray are strongly conserved residues. In light gray are weakly conserved residues. Asterisks indicate residues that interact directly with the receptor-binding domain of the SARS-CoV S protein Full size image

All ACE2 genes were cloned into a eukaryotic expression vector and used to transfect HeLa cells. Western blot analysis showed that all ACE2s were expressed efficiently and at very similar levels and were recognized by a rabbit anti-bat ACE2 antibody with an apparent molecular weight of approximately 100–130 kDa (Fig. 2c).

Fig. 2 Testing of the ability of bat ACE2 proteins to mediate pseudovirus HIV/BJ01-S and live SARS- CoV infection. a HeLa cells transfected with plasmids encoding bat and human ACE2s were infected with pseudovirus HIV/BJ01-S. Infectivity was determined by measuring the activity of reporter luciferase as described in “Materials and methods”. HeLa cells transfected with pcDNA3.1 and human ACE2 were used as the negative and positive controls, respectively. All tests were performed in triplicate, and the experiments were repeated three times. The error bar represents the calculated standard deviation. I27T, N31K, K35E, and H41Y are mutants of MdACE2 that were made using a QuikChange II Site-Directed Mutagenesis Kit. b SARS-CoV live virus infection using the ACE2s from bats as described in “Materials and methods”. HeLa cells transfected with pcDNA3.1 and human ACE2 were used as the negative and positive controls, respectively. c Expression of bat or human ACE2. Lysates from HeLa cells transfected with plasmid expressing human or bat ACE2 were analyzed by western blot. Rabbit anti-bat ACE2 polyclonal antibody (upper panel) or β-actin monoclonal antibody (lower panel) was used as the primary antibody. Lane 1 vector pcDNA3.1 control; lanes 2–10 bat ACE2 from samples Rf-HB, Rm-HB, Rpu-HB, Hp-HN, Rp-HB, Rp-GZ, Rs-GX, Rs-HB and Md-YN; lanes 11–14 Md-YN ACE2 mutant I27T, N31K, K35E and H41Y; lane 15 human ACE2. The abbreviations of bat species are given in the main text Full size image

Functionality of bat ACE2 as an SARS-CoV entry receptor

To examine the susceptibility of different bat ACE2 molecules to SARs-CoV entry, the HIV/BJ01-S pseudovirus system was used to infect HeLa cells transiently expressing bat ACE2 or human ACE2 genes. Among the bat ACE2s, only MdACE2 (MdACE2) and Rs-HB ACE2 demonstrated significant pseudovirus infection, as deduced from the significantly higher level of luciferase activity in comparison to background activity in the negative control (Fig. 2a). Although such assays are not to be viewed as an absolute quantification of receptor activity, it is nevertheless worth noting that MdACE2-mediated infection seemed to be more efficient than with Rs-HB ACE2. In the same context, it is clear that the bat ACE2s were less efficient overall than the human ACE2 in this particular assay system. The biological significance of this observation remains to be determined. The functionality of MdACE2 and Rs-HB ACE2 as SARS-CoV entry receptors was further confirmed by infection with live virus. As shown in Fig. 2b, both bat ACE2 proteins could clearly support SARs-CoV infection. No attempt was made to quantify infection efficiency in this study due to difficulties encountered in conducting experiments under BSL4 conditions.

Structural modeling of bat ACE2 molecules

Homologous structural modeling of human SARS-CoV RBD complexed with MdACE2 supports MdACE2 as a receptor for human SARS-CoV S protein. The crystal structure of human SARS-CoV RBD complexed with hACE2 shows that two salt bridges at the SARS-CoV-hACE2 interface, between hACE2 Lys31 and Glu35 and between hACE2 Lys353 and hACE2 Glu38, are both buried in a hydrophobic environment and contribute critically to the SARS-CoV-hACE2 interactions (Fig. 3a, c) [7]. Disturbance of the formation of either of these salt bridges weakens SARS-CoV-hACE2 binding. The Lys31-Glu35 salt bridge at the SARS-CoV-hACE2 interface becomes an Asn31-Lys35 hydrogen bond at the SARS-CoV-Md-YNACE2 interface (Fig. 3b), which possibly weakens virus-receptor binding but still is largely compatible with the virus-receptor interface. Thr27 on hACE2 supports the Lys31-Gu35 salt bridge through hydrophobic interactions with Tyr475 (Fig. 3a); Ile27 on MdACE2 supports the Asn31-Lys35 hydrogen bond more efficiently than Thr27 through tighter hydrophobic interactions with Tyr475 (Fig. 3b). Moreover, Tyr41 on hACE2 supports the Lys353-Glu38 salt bridge (Fig. 3c); His41 on MdACE2 supports the same salt bridge less efficiently than Tyr41 (Fig. 3d). Overall, MdACE2 is an efficient receptor for SARS-CoV, despite the fact that its receptor activity is lower than that of hACE2.

Fig. 3 Homologous structural modeling of SARS-CoV and Md-YN ACE2 (MdACE2) interactions. a Critical salt bridge between hACE2 Lys31 and Glu35 and the hydrophobic residues surrounding it, based on the experimentally determined crystal structure of SARS-CoV RBD complexed with hACE2 (PDB 2AJF). b Homologous structural modeling of the hydrogen bond between MdACE2 Asn31 and Lys35. The modeling was done in the program O [3]. c Critical salt bridge between hACE2 Lys353 and Glu38 and the hydrophobic residues surrounding it, based on the structure of SARS-CoV RBD complexed with hACE2. d Homologous structural modeling of the salt bridge between MdaACE2 Lys353 and SARS-CoV Glu38 and the hydrophobic residues surrounding it. Structural illustrations were prepared using the program Povscript [2] Full size image

Compared with MdACE2, Rs-HB ACE2 contains Glu31 and Glu35, which are not compatible with each other due to their same negative charges, which disfavor virus-receptor binding. However, Rs-HB ACE2 also contains Thr27 and Tyr41, both of which support SARS-CoV entry by contributing favorably to the hydrophobic interactions at the virus-receptor interface. Thus, Rs-HB is a low-efficiency receptor for SARS-CoV. All of the other bat ACE2 molecules contain combinations of the aforementioned key residues that are completely incompatible with virus–receptor interactions. More specifically, they either contain same-charged residues at the 31 and 35 positions, which repel each other, or contain His41 and Lys27, which disfavor SARS-CoV binding (Fig. 1). In particular, Lys27 on some of these bat ACE2 molecules is incompatible with certain hydrophobic residues, such as Leu443 and Phe460, on SARS-CoV RBD (Fig. 3a, b). Therefore, these bat ACE2 molecules are not receptors for SARS-CoV.

Site-directed mutagenesis analysis

To confirm the above homologous structural analysis, we carried out site-directed mutagenesis on MdACE2. Our results show that mutations E31K, K35E, and I27T all dramatically decrease the receptor activity of MdACE2, whereas mutation H41Y greatly increases its receptor activity (Fig. 2a). Therefore, our mutagenesis data further confirmed that key residues in ACE2 determine the receptor activity of MdACE2.

Our finding that M. daubentoni and R. sinicus could support SARS-CoV infection has important implications in relation to the origin of SARS-CoV. Since all lines of investigation have indicated that ACE2-binding affinity is among the important determinants for SARS-CoV host range, our data would suggest that M. Daubentonii and R. sinicus have the potential to serve as the direct reservoirs for human SARS-CoV or its highly related civet SARS-CoV. To further investigate the potential of M. Daubentonii and R. sinicus as reservoirs for SARS-CoV, more efforts will have to be directed toward widening the surveillance of bats in these families and in different geographical locations.

Another important finding of our current study is the great genetic diversity of bat ACE2 proteins, which is in contrast to the genetically homogenous hACE2 [10]. Sequence variations of bat ACE2, especially in positions that are critical to SARS-CoV binding, such as residues 27, 31, 35, and 41, suggest that, in addition to the Md-YN and Rs-HB ACE2s, there may be many other bats with an ACE2 protein that makes them susceptible to SARS-CoV entry. This again highlights the need for more field surveillance and molecular characterization of different bat ACE2 proteins until the true reservoir host of SARS-CoV is identified and its spillover mechanisms and transmission pathways are fully characterized.