Detailed How To: The Potential for Respiratory Droplet–Transmissible A/H5N1 Influenza Virus to Evolve in a Mammalian Host

* This information has been made public; I am leaving the figures out...

Science 22 June 2012:
Vol. 336 no. 6088 pp. 1541-1547
DOI: 10.1126/science.1222526


Avian A/H5N1 influenza viruses pose a pandemic threat. As few as five amino acid substitutions, or four with reassortment, might be sufficient for mammal-to-mammal transmission through respiratory droplets. From surveillance data, we found that two of these substitutions are common in A/H5N1 viruses, and thus, some viruses might require only three additional substitutions to become transmissible via respiratory droplets between mammals. We used a mathematical model of within-host virus evolution to study factors that could increase and decrease the probability of the remaining substitutions evolving after the virus has infected a mammalian host. These factors, combined with the presence of some of these substitutions in circulating strains, make a virus evolving in nature a potentially serious threat. These results highlight critical areas in which more data are needed for assessing, and potentially averting, this threat.

Recent studies have shown that the A/Indonesia/5/2005 avian A/H5N1 influenza virus may require as few as five amino acid substitutions (1), and the A/Vietnam/1203/2004 A/H5N1 influenza virus requires four substitutions and reassortment (2), to become transmissible between ferrets via respiratory droplets. Here, we assess the likelihood that these substitutions could arise in nature. We first analyzed A/H5N1 sequence surveillance data to identify whether any of these substitutions are already circulating. We then explored the probability of the virus evolving the remaining substitutions after a spillover event of an avian virus into a single mammalian host and in a short chain of transmission between mammalian hosts.

The minimal set of substitutions identified by (1) (the Herfst et al. set) contains two receptor-binding amino acid substitutions, Q222L and G224S (H5 numbering used throughout) in the hemagglutinin (HA), known to change the virus from the more avian-like alpha-2-3–linked sialic acid specificity to the more humanlike alpha-2-6–linked sialic acid (3, 4). The remaining three substitutions in the set are T156A in HA, which disrupts the N-linked glycosylation sequon spanning positions 154 to 156; H103Y in the HA trimer-interface; and E627K in the PB2, which is a common mammalian polymerase adaptation (5). (Numbers refer to amino acid positions in the mature H5 proteins; for example, Q222L indicates that glutamine at position 222 was replaced by leucine. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr.)

The four amino acid substitutions in HA identified by (2) (the Imai et al. set) also contain two receptor-binding amino acid substitutions, N220K and Q222L, one of which is in common with the Herfst et al. set and which together are known to change the sialic acid linkage preference to the more human-like alpha-2-6 linkage (2). The remaining two substitutions are N154D, which disrupts the same N-linked glycosylation sequon as the T156A substitution in the Herfst et al. set, and T315I in the stalk region.

Of the three receptor-binding substitutions in the two sets, only N220K in the Imai et al. set has been detected by means of surveillance in consensus sequencing of the HA of A/H5N1 viruses, and only in 2 of 3392 sequences [both avian viruses, one from 2007 in Vietnam, one from Egypt in 2010 (Fig. 1, B and F, black arrows)]. The T315I stalk substitution and H103Y trimer interface substitution have each been detected once in two viruses from China in 2002 (Fig. 1, A and B, orange arrows). T315I has been detected in two pre-1997 H5N1 viruses, four H5N2 viruses, two H5N3 viruses, and two H5N9 viruses. H103Y has been detected in five H5N2 viruses and one H5N3 virus. The remaining substitutions, N154D and T156A in the HA glycosylation sequon and E627K in PB2, however, are common and occur in 942 of 3392, 1803 of 3392, and 432 of 1612 sequences, respectively. A summary of the substitutions detected in surveillance is shown in fig. S1 and table S1. For viruses in which both HA and PB2 have been sequenced, 338 of 1533 have lost the 154-to-156 glycosylation sequon and have E627K in PB2. These viruses have been collected in at least 28 countries in Europe, the Middle East, Africa, and Asia.

Fig. 1

(A to L) Phylogenetic trees of the A/H5N1 HA1 nucleotide sequences. The sequences are split into three trees: 2022 avian H5 sequences from East and Southeast (E and SE) Asia (top row); 1097 avian H5 sequences from Europe, the Middle East, and Africa (middle row); and 385 human H5 sequences (bottom row). Each sequence is color coded by the minimum number of nucleotide mutations required to obtain the four amino acid substitutions in HA in the Herfst et al. set (column 1), to obtain the four amino acid substitutions in the Imai et al. set (column 2), to disrupt the N-linked glycosylation sequon spanning positions 154 to 156 in HA (column 3), and to obtain E627K in the PB2 segment of the corresponding virus in these HA trees (column 4). In columns 1 and 2, blue indicates five nucleotide changes, green indicates four, and orange indicates three. In columns 3 and 4, yellow viruses require one mutation, and pink require zero mutations. Gray indicates PB2 not sequenced. Clades as defined by (35) are marked to the right of the branches; the red portion of the vertical clade-identification lines indicates strains sampled in 2010 or 2011. The viruses indicated by black arrows are two nucleotides from the Imai et al. set. The virus indicated in (A) by the orange arrow has the H103Y substitution, and the virus indicated in (B) by the orange arrow has the T315I substitution. The blue circle indicates A/Indonesia/5/2005, and the red circle indicates A/Vietnam/1203/2004, the starting viruses used by (1) and (2), respectively. The initial trees were constructed with PhyML version 3.0 (36), with A/Chicken/Scotland/1959 as the root, using GTR+I+Γ4 [determined by ModelTest (37)] as the evolutionary model. GARLI version 0.96 (38) was run on the best tree from PhyML for 1 million generations to optimize tree topology and branch lengths. “Zoom-able” versions of these trees are shown in fig. S1 to show detail

The HA glycosylation sequon substitutions, N154D and T156A, have drifted in and out of the avian virus population over time, suggesting that they may be under little selective pressure in birds. The other substitutions—which are rare in birds, particularly those that change the sialic acid linkage preference—are likely to be negatively selected in birds.

Phylogenetic trees of the A/H5N1 HA are shown in Fig. 1, color-coded by the number of nucleotide mutations required to obtain the five Herfst et al. set (column 1) and four Imai et al. set (column 2) of substitutions in HA. Obtaining these mutations does not necessarily mean the virus will be transmissible through respiratory droplets between ferrets because the genetic background of each strain is different from the strain used by Herfst et al. (Fig. 1A, blue circle) and the strain used by Imai et al. (Fig. 1J, red circle). Other than for clade, the variation in color in Fig. 1, columns 1 and 2, is due to the presence (mostly in East and Southeast Asia) or absence (mostly outside of East and Southeast Asia) of the glycosylation sequon at positions 154 to 156.

The sequenced viruses that are closest to the Herfst et al. set are in clade (Fig. 1A and fig. S1A). These HAs have acquired a silent nucleotide mutation that makes the amino acid substitution G224S require only one nucleotide mutation instead of the two mutations for other strains. It is the requirement of these two nucleotide mutations that makes viruses usually farther from the Herfst et al. set than the Imai et al. set. The viruses in clade have been sampled in Nepal, Mongolia, Japan, and Korea from 2009 to 2011. Seventeen out of 94 of these viruses have been sequenced in PB2 (Fig. 1D), and none have the E627K substitution. Thus, the closest known viruses to the Herfst et al. set by consensus sequencing are four nucleotide substitutions away.

The majority of H5 viruses in clade 2.2 (and its subclades) are three nucleotide mutations from the Imai et al. set in HA (Fig. 1, F and J). These viruses have been sampled in Europe from 2005 to 2007, in the Middle East (including Egypt) from 2005 to 2011, and in Africa from 2005 to 2007. Viruses sampled in 2010 and 2011 are indicated by the red portion of the vertical line delimiting the clade (Fig. 1 and by the time series in fig. S1, F and J). If it is the loss of glycosylation that is important, rather than any other effect of N154D, then as shown in Fig. 1, column 3, almost all the non-Asian viruses have lost the glycosylation sequon, and thus all these viruses would potentially be functionally three nucleotides from the Imai et al. set in HA.

The viruses indicated by the black arrows in Fig. 1, B and F (one from Vietnam in 2007 and one from Egypt in 2010), have the N220K receptor-binding substitution and have lost the glycosylation sequon at positions 154 to 156. Thus, these two viruses are two nucleotide substitutions from the Imai et al. set in HA, and are the viruses closest to having the full Imai et al. set in HA detected to date by means of consensus sequencing.

Surveillance has detected humans infected with A/H5N1 viruses that are four nucleotide mutations from the full Herfst et al. set and three from the Imai et al. set in HA. Viruses isolated from human A/H5N1 infections (Fig. 1, bottom row) are generally the same number of mutations in HA away from the Herfst et al. and Imai et al. sets, by means of consensus sequencing, as their most closely related avian viruses. The within-host evolution modeling below indicates that any host-adaptation substitutions would only reach a small proportion of the total virus population in the first spillover host and, although potentially critical in the host-adaptation process, would not be detected with consensus sequencing. Thus, the absence of evidence of host adaptation through consensus sequencing is not evidence for the absence of potentially critical adaptation to the mammalian host. See (6) for details of human strains and their most genetically similar avian A/H5N1 viruses.

To explore the probability of accumulating the remaining nucleotide mutations after the avian virus has been transmitted to a human (or other mammalian host), we constructed a mathematical model (7–10) of the within-host evolutionary dynamics of the virus. In the model, errors made by the virus polymerase are the source of mutation (10^-5 mutations per site, per genome replication), the initial virus population expands exponentially [each infected cell produces 10^4 virions (11, 12), and 10^10 cells can be infected (13)] until it reaches 10^14 virions, after which the virus population size stays roughly constant, and selection is modeled by use of differences in expected numbers of progeny (fig. S2 and table S2) (6). The results of the model are largely insensitive to the number of cells that can be infected, maximum virus population size, and whether the virus population remains roughly constant or declines (figs. S3 to S5). Typical infections were simulated out to 5 days corresponding to the approximate time of peak viral load, and long-duration infections to 14 days (14).
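The neutral (no-selection) part of this description can be illustrated with a few lines of arithmetic. The sketch below is not the authors' model: it assumes that sites mutate independently, that there is no back-mutation or selection, and that every genome in the population has the same number of replications in its ancestry. The function name and the choice of 10 replications are hypothetical; only the error rate of 10^-5 per site per genome replication comes from the text.

```python
def neutral_mutant_fraction(required: int, replications: int, mu: float = 1e-5) -> float:
    """Expected fraction of genomes carrying all `required` specific mutations.

    Simplifying assumptions (not from the source): independent sites,
    no back-mutation, no selection, and the same number of genome
    replications in the ancestry of every virion.
    """
    # Probability that one specific site has mutated at least once
    # somewhere along a lineage of `replications` copying events.
    per_site = 1.0 - (1.0 - mu) ** replications
    # All required sites must be mutated in the same genome.
    return per_site ** required


if __name__ == "__main__":
    # 10 replications is an arbitrary illustrative value.
    for k in (1, 2, 3, 4, 5):
        print(k, neutral_mutant_fraction(k, replications=10))
```

Under these assumptions each additional required mutation lowers the expected proportion by roughly the per-site probability, which is why the number of mutations still needed dominates the outcome in the neutral case.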

It is not possible to calculate the level of risk precisely because of uncertainties in some aspects of the biology. We used the model to compare the relative effects of factors that could increase or decrease the probability of accumulating mutations and to identify areas for further investigation that are critical for more accurate risk assessment. We compare and contrast the effects of factors that can increase the probability of accumulating mutations and thus evolving a respiratory droplet–transmissible A/H5N1 influenza virus in a mammalian host, and factors that could decrease the probability of evolving such a virus. The factors we considered that can increase the probability are random mutation, positive selection, long infection, alternate functionally equivalent substitutions, and transmission of partially adapted viruses as a proportion of the within-host diversity both in the avian-mammal and the mammal-mammal transmission events (10, 14–18). The factors we considered that can decrease the probability are an effective immune response, deleterious substitutions, and order-dependence in the acquisition of substitutions. We considered these factors for starting viruses differing in the number of mutations that separate them from a respiratory droplet–transmissible A/H5N1 virus: that is, viruses that require five, four, three, two, or one mutations at specific positions in the virus HA, reflecting that zero, one, two, three, or four of the mutations are already present in the avian population and thus are present at the start of the infection in mammals. We treat each amino acid substitution as if it can be acquired by a single-nucleotide mutation, as is the case for the circulating viruses closest to acquiring the Herfst et al. or Imai et al. sets [see (6) for the general case].

First, we considered random mutation. Even without any positive selection pressure, the random process of mutations introduced by the virus polymerase in the expanding population of viruses will on average produce viruses that contain the required single, double, or triple mutations and even some quadruple mutants. These mutants will arise after a few days of an infection in a host in which the virus replicates efficiently (Fig. 2A) and would be delayed if replication is impaired (fig. S5). However, the existence of a virus within-host does not mean that it will transmit because it might exist only as a small proportion of the total virus population and thus have little chance of being excreted (Fig. 2B). The minimum proportion of mutant virus required to make transmission likely is not known, but increased proportion translates into increased probability of transmission; thus, we focused on proportion of mutant virus in the total virus population. These proportions (equivalent to the probability of a single virion to be a mutant), both here and below, cannot yet be precisely determined—they are sensitive to some biological parameters that are not yet known accurately and some that are specific to a particular virus or mutant. For such parameters, we tested a range of the current best estimates and focused on the relative, rather than the absolute, effects (6).

Fig. 2

Expected absolute numbers and proportions of respiratory droplet–transmissible A/H5N1 virions within a host initially infected by strains that require five (blue), four (green), three (orange), two (red), or one (purple) mutation (or mutations) to become respiratory droplet–transmissible, calculated from the deterministic model. (A) The absolute number of respiratory droplet–transmissible A/H5N1 viruses in a host. The intersections with the gray line indicate the point when at least one virus in each host is expected to have the required mutations. The change in slope is due to the transition in the virus population from exponential expansion to constant size. (B) Expected proportion of respiratory droplet–transmissible A/H5N1 viruses in the total virus population over time in the random mutation case (when all mutations are fitness-neutral).

Second, we considered positive selection. Some of the substitutions identified by (1) and (2) have been shown to increase within-host virus fitness, specifically the loss of glycosylation at positions 154 and 156 and E627K in PB2. However, given the absence of specific information on the within-host selective advantage or disadvantage conferred by each substitution, or combination of substitutions, we considered two cases of positive selection: one in which each individual substitution confers an additive advantage (hill-climb) and one in which only viruses that have acquired all substitutions have an advantage (all-or-nothing). We considered a total advantage of 1.1-, 2-, or 10-fold in each genome replication step for the full set of respiratory droplet transmission–enabling substitutions (table S2 and fig. S6). (A twofold advantage at each genome replication step translates into an approximately 100-fold increase in mutant virus titer after 36 hours.) In the all-or-nothing scenario, a strong increase in proportion occurs for viruses that have acquired all mutations because of their substantial fitness advantage over the rest of the population. The rate at which all-or-nothing selection increases the proportion of respiratory droplet–transmissible A/H5N1 viruses, as compared with the neutral case, is mostly independent of the number of mutations required (Fig. 3A). In contrast, for hill-climb selection the rate of increase above the neutral case decreases when fewer mutations are required (Fig. 3A). This difference between the all-or-nothing and hill-climb cases arises because the fitness differential from the starting virus to the respiratory droplet–transmissible A/H5N1 virus decreases as the number of needed mutations decreases (if some of the mutations are already present in the avian host) (table S2) (6). We consider this hill-climb case to be the most likely situation during the host-adaptation we modeled (in the absence of deleterious substitutions). However, we have also compared the two selection scenarios when the starting fitnesses of all-or-nothing and hill-climb are the same independent of the starting number of necessary mutations, and discuss the subtle tradeoff between the fitness advantage of, and clonal-interference among, intermediate mutants (figs. S7 and S8) (6, 19).
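The parenthetical about a twofold per-step advantage giving roughly a 100-fold increase after 36 hours is a compounding calculation, and it can be checked directly. The sketch below is only arithmetic on the numbers quoted in the text; the function name is hypothetical, and the implied duration of one genome replication step is an inference from those two numbers, not a value stated in the source.

```python
from math import log


def steps_for_fold(total_fold: float, per_step: float) -> float:
    """Number of replication steps needed for a per-step relative
    advantage `per_step` to compound into `total_fold` overall."""
    return log(total_fold) / log(per_step)


if __name__ == "__main__":
    # Twofold per step compounding to ~100-fold, as quoted in the text.
    steps = steps_for_fold(100.0, 2.0)
    print(f"steps needed: {steps:.2f}")
    # Inferred (not stated in the source): step length if this takes 36 hours.
    print(f"implied hours per step: {36.0 / steps:.1f}")
    # The same number of steps under the other advantages considered.
    for advantage in (1.1, 2.0, 10.0):
        print(f"{advantage}-fold per step -> {advantage ** steps:.3g}-fold overall")
```

Because the advantage compounds multiplicatively, even the smallest advantage considered grows steadily with infection duration, which connects this factor to the long-infection case.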

Fig. 3

Factors that increase or decrease the proportion of respiratory droplet–transmissible A/H5N1 virus based on starting viruses that require five (blue), four (green), three (orange), two (red), or one (purple) mutation (or mutations) to become respiratory droplet–transmissible. (A) The effect of hill-climb and all-or-nothing positive selection compared with random mutation alone. (B) The effect of avian–mammal transmission of partially adapted virus as a result of intra-host diversity (100 viruses start the infection, one of which has a mutation) and the effect of alternate substitutions with 10 functionally equivalent sites for a virus requiring five mutations (blue), nine sites for a virus requiring four mutations (green), eight sites for a virus requiring three mutations (orange), seven sites for a virus requiring two mutations (red), and six sites for a virus requiring one mutation (purple), both with hill-climb selection, compared with hill-climb selection alone. (C) The effect of two of the required substitutions being individually deleterious (for these two specific substitutions, either substitution alone reduces the replicative fitness of the virus to zero) and the effect of complete order dependence of acquiring substitutions, both with hill-climb selection as compared with hill-climb selection alone.

Third, we considered long infection. Because both random mutation and positive selection increase the expected proportion of mutated virions with every viral generation, the longer a host is infected, the greater the proportion of a particular mutant (Fig. 4) (15). Human A/H5N1 infections lasting 14 days or longer have been reported especially in children, the elderly, and the immunocompromised (14) and have been associated with the evolution of oseltamivir resistance (20). It might be that only immunocompromised individuals can typically transmit the virus late in a long infection. The increasing proportion of mutant virus is only dependent on continued virus production and is independent of whether the virus load stays constant or declines (fig. S4) (21). The variance in the proportion of mutant virus (Fig. 4, pale regions) increases with each additional mutation required because of the increased number of combinatorial options and the greater selective advantage of mutant viruses as compared with wild-type viruses in the hill-climb scenario. The pale regions only reflect the within-model variance in results, as indicated by the different runs of the stochastic model, and not uncertainty as a result of other factors; sensitivity of the outcomes for model parameters such as the error rate and the number of virions produced by each infected cell are explored in (6) (fig. S5).

Fig. 4

Proportion of respiratory droplet–transmissible A/H5N1 virus in a long infection with virus replication for 14 days in the presence of hill-climb selection. Bold lines show results from a probability-based deterministic model of virus mutation, the pale region (composed of lines) shows 10,000 stochastic model simulations for each starting virus. Starting viruses require either five (blue), four (green), or three (orange) mutations to become respiratory droplet–transmissible. For the stochastic simulations, the lines start when the first virion that has the required mutations appears.

Fourth, we considered functionally equivalent substitutions. The sets of substitutions required for a respiratory droplet–transmissible A/H5N1 virus identified by (1) and (2) are unlikely to be the only combinations of substitutions capable of producing a respiratory droplet–transmissible A/H5N1 virus. If particular biological traits could be achieved by other substitutions, this would increase the expected proportions of respiratory droplet–transmissible A/H5N1 viruses. This is likely to be the case, given that there are multiple substitutions that can cause changes in receptor-binding specificity and two sites where substitutions will result in loss of glycosylation: positions 154 and 156 (table S3). If five substitutions could be from any 10 specific positions in the virus genome (or if two already existed in nature, three from any eight), then there would be 252 (or 56) combinations, and this would raise the proportion of respiratory droplet–transmissible A/H5N1 virus within a host by a factor of ~10^2.5 (or ~10^1.5) above the case of positive selection alone after 5 days (Fig. 3B, figs. S9 and S10, and table S4).
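The counts of 252 and 56 are binomial coefficients ("10 choose 5" and "8 choose 3"). A minimal check follows, with a hypothetical helper name. It only counts combinations, so its output should be read as the same order of magnitude as the modeled increases quoted above, not as a reproduction of them, since those come from the full model with selection.

```python
from math import comb, log10


def equivalent_sets(positions: int, required: int) -> int:
    """Number of distinct ways to pick `required` substitutions from
    `positions` functionally equivalent candidate positions."""
    return comb(positions, required)


if __name__ == "__main__":
    for positions, required in ((10, 5), (8, 3)):
        n = equivalent_sets(positions, required)
        print(f"{required} of {positions}: {n} combinations, log10 = {log10(n):.2f}")
```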

Fifth, we considered the avian-to-mammal transmission of partially adapted mutants. We specifically considered the case in which one of the required mutations exists as a small proportion of the avian within-host viral population, or in the viral populations from the >20 mammalian hosts in which A/H5N1 infections have been observed (22–25), so that they would not be detected by the usual consensus sequencing techniques. If the mutant is one of the 100 virions that seed an infection (16, 17), then with positive selection the probability of acquiring the remaining mutations increases by a factor of 10^3 after 5 days of infection above the case of positive selection alone (Fig. 3B). If the proportion of mutants in the seeding population is 10^-4, however, the increase in proportion of respiratory droplet–transmissible A/H5N1 virions in the mammalian host is small (fig. S11).

Sixth, we considered mammal-to-mammal transmission of partially adapted viruses. Transmission of viruses between mammals that have some but not all of the substitutions necessary for respiratory droplet transmission potentially increases the risk of evolving a respiratory droplet–transmissible A/H5N1 virus, but this increase is modulated by the difficulty of transmitting partially adapted strains and the loss of diversity at transmission. Two primary factors strongly modulate the effect of transmission on the accumulation of mutations. First, transmission could decrease the accumulation of mutations by the loss of low-proportion mutants because only a limited portion of the virus population will be transmitted. Second, transmission could increase the accumulation of respiratory droplet transmission–enabling substitutions by concentrating a transmissible virus during excretion from or seeding into a host—for example, if the adapted virus has increased tropism for the mammalian upper respiratory tract and therefore concentrated in the nose and throat. Thus, the effect of transmission can range from negligible, if mutants are culled by the loss of diversity at transmission, to substantial, if selection favors mutants at transmission (table S5). Given that A/H5N1 virus infections have been observed in >20 mammalian species, there is a potentially large pool of nonhuman hosts in which short chains of transmission could play a role in the emergence of respiratory droplet–transmissible A/H5N1 viruses.

In contrast to these factors that could increase the rate of accumulating substitutions, we next discuss factors that could decrease this rate.

First, we considered an effective immune response. An immune response that substantially shortened an infection would decrease the probability of the accumulation of mutations; however, there are many reported cases of infections up to and beyond 5 days (14, 21). Variation in the number of virions produced by each infected cell does not affect the deterministic calculations of the proportion of mutants. However, if this number is substantially lower for the stochastic simulations—for example, 25 (6) as compared with 10,000 (used for most of the figures)—the slower growth and lower total number of viruses could substantially delay the appearance of mutants within a host. As the number of required mutations increases, stochastic effects caused by the slower growth decrease the proportion of these mutants (fig. S5) (6).

Second, we considered deleterious intermediate substitutions. The receptor binding and trimer-interface or stalk substitutions required by (1) and (2) are, as we have seen, either rare or absent in influenza viruses isolated to date. The receptor-binding substitutions, although deleterious in birds, would be expected to be advantageous in humans. However, the details of this host-adaptation are not yet elucidated, and so we also consider the possibility that there are deleterious intermediate substitutions and explore a variety of scenarios (figs. S12 and S13). When two of the required substitutions are individually deleterious (for these two specific substitutions, either substitution alone reduces the replicative fitness of the virus to zero), this slows the rate of accumulation of mutations for the three-mutation case by less than the amount that hill-climb positive selection increases the rate above the neutral case (Fig. 3C). When three substitutions are required (all single and double substitutions reduce the replicative fitness of a mutant virus to zero), this can lower the accumulation rate by a factor of ~10^2 below the neutral case (fig. S12). Deleterious (or advantageous) substitutions other than the respiratory droplet–transmissible A/H5N1 substitutions can, to a first approximation, be ignored in calculating proportions because such substitutions would on average affect all viruses equally and thus would not specifically affect the accumulation of respiratory droplet–transmissible A/H5N1 mutations (6).

Third, we considered order dependence in the acquisition of substitutions. It is not currently known whether the acquisition of some or all of the respiratory droplet transmission–enabling substitutions is dependent on the order in which viruses accumulate those substitutions. For example, the gain of 2-6-receptor binding might be required before loss of 2-3-receptor binding. If there were any order dependence, it would slow down the rate of accumulation of mutations. However, even in the most extreme scenario in which there is a single specific order in which the substitutions must be acquired, and any other order results in a virion with a replicative fitness of zero, if fewer than four mutations are required, the effect on the rate of accumulation of mutations is less than that of the deleterious scenario described above (Fig. 3C and figs. S14 and S15).

In addition to the substitutions in HA, the Imai et al. virus was a reassortant with an A/H1pdm09 virus. The probability of a reassortment event is difficult to determine given current knowledge. One study (26) estimated it to be more likely than the acquisition of a single mutation as calculated here.

Highly pathogenic avian A/H5N1 viruses have been infecting humans for over a decade, with ~600 reported cases to date (and possibly many more that have not been reported), but there have yet to be known cases of efficient human-to-human transmission (27, 28). One hypothesis for the lack of sustained transmission is that it is not possible for A/H5N1 viruses to become respiratory droplet–transmissible in mammals; (1) and (2) have shown that this may not be the case in ferrets. Another hypothesis is that the number of mutations necessary for respiratory droplet–transmissibility might be so great that such a virus would be unlikely to evolve. We show here that in biologically plausible scenarios, respiratory droplet–transmissible A/H5N1 viruses can evolve during a mammalian infection. Given that respiratory droplet transmission between mammals is possible and that respiratory droplet–transmissible A/H5N1 mutants are likely to evolve in infected individuals, the primary impediment to transmission could be whether the respiratory droplet–transmissible A/H5N1 viruses comprise a sufficient proportion of the within-host viral population to actually transmit.

The minimum proportion of virus required for transmission is not known, but increased proportion likely translates into increased probability of transmission. There cannot be respiratory droplet transmission if there are no viruses in the air. Given a peak excretion rate of ~10^7 viruses per day (29, 30), a proportion of which are likely to become aerosolized (31), mutants at proportions near or above 10^-7 might thus be among the particles excreted. Each of the factors analyzed above has a potentially substantial effect on the rate of accumulating mutations (Fig. 3), and the effects of each can be additive. With plausible combinations of these factors, a virus that requires three mutations reaches proportions at which a few respiratory droplet–transmissible A/H5N1 viruses are likely to be among the particles excreted. For a virus that requires five mutations, it may only reach such proportions with more extreme combinations of factors or if an event occurs that is not encompassed by the model (32). However, it is known that influenza viruses are capable of respiratory droplet transmission in animal models at low infectious doses (33), and that transmission routes other than in respiratory droplets could be important; thus, despite the three key current unknowns about transmission (6), even low numbers of excreted respiratory droplet–transmissible A/H5N1 virus may be relevant for emergence. In addition, the probability of emergence increases when more mammals are infected, if this also corresponds with a rise in potential transmission events. The output of the model is a guide to understand the approximate effects of different factors and should not be interpreted as actual proportions of virus and probabilities of transmission, given the uncertainty inherent in parameter estimates and model structure, and the inherent unpredictability of rare events (34).
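The link between an excretion rate of ~10^7 viruses per day and a threshold proportion near 10^-7 is that their product is about one expected mutant virion per day. The sketch below makes that arithmetic explicit. The Poisson approximation for "at least one mutant among the excreted particles" is an assumption added here for illustration and does not appear in the source; the function names are hypothetical. As the text itself cautions, the result says nothing about the probability of transmission.

```python
from math import expm1


def expected_mutants(excreted: float, proportion: float) -> float:
    """Expected number of mutant virions among `excreted` particles."""
    return excreted * proportion


def prob_at_least_one(excreted: float, proportion: float) -> float:
    """Probability that at least one excreted particle is a mutant,
    under a Poisson approximation (an assumption, not from the source)."""
    # 1 - exp(-lambda), computed with expm1 for accuracy at small lambda.
    return -expm1(-expected_mutants(excreted, proportion))


if __name__ == "__main__":
    excreted_per_day = 1e7  # peak excretion rate quoted in the text
    for proportion in (1e-9, 1e-8, 1e-7, 1e-6):
        print(proportion,
              expected_mutants(excreted_per_day, proportion),
              prob_at_least_one(excreted_per_day, proportion))
```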

These results highlight four areas of investigation that are critical to more accurately assess and monitor the risk of a respiratory droplet–transmissible A/H5N1 virus emerging and to increase our understanding of virus emergence in general. Some of this work is already ongoing, planned, or suggested. The work of Herfst et al. (1) and Imai et al. (2) and the analyses here help to prioritize particular areas.

First, additional surveillance in higher-risk regions where viruses require fewer nucleotide mutations to acquire respiratory droplet transmission–enabling substitutions (Fig. 1 and fig. S1) (and in regions connected by travel, trade, and migratory flyways) is key for monitoring the emergence of a respiratory droplet–transmissible A/H5N1 virus. Surveillance of nonhuman mammalian hosts, especially any that harbor long infections or live in large groups, is important for the early identification of mammalian adaptation. Additionally, studies are needed on the accumulation of mutations within-host and in short chains of transmission in mammals (22–25), even when endemic circulation has not been observed.

Second, deep sequencing of avian and other nonhuman virus samples is necessary to accurately estimate the prevalence of the respiratory droplet transmission–enabling amino acid substitutions in nature. Deep sequencing of human samples, particularly at multiple time points from individuals with long infections, would be useful for evaluating within-host evolution, for estimating the selective advantage of substitutions, and for testing the underlying dynamics and assumptions of the model (15). Respiratory droplet–transmissible A/H5N1 mutations present at a proportion higher than the polymerase error rate (exceeding approximately 10^-5, but far below the threshold for detection with consensus sequencing and thus not detectable with current surveillance practices) would increase the risk of a respiratory droplet–transmissible A/H5N1 virus evolving. Thus, sequencing deeper than that currently routinely achieved for RNA viruses (ideally detecting mutations at 0.1% frequency and lower for detailed studies) is necessary to more accurately assess the risk posed by intra-host variability (15).
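To give a sense of what detecting a variant at 0.1% frequency demands of a surveillance pipeline, the following sketch estimates the read depth needed at a single site. It is a simplified illustration under stated assumptions, not a method from the paper: reads are treated as independent binomial draws, sequencing and amplification errors are ignored, and the function names and the `min_reads` and `target` parameters are hypothetical.

```python
import math

def prob_detect(depth, freq, min_reads=1):
    """P(at least `min_reads` variant reads) when reads ~ Binomial(depth, freq).

    Ignores sequencing error, so real requirements will be higher.
    """
    p_fewer = sum(
        math.comb(depth, k) * freq**k * (1.0 - freq) ** (depth - k)
        for k in range(min_reads)
    )
    return 1.0 - p_fewer

def depth_for_detection(freq, target=0.95, min_reads=1):
    """Smallest read depth at which the detection probability reaches `target`.

    Uses a simple linear search, which is adequate for the frequencies shown here.
    """
    depth = min_reads
    while prob_detect(depth, freq, min_reads) < target:
        depth += 1
    return depth

if __name__ == "__main__":
    for freq in (1e-2, 1e-3):
        for min_reads in (1, 5):
            d = depth_for_detection(freq, min_reads=min_reads)
            print(f"freq={freq:.0e}  min_reads={min_reads}  depth needed={d}")
```

Requiring several supporting reads rather than one is a common guard against calling sequencing errors as variants, and it raises the required depth substantially. Consensus sequencing, by contrast, reports only the majority base at each site.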

Third, experiments are needed to determine which substitutions, besides the receptor-binding substitutions already identified by (1) and (2), are capable of producing respiratory droplet–transmissible A/H5N1 viruses. This includes the important case of functionally equivalent substitutions or alternative sets of substitutions that would require fewer nucleotide mutations than the Herfst et al. or Imai et al. sets. This work will be important for calculating risk and for monitoring in surveillance.

Fourth, further studies are needed to elucidate the changes in within-host fitness and between-host transmissibility associated with each respiratory droplet transmission–enabling substitution and combination of substitutions. These studies are necessary for determining the dynamics of within-host selection [including data on, and modeling of, the effects of glycan heterogeneity between the upper and lower respiratory tract (6)] and the potential for transmission of partially adapted viruses. It is important to determine the strength of selection at transmission because it can increase the proportion of respiratory droplet transmission–enabling substitutions. Further work is needed to refine the estimate for virus excretion and the minimum human infectious dose (29).

Numerous avian A/H5N1 viruses have been sampled in the past 2 years that are four nucleotide mutations from acquiring the Herfst et al. set of HA and PB2 substitutions and three nucleotide mutations from acquiring the Imai et al. set in HA (the Imai et al. set also requires a reassortment event). The probability of evolving the remaining mutations needed for the virus to become a respiratory droplet–transmissible A/H5N1 virus cannot be calculated precisely at this time because of gaps in knowledge of the factors described above. However, the analyses here, using current best estimates, indicate that the remaining mutations could evolve within a single mammalian host, making the possibility of a respiratory droplet–transmissible A/H5N1 virus evolving in nature a potentially serious threat.

References and Notes


1. S. Herfst et al., Science 336, 1534 (2012).
2. M. Imai et al., Nature, published online 2 May 2012; 10.1038/nature10831.
3. A. Vines et al., The role of influenza A virus hemagglutinin residues 226 and 228 in receptor specificity and host range restriction. J. Virol. 72, 7626 (1998).
4. S. Chutinimitkul et al., In vitro assessment of attachment pattern and replication efficiency of H5N1 influenza A viruses with altered receptor specificity. J. Virol. 84, 6825 (2010).
5. M. Hatta et al., Growth of H5N1 Influenza A Viruses in the Upper Respiratory Tracts of Mice. PLoS Pathog. 3, e133 (2007).
6. Materials and methods are available as supplementary materials on Science Online.
7. S. E. Luria, M. Delbrück, Mutations of bacteria from virus sensitivity to virus resistance. Genetics 28, 491 (1943).
8. C. J. Mode, T. Raj, C. K. Sleeman, Simulating the emergence and survival of mutations using a self regulating multitype branching processes. J. Probab. Stat. 2011, 1 (2011).
9. J. M. Coffin, HIV population dynamics in vivo: Implications for genetic variation, pathogenesis, and therapy. Science 267, 483 (1995).
10. A. S. Perelson, L. Rong, F. G. Hayden, J. Infect. Dis., published online 23 March 2012; 10.1093/infdis/jis265.
11. Y. Sidorenko, U. Reichl, Structured model of influenza virus replication in MDCK cells. Biotechnol. Bioeng. 88, 1 (2004).
12. L. Möhler, D. Flockerzi, H. Sann, U. Reichl, Mathematical model of influenza A virus production in large-scale microcarrier culture. Biotechnol. Bioeng. 90, 46 (2005).
13. E. R. Weibel, Morphometry of the human lung: The state of the art after two decades. Bull. Eur. Physiopathol. Respir. 15, 999 (1979).
14. Writing Committee WHO, Update on Avian Influenza A (H5N1) virus infection in humans. N. Engl. J. Med. 358, 261 (2008).
15. P. R. Murcia et al., Intra- and interhost evolutionary dynamics of equine influenza virus. J. Virol. 84, 6943 (2010).
16. T. Kuiken et al., Host species barriers to influenza virus infections. Science 312, 394 (2006).
17. S. Bonhoeffer, M. A. Nowak, Pre-existence and emergence of drug resistance in HIV-1 infection. Proc. Biol. Sci. 264, 631 (1997).
18. S. Wain-Hobson, The fastest genome evolution ever described: HIV variation in situ. Curr. Opin. Genet. Dev. 3, 878 (1993).
19. W. G. Hill, A. Robertson, The effect of linkage on limits to artificial selection. Genet. Res. 8, 269 (1966).
20. A. Antón et al., Selection and viral load kinetics of an oseltamivir-resistant pandemic influenza A (H1N1) virus in an immunocompromised patient during treatment with neuraminidase inhibitors. Diagn. Microbiol. Infect. Dis. 68, 214 (2010).
21. M. D. de Jong et al., Fatal outcome of human influenza A (H5N1) is associated with high viral load and hypercytokinemia. Nat. Med. 12, 1203 (2006).
22. C. A. Nidom et al., Influenza A (H5N1) viruses from pigs, Indonesia. Emerg. Infect. Dis. 16, 1515 (2010).
23. J. Keawcharoen et al., Avian influenza H5N1 in tigers and leopards. Emerg. Infect. Dis. 10, 2189 (2004).
24. X. Qi et al., Molecular characterization of highly pathogenic H5N1 avian influenza A viruses isolated from raccoon dogs in China. PLoS ONE 4, e4682 (2009).
25. L. Reperant et al., Rev. Sci. Tech. 1, 137 (2009).
26. N. M. Ferguson, C. Fraser, C. A. Donnelly, A. C. Ghani, R. M. Anderson, Public health risk from the avian H5N1 influenza epidemic. Science 304, 968 (2004).
27. T. Y. Aditama et al., Avian influenza H5N1 transmission in households, Indonesia. PLoS ONE 7, e29971 (2012).
28. Y. Yang, M. E. Halloran, J. D. Sugimoto, I. M. Longini Jr., Detecting human-to-human transmission of avian influenza A (H5N1). Emerg. Infect. Dis. 13, 1348 (2007).
29. M. P. Atkinson, L. M. Wein, Quantifying the routes of transmission for pandemic influenza. Bull. Math. Biol. 70, 820 (2008).
30. P. Fabian et al., Influenza virus in human exhaled breath: An observational study. PLoS ONE 3, e2691 (2008).
31. R. Tellier, Aerosol transmission of influenza A virus: A review of new studies. J. R. Soc. Interface 6 (suppl. 6), S783 (2008).
32. T. Ord, R. Hillerbrand, A. Sandberg, Probing the improbable: Methodological challenges for risks with low probabilities and high stakes. J. Risk Res. 13, 191 (2010).
33. J. A. Lednicky et al., Ferrets develop fatal influenza after inhaling small particle aerosols of highly pathogenic avian influenza virus A/Vietnam/1203/2004 (H5N1). Virol. J. 7, 231 (2010).
34. D. J. Spiegelhalter, H. Riesch, Philos. Trans. R. Soc. London Ser. A 269, 4730 (2011).
35. J. Bahl et al., Continued evolution of highly pathogenic avian influenza A (H5N1): Updated nomenclature. Influenza Other Respir. Viruses 6, 1 (2012).
36. S. Guindon et al., New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 59, 307 (2010).
37. D. Posada, K. A. Crandall, MODELTEST: Testing the model of DNA substitution. Bioinformatics 14, 817 (1998).
38. D. J. Zwickl, thesis, University of Texas (2006).
39. J. Steel, A. C. Lowen, S. Mubareka, P. Palese, Transmission of influenza virus in a mammalian host is increased by PB2 amino acids 627K or 627E/701N. PLoS Pathog. 5, e1000252 (2009).
40. Z. Li et al., Molecular basis of replication of duck H5N1 influenza viruses in a mammalian mouse model. J. Virol. 79, 12058 (2005).
41. E. J. Schrauwen et al., The multibasic cleavage site in H5N1 virus is critical for systemic spread along the olfactory and hematogenous routes in ferrets. J. Virol. 86, 3975 (2012).
42. T. Kuiken, J. K. Taubenberger, Pathology of human influenza revisited. Vaccine 26 (suppl. 4), D59 (2008).
43. Y. Hatta et al., Viral replication rate regulates clinical outcome and CD8 T cell responses during highly pathogenic H5N1 influenza virus infection in mice. PLoS Pathog. 6, e1001139 (2010).
44. E. M. Sorrell et al., Predicting ‘airborne’ influenza viruses: (trans-) mission impossible? Curr. Opin. Virol. 1, 635 (2011).
45. L. A. Loeb, Mutator phenotype may be required for multistage carcinogenesis. Cancer Res. 51, 3075 (1991).
46. K. Shinya et al., Avian flu: influenza virus receptors in the human airway. Nature 440, 435 (2006).
47. D. van Riel et al., H5N1 virus attachment to lower respiratory tract. Science 312, 399 (2006).
48. A. Mehle, J. A. Doudna, Adaptive strategies of the influenza virus polymerase for replication in humans. Proc. Natl. Acad. Sci. U.S.A. 106, 21312 (2009).
49. S. J. Stray, G. M. Air, Apoptosis by influenza viruses correlates with efficiency of viral mRNA synthesis. Virus Res. 77, 3 (2001).
50. P. Baccam, C. Beauchemin, C. A. Macken, F. G. Hayden, A. S. Perelson, Kinetics of influenza A virus infection in humans. J. Virol. 80, 7590 (2006).
51. J. K. Taubenberger et al., Characterization of the 1918 influenza virus polymerase genes. Nature 437, 889 (2005).
52. Y. Gao et al., Identification of amino acids in HA and PB2 critical for the transmission of H5N1 avian influenza viruses in a mammalian host. PLoS Pathog. 5, e1000709 (2009).
53. L. M. Chen et al., In vitro evolution of H5N1 avian influenza virus toward human-type receptor specificity. Virology 422, 105 (2012).
54. P. Auewarakul et al., An avian influenza H5N1 virus that binds to a human-type receptor. J. Virol. 81, 9950 (2007).
55. Y. Watanabe et al., Acquisition of human-type receptor binding specificity by new H5N1 influenza virus sublineages during their emergence in birds in Egypt. PLoS Pathog. 7, e1002068 (2011).
56. Z. Y. Yang et al., Immunization by avian H5 influenza hemagglutinin mutants with altered receptor binding specificity. Science 317, 825 (2007).
57. J. Stevens et al., Structure and receptor specificity of the hemagglutinin from an H5N1 influenza virus. Science 312, 404 (2006).
58. J. Stevens et al., Recent avian H5N1 viruses exhibit increased propensity for acquiring human receptor specificity. J. Mol. Biol. 381, 1382 (2008).
59. S. Yamada et al., Haemagglutinin mutations responsible for the binding of H5N1 influenza A viruses to human-type receptors. Nature 444, 378 (2006).
60. Y. Iwasa, F. Michor, M. A. Nowak, Stochastic tunnels in evolutionary dynamics. Genetics 166, 1571 (2004).
61. D. B. Weissman, M. M. Desai, D. S. Fisher, M. W. Feldman, The rate at which asexual populations cross fitness valleys. Theor. Popul. Biol. 75, 286 (2009).
62. D. B. Weissman, M. W. Feldman, D. S. Fisher, The rate of fitness-valley crossing in sexual populations. Genetics 186, 1389 (2010).
63. R. Durrett, D. Schmidt, Waiting for two mutations: With applications to regulatory sequence evolution and the limits of Darwinian evolution. Genetics 180, 1501 (2008).
64. R. Durrett, D. Schmidt, J. Schweinsberg, A waiting time problem arising from the study of multi-stage carcinogenesis. Ann. Appl. Probab. 19, 676 (2009).
65. M. Lynch, Scaling expectations for the time to establishment of complex adaptations. Proc. Natl. Acad. Sci. U.S.A. 107, 16577 (2009).
66. N. L. Komarova, A. Sengupta, M. A. Nowak, Mutation-selection networks of cancer initiation: tumor suppressor genes and chromosomal instability. J. Theor. Biol. 223, 433 (2003).


Acknowledgments: C.A.R. was supported by a University Research Fellowship from the Royal Society. The authors acknowledge a Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO) VICI grant, European Union (EU) FP7 programs EMPERIE (223498) and ANTIGONE (278976), Human Frontier Science Program (HFSP) program grant P0050/2008, Wellcome 087982AIA, the Bill and Melinda Gates Foundation (OPPGH5383), and NIH Director’s Pioneer Award DP1-OD000490-01. R.A.M.F. was supported by National Institute of Allergy and Infectious Diseases (NIAID)–NIH contract HHSN266200700010C. A.E.X.B. was supported by a long-term fellowship from the HFSP. E.M., G.N., and Y.K. are supported by the Bill and Melinda Gates Foundation (OPPGH5383) and NIAID-NIH grant R01 AI 069274; in addition, Y.K. was supported by a Grant-in-Aid for Specially Promoted Research from the Ministry of Education, Culture, Sports, Science, and Technology of Japan and by ERATO. Y.K. and G.N. have a financial interest as founders of FluGen and hold a patent on influenza virus reverse genetics. Y.K. and G.N. have a paid consulting relationship with Theraclone; Y.K. also has a paid consulting relationship with Crucell. Y.K. has received speaker’s honoraria from Chugai Pharmaceuticals, Novartis, Daiichi-Sankyo Pharmaceutical, Toyama Chemical, Wyeth, GlaxoSmithKline, and Astellas and grant support from Chugai Pharmaceuticals, Daiichi Sankyo Pharmaceutical, Toyama Chemical, and Otsuka Pharmaceutical Company. A.D.M.E.O. (on behalf of Viroclinics Biosciences BV) has advisory affiliations with GlaxoSmithKline, Novartis, and Roche. A.D.M.E.O. is Chief Scientific Officer of Viroclinics Biosciences BV. We thank S. Cornell, E. Ghedin, R. Johnstone, L. Reperant, and D. M. Smith for helpful discussions and the reviewers for their detailed and thoughtful comments.
