Permalink
Switch branches/tags
Nothing to show
Find file
Fetching contributors…
Cannot retrieve contributors at this time
2 lines (1 sloc) 163 KB
<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.1d1 20130915//EN" "JATS-archivearticle1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.1d1"><front><journal-meta><journal-id journal-id-type="nlm-ta">elife</journal-id><journal-id journal-id-type="hwp">eLife</journal-id><journal-id journal-id-type="publisher-id">eLife</journal-id><journal-title-group><journal-title>eLife</journal-title></journal-title-group><issn publication-format="electronic">2050-084X</issn><publisher><publisher-name>eLife Sciences Publications, Ltd</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">03735</article-id><article-id pub-id-type="doi">10.7554/eLife.03735</article-id><article-categories><subj-group subj-group-type="display-channel"><subject>Research article</subject></subj-group><subj-group subj-group-type="heading"><subject>Biochemistry</subject></subj-group><subj-group subj-group-type="heading"><subject>Genomics and evolutionary biology</subject></subj-group></article-categories><title-group><article-title>Measurement of average decoding rates of the 61 sense codons in vivo</article-title></title-group><contrib-group><contrib contrib-type="author" id="author-15713" equal-contrib="yes"><name><surname>Gardin</surname><given-names>Justin</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="equal-contrib">&#x2020;</xref><xref ref-type="fn" rid="con1"/><xref ref-type="fn" rid="conf1"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-15714" equal-contrib="yes"><name><surname>Yeasmin</surname><given-names>Rukhsana</given-names></name><xref ref-type="aff" rid="aff2">2</xref><xref ref-type="fn" rid="equal-contrib">&#x2020;</xref><xref ref-type="fn" rid="con2"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-3817"><name><surname>Yurovsky</surname><given-names>Alisa</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con3"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-15715"><name><surname>Cai</surname><given-names>Ying</given-names></name><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="fn" rid="con4"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-15716"><name><surname>Skiena</surname><given-names>Steve</given-names></name><xref ref-type="aff" rid="aff2">2</xref><xref ref-type="other" rid="par-2"/><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con5"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" corresp="yes" id="author-15353"><name><surname>Futcher</surname><given-names>Bruce</given-names></name><contrib-id contrib-id-type="orcid">http://orcid.org/0000-0002-1012-9022</contrib-id><xref ref-type="aff" rid="aff1">1</xref><xref ref-type="corresp" rid="cor1">&#x2a;</xref><xref ref-type="other" rid="par-1"/><xref ref-type="fn" rid="con6"/><xref ref-type="fn" rid="conf1"/></contrib><aff id="aff1"><label>1</label><institution content-type="dept">Department of Molecular Genetics and Microbiology</institution>, <institution>Stony Brook University</institution>, <addr-line><named-content content-type="city">Stony Brook</named-content></addr-line>, <country>United States</country></aff><aff id="aff2"><label>2</label><institution content-type="dept">Department of Computer Science</institution>, <institution>Stony Brook University</institution>, <addr-line><named-content content-type="city">Stony Brook</named-content></addr-line>, <country>United States</country></aff></contrib-group><contrib-group content-type="section"><contrib contrib-type="editor"><name><surname>Sonenberg</surname><given-names>Nahum</given-names></name><role>Reviewing editor</role><aff><institution>McGill University</institution>, <country>Canada</country></aff></contrib></contrib-group><author-notes><corresp id="cor1"><label>&#x2a;</label>For correspondence: <email>bfutcher@gmail.com</email></corresp><fn fn-type="con" id="equal-contrib"><label>&#x2020;</label><p>These authors contributed equally to this work</p></fn></author-notes><pub-date publication-format="electronic" date-type="pub"><day>27</day><month>10</month><year>2014</year></pub-date><pub-date pub-type="collection"><year>2014</year></pub-date><volume>3</volume><elocation-id>e03735</elocation-id><history><date date-type="received"><day>19</day><month>06</month><year>2014</year></date><date date-type="accepted"><day>24</day><month>10</month><year>2014</year></date></history><permissions><copyright-statement>Copyright &#xa9; 2014, Gardin et al</copyright-statement><copyright-year>2014</copyright-year><copyright-holder>Gardin et al</copyright-holder><license xlink:href="http://creativecommons.org/licenses/by/4.0/"><license-p>This article is distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License</ext-link>, which permits unrestricted use and redistribution provided that the original author and source are credited.</license-p></license></permissions><self-uri content-type="pdf" xlink:href="elife03735.pdf"/><abstract><object-id pub-id-type="doi">10.7554/eLife.03735.001</object-id><p>Most amino acids can be encoded by several synonymous codons, which are used at unequal frequencies. The significance of unequal codon usage remains unclear. One hypothesis is that frequent codons are translated relatively rapidly. However, there is little direct, in vivo, evidence regarding codon-specific translation rates. In this study, we generate high-coverage data using ribosome profiling in yeast, analyze using a novel algorithm, and deduce events at the A- and P-sites of the ribosome. Different codons are decoded at different rates in the A-site. In general, frequent codons are decoded more quickly than rare codons, and AT-rich codons are decoded more quickly than GC-rich codons. At the P-site, proline is slow in forming peptide bonds. We also apply our algorithm to short footprints from a different conformation of the ribosome and find strong amino acid-specific (not codon-specific) effects that may reflect interactions with the exit tunnel of the ribosome.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.001">http://dx.doi.org/10.7554/eLife.03735.001</ext-link></p></abstract><abstract abstract-type="executive-summary"><object-id pub-id-type="doi">10.7554/eLife.03735.002</object-id><title>eLife digest</title><p>Genes contain the instructions for making proteins from molecules called amino acids. These instructions are encoded in the order of the four building blocks that make up DNA, which are symbolized by the letters A, T, C, and G. The DNA of a gene is first copied to make a molecule of RNA, and then the letters in the RNA are read in groups of three (called &#x2018;codons&#x2019;) by a cellular machine called a ribosome. &#x2018;Sense codons&#x2019; each specify one amino acid, and the ribosome decodes hundreds or thousands of these codons into a chain of amino acids to form a protein. &#x2018;Stop codons&#x2019; do not encode amino acids but instead instruct the ribosome to stop building a protein when the chain is completed.</p><p>Most proteins are built from 20 different kinds of amino acid, but there are 61 sense codons. As such, up to six codons can code for the same amino acid. The multiple codons for a single amino acid, however, are not used equally in gene sequences&#x2014;some are used much more often than others.</p><p>Now, Gardin, Yeasmin et al. have instantly halted the on-going processes of decoding genes and building proteins in yeast cells. Codons being translated into amino acids are trapped inside the ribosome; and codons that take the longest to decode are trapped most often. By using a computer algorithm, Gardin, Yeasmin et al. were able to measure just how often each kind of sense codon was trapped inside the ribosome and use this as a measure of how quickly each codon is decoded. The more often a given codon is used in a gene sequence, the less likely it was found to be trapped inside the ribosome&#x2014;which suggests that these codons are decoded quicker than other codons and pass through the ribosome more quickly. Put another way, it appears that genes tend to use the codons that can be read the fastest.</p><p>Certain properties of a codon also affected its decoding speed. Codons with more As and Ts, for example, are decoded faster than codons with more Cs and Gs. Furthermore, whenever a chemically unusual amino acid called proline has to be added to a new protein chain, it slowed down the speed at which the protein was built. The method described by Gardin, Yeasmin et al. for peering into a decoding ribosome may now help future studies that aim to answer other questions about how proteins are built.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.002">http://dx.doi.org/10.7554/eLife.03735.002</ext-link></p></abstract><kwd-group kwd-group-type="author-keywords"><title>Author keywords</title><kwd>translation</kwd><kwd>decoding</kwd><kwd>codon bias</kwd><kwd>codon usage</kwd><kwd>synonymous codons</kwd><kwd>ribosome</kwd></kwd-group><kwd-group kwd-group-type="research-organism"><title>Research organism</title><kwd><italic>S. cerevisiae</italic></kwd></kwd-group><funding-group><award-group id="par-1"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100000057</institution-id><institution content-type="university">National Institute of General Medical Sciences</institution></institution-wrap></funding-source><award-id>RO1 GM098400</award-id><principal-award-recipient><name><surname>Futcher</surname><given-names>Bruce</given-names></name></principal-award-recipient></award-group><award-group id="par-2"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100000083</institution-id><institution content-type="university">Directorate for Computer and Information Science and Engineering</institution></institution-wrap></funding-source><award-id>DBI-1060572</award-id><principal-award-recipient><name><surname>Skiena</surname><given-names>Steve</given-names></name></principal-award-recipient></award-group><award-group id="par-3"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100000083</institution-id><institution content-type="university">Directorate for Computer and Information Science and Engineering</institution></institution-wrap></funding-source><award-id>IIS-1017181</award-id><principal-award-recipient><name><surname>Skiena</surname><given-names>Steve</given-names></name></principal-award-recipient></award-group><funding-statement>The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.</funding-statement></funding-group><custom-meta-group><custom-meta><meta-name>elife-xml-version</meta-name><meta-value>2.0</meta-value></custom-meta><custom-meta specific-use="meta-only"><meta-name>Author impact statement</meta-name><meta-value>The translation rates of each yeast codon have been measured in both the A and P sites of the ribosome.</meta-value></custom-meta></custom-meta-group></article-meta></front><body><sec sec-type="intro" id="s1"><title>Introduction</title><p>Different synonymous codons are used in genes at very different frequencies, and the reasons for this biased codon usage have been debated for three decades (<xref ref-type="bibr" rid="bib18">Fitch, 1976</xref>; <xref ref-type="bibr" rid="bib25">Hasegawa et al., 1979</xref>; <xref ref-type="bibr" rid="bib42">Miyata et al., 1979</xref>; <xref ref-type="bibr" rid="bib2">Bennetzen and Hall, 1982</xref>; <xref ref-type="bibr" rid="bib36">Lipman and Wilbur, 1983</xref>; <xref ref-type="bibr" rid="bib56">Sharp and Li, 1986</xref>; <xref ref-type="bibr" rid="bib8">Bulmer, 1987</xref>; <xref ref-type="bibr" rid="bib17">Drummond and Wilke, 2008</xref>) (reviewed by <xref ref-type="bibr" rid="bib48">Plotkin and Kudla (2011)</xref>; <xref ref-type="bibr" rid="bib19">Forster (2012)</xref>; <xref ref-type="bibr" rid="bib44">Novoa and Ribas de Pouplana (2012)</xref>). In particular, it has been suggested that the frequently-used codons are translated more rapidly than rarely-used codons, perhaps because tRNAs for the frequent codons are relatively highly expressed (<xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>). However, there have also been competing hypotheses, including the idea that frequently-used codons are translated more accurately (<xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>). Genes are often recoded to use frequent codons to increase protein expression (<xref ref-type="bibr" rid="bib9">Burgess-Brown et al., 2008</xref>; <xref ref-type="bibr" rid="bib41">Maertens et al., 2010</xref>), but without any solid understanding of why this manipulation is effective. There is little or no direct in vivo evidence as to whether the more common codons are indeed translated more rapidly than the rarer codons. Even if they are, the fact that translation is typically limited by initiation, not elongation, leaves the effectiveness of codon optimization a puzzle (<xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>).</p><p>Ribosome profiling (<xref ref-type="bibr" rid="bib28">Ingolia et al., 2009</xref>) allows the observation of positions of ribosomes on translating cellular mRNAs. The basis of the method is that a translating ribosome protects a region of mRNA from nuclease digestion, generating a 30 base &#x2018;footprint&#x2019;. The footprint is roughly centered on the A-site of the ribosome. If some particular codon in the A-site were translated slowly, then the ribosome would dwell at this position, and so footprints generated from ribosomes at this position would be relatively common. Thus, if one looked at the number of ribosome footprints generated along an mRNA, there should be more footprints centered at every codon that is translated slowly and fewer centered at every codon translated rapidly; in principle, this is a method for measuring rates of translation of individual codons.</p><p>Experimentally, there is dramatic variation in the number of footprints generated at different positions along any particular mRNA (<xref ref-type="bibr" rid="bib29">Ingolia et al., 2011</xref>) (<xref ref-type="fig" rid="fig1">Figure 1</xref>). However, these large peaks and valleys do not correlate with particular codons (<xref ref-type="bibr" rid="bib29">Ingolia et al., 2011</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>). It is still unclear what features of the mRNA cause the peaks and valleys, though there is evidence that prolines, or a poly-basic amino acid stretch, contribute to a slowing of the ribosome and a peak of ribosome footprints (<xref ref-type="bibr" rid="bib29">Ingolia et al., 2011</xref>; <xref ref-type="bibr" rid="bib7">Brandman et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>).<fig id="fig1" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.003</object-id><label>Figure 1.</label><caption><title>Two ribosome profiles of the <italic>TDH1</italic> gene.</title><p>Top profile is from the data of <xref ref-type="bibr" rid="bib28">Ingolia et al., 2009</xref>; bottom profile is from the SC-lys dataset (&#x2018;Materials and methods&#x2019;). The first (leftmost) peak in the profiles is at the ATG start codon; it may differ in relative height because the SC-lys dataset was generated using flash-freezing.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.003">http://dx.doi.org/10.7554/eLife.03735.003</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f001"/></fig></p><p>Still, the fact that prolines and poly-basic amino acid stretches affect translation speed does not tell us whether different synonymous codons may also cause smaller effects. This question was investigated by <xref ref-type="bibr" rid="bib50">Qian et al. (2012)</xref> and <xref ref-type="bibr" rid="bib11">Charneski and Hurst (2013)</xref> using the yeast ribosome profiling data of <xref ref-type="bibr" rid="bib28">Ingolia et al. (2009)</xref>. Neither group found any effect of different synonymous codons on translation rate&#x2014;that is, perhaps surprisingly, each codon, rare or common, appeared to be translated at the same rate (<xref ref-type="bibr" rid="bib50">Qian et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>).</p><p>We have re-investigated this issue with two differences from these previous investigations. First, we have generated four yeast ribosome profiling datasets by optimized methods, including the flash-freezing of growing cells before the addition of cycloheximide (&#x2018;Materials and methods&#x2019;); Ingolia et al. added cycloheximide before harvesting cells. Second, we have developed a novel method of analysis, designed with the knowledge that, at best, codon decoding rates could account for only a small portion of the variation in ribosome footprints across an mRNA (&#x2018;Materials and methods&#x2019;). The combination of optimized data and novel analysis reveals that different codons are decoded at different rates.</p></sec><sec sec-type="results" id="s2"><title>Results</title><p>In principle, using the ribosome footprint data to establish occupancy as a function of position might seem easy: align the reads to the reference genome to identify the 10 or so codons under each read, and tabulate the frequency of each codon observed in each position. Analysis of this general kind has been carried out previously, but without detecting codon-specific differences in decoding rates (<xref ref-type="bibr" rid="bib50">Qian et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>). However, this analysis in its simplest form would overweight the highly expressed genes, which account for a large fraction of total reads&#x2014;that is, a relatively small number of highly expressed genes would dominate the analysis. But because there are extreme peaks and valleys in ribosome footprint profiles (<xref ref-type="fig" rid="fig1">Figure 1</xref>), and because these are not primarily due to codon usage, this simple analysis would likely fail, because the results would depend mainly on a relatively small number of chromosome positions, and because of the peak-to-valley variability affecting these positions. Defining the right normalizations to compensate for differences in gene expression, gene length, sequence composition, etc, is complicated and problematic.</p><p>Instead, we have opted for a simpler approach. We independently analyze many selected regions (windows) where the effects of codon usage are particularly easy to assay. For each codon, we identify all translated regions in the genome where a particular codon (say CTC) occurs uniquely within a window of 10 codons upstream and 10 codons downstream&#x2014;that is, a window 19-codons wide, with the codon of interest occurring exactly once at position 10 of the 19-position window. For footprints 10-codons long, there are exactly 10 classes of footprints that contain this particular CTC and fit entirely in the window. That is, the CTC of interest can occur at position 1 of the footprint, or position 2, &#x2026;., or position 10. Analysis was restricted to windows with at least 20 total reads and at least 3 non-empty classes. For our four datasets discussed below, there was an average of 408, 1586, 1749, or 2868 qualifying windows per codon, respectively (more windows for the abundant codons, fewer for the rare codons).</p><p>In the absence of any codon preference of the ribosome, there should be a uniform distribution of footprints across the ten positions. That is, in a window centered on CTC and containing 100 footprints, one expects 10 footprints at each of the 10 positions, a relative frequency of 0.1 (10/100) at each position. On the other hand, if the ribosome was to dwell for an extended time over the CTC whenever that codon was at, say, position 6 of the footprint, then there might be 30 footprints with CTC in position 6, and about 8 footprints at each of the other 9 positions, thus giving a frequency distribution with a peak at position 6. Many such relative frequency distributions can be fairly averaged over all windows over all genes centered on a specific codon. Regions on highly expressed genes can be fairly compared with similar regions on genes with lower expression, because we are dealing with relative frequency distributions. Each window thus represents an independent trial of the ribosome's dwell time over each given codon. Averaging over the hundreds or thousands of windows in the genome generates a statistically rigorous analysis. Note that we do not attempt any normalization based on gene expression&#x2014;instead, we take each qualifying window as an independent experiment, regardless of level of expression, then average all frequency distributions from all windows for each codon. A related idea was also used by <xref ref-type="bibr" rid="bib34">Lareau et al. (2014)</xref>, although on significantly different data, and with normalization by gene.</p><p>The relative frequency averaged over all windows is a number between 0 and 1, and we compare this to the baseline frequency (0.1) (total footprints over 10 positions) to compute a final statistic, which we call the Ribosome Residence Time, or RRT. For instance, if the average relative frequency for a codon at a particular position is 0.1, then the RRT is 1, and we interpret this to mean that the ribosome spends the average amount of time at the given codon at the given position. An RRT of two suggests that the ribosome spends twice as long as average at the given codon.</p><sec id="s2-1"><title>Validation of ribosome residence time analysis</title><p>We tested this method of analysis using simulated and real positive and negative control data. For a simulated negative control, we assigned real footprint data from our SC-lys dataset to random codons and did RRT analysis. As expected, all codons at all positions show an RRT of about 1, that is, no signal (<xref ref-type="fig" rid="fig2">Figure 2A</xref>). For a simulated positive control, we generated a simulated data set of 2 million 10-codon reads over coding genes, but we biased these simulated reads to give more reads for the codon AAA at position 6 of the footprint. As expected, RRT analysis shows a peak for AAA at position 6 (<xref ref-type="fig" rid="fig2">Figure 2B</xref>).<fig id="fig2" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.004</object-id><label>Figure 2.</label><caption><title>Validation for ribosome residence time analysis.</title><p>(<bold>A</bold>) Simulated data, negative control. Real footprint data from the SC-lys dataset were randomly assigned to codons, and RRT analysis was carried out. A flat line with an RRT value of 1 indicates no signal. (<bold>B</bold>) Simulated data, positive control. A dataset of 2 million simulated reads was generated but biased to give more reads over the codon AAA at position 6. (<bold>C</bold>) Real data, negative control. RNA-seq data from naked fragments of RNA 30 nucleotides long, processed as if for ribosome profiling, were analyzed. (<bold>D</bold>) Real data, positive control. Real ribosome footprinting data from Li et al. were analyzed (<xref ref-type="bibr" rid="bib35">Li et al., 2012</xref>). In this experiment, <italic>E. coli</italic> were starved for serine. Note that the highest Ser peak is for TCA, which is the rarest Ser codon in <italic>E. coli</italic>, and the lowest Ser peak is for AGC, which is the most common Ser codon in <italic>E. coli</italic>. High values at position 9 as well as 8 may indicate that the A-site may be at position 8 in some fragments and position 9 in others.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.004">http://dx.doi.org/10.7554/eLife.03735.004</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f002"/></fig></p><p>For a real-data negative control, we pooled the control mRNA-seq data for 30 bp fragments from our four experiments (&#x2018;Materials and methods&#x2019;) and analyzed these mRNA fragments. Since this RNA came from a total naked RNA preparation, there were no ribosomes and no ribosome footprints, so there should not be any signal from translation, even though we are analyzing real 30 bp RNA fragments. Indeed, RRT analysis shows no peaks in positions 2 through 9 of these fragments (<xref ref-type="fig" rid="fig2">Figure 2C</xref>). However, there are modest deviations from 1 at the termini, positions 1 and 10. We attribute these to some base-specificity for the enzymatic reactions used to generate the fragment library (<xref ref-type="bibr" rid="bib33">Lamm et al., 2011</xref>; <xref ref-type="bibr" rid="bib30">Jackson et al., 2014</xref>; <xref ref-type="bibr" rid="bib51">Raabe et al., 2014</xref>). Supporting this interpretation, the same peaks and valleys at positions 1 and 10 (i.e., the same base-specificity) were seen in real ribosome-footprint data (see below).</p><p>For a real data positive control experiment, we used the <italic>Escherichia coli</italic> data generated by Li et al., who starved <italic>E. coli</italic> for serine, and did ribosome profiling (<xref ref-type="bibr" rid="bib35">Li et al., 2012</xref>). Because of the starvation for serine, there is an expectation that all six serine codons should be decoded slowly and so should have high RRT values. This proved to be the case (<xref ref-type="fig" rid="fig2">Figure 2D</xref>). The six serine codons had 6 of the 7 highest RRT values at position 8 (<xref ref-type="fig" rid="fig2">Figure 2D</xref>, <xref ref-type="table" rid="tbl1">Table 1</xref>), which presumably represents the A-site in this experiment. Note that because these are <italic>E. coli</italic> ribosomes, the phase of the footprint (i.e., the position of the A-site in the footprint) is different from its phase with regard to yeast ribosomes (see below). The RRT analysis of <italic>E. coli</italic> footprints also showed interesting variation at positions 2, 3, and 4 (<xref ref-type="fig" rid="fig2">Figure 2D</xref>), which we will consider elsewhere.<table-wrap id="tbl1" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.005</object-id><label>Table 1.</label><caption><p>Top ten RRTs at position 8 in <italic>E. coli</italic> starved for serine</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.005">http://dx.doi.org/10.7554/eLife.03735.005</ext-link></p></caption><table frame="hsides" rules="groups"><thead><tr><th>Codon</th><th>AA</th><th>Usage</th><th>RRT</th></tr></thead><tbody><tr><td>TCA</td><td>Ser</td><td>8.1</td><td>1.98</td></tr><tr><td>TCC</td><td>Ser</td><td>9.0</td><td>1.90</td></tr><tr><td>TCG</td><td>Ser</td><td>8.8</td><td>1.73</td></tr><tr><td>TCT</td><td>Ser</td><td>8.7</td><td>1.71</td></tr><tr><td>AGT</td><td>Ser</td><td>9.4</td><td>1.57</td></tr><tr><td>ATA</td><td>Ile</td><td>5.5</td><td>1.42</td></tr><tr><td>AGC</td><td>Ser</td><td>16.0</td><td>1.25</td></tr><tr><td>ATT</td><td>Ile</td><td>29.7</td><td>1.18</td></tr><tr><td>CCT</td><td>Pro</td><td>7.2</td><td>1.15</td></tr><tr><td>CCA</td><td>Pro</td><td>8.4</td><td>1.13</td></tr></tbody></table></table-wrap></p><p><xref ref-type="bibr" rid="bib34">Lareau et al. (2014)</xref> starved <italic>Saccharomyces cerevisiae</italic> for histidine using the His3 inhibitor 3-aminotriazole. This was another potential positive control, where the two His codons should be decoded slowly. We analyzed these ribosome profiling data. However, of the 11 million reads obtained in that experiment, about 10.6 million mapped to ribosomal RNA. The remaining &#x223c;0.4 million reads mapped to mRNA, but gave only 10 (ten) total windows passing our quality filters for RRT analysis, and this is too few. However, when we relaxed the filters to obtain more (albeit lower quality) windows, we observed obvious peaks (high RRT values) for both histidine codons at position 6 specifically in the 3-aminotriazole experiment (data not shown).</p></sec><sec id="s2-2"><title>Ribosome residence time analysis of codons</title><p>Having found that RRT analysis gives the expected results in control experiments, we applied it to the analysis of four of our ribosome profiling experiments. Our experiments differ from those of Ingolia et al. and Lareau et al., in that in those studies, cycloheximide was added to the growing yeast culture before harvesting (<xref ref-type="bibr" rid="bib28">Ingolia et al., 2009</xref>; <xref ref-type="bibr" rid="bib34">Lareau et al., 2014</xref>), whereas we harvest by flash-freezing and later add cycloheximide to the frozen cells (&#x2018;Materials and methods&#x2019;). The nature of our results is shown in <xref ref-type="fig" rid="fig3">Figure 3</xref> using the rare Leu codon CTC as an example. In this example, 10 codon (30 nucleotide) footprints that have CTC as the first codon have about the average relative frequency&#x2014;that is, they have about the same relative frequency as footprints with any other codon at the first position. Similarly when CTC is in the 2nd, 3rd, 4th, 7th, 8th, 9th, and 10th positions. However, there is a relative over abundance of footprints that have CTC at the 6th position. In fact, for CTC at the 6th position, averaged over 451 windows (in the case of this rare codon), there are 1.89-fold more footprints than at the baseline. This suggests that ribosomes move relatively slowly when CTC is at the 6th position, and, therefore, these ribosomes are more frequently captured as footprints. We say that CTC has a Ribosome Residence Time (RRT) of 1.89 at position 6.<fig id="fig3" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.006</object-id><label>Figure 3.</label><caption><title>Principle of ribosome residence time analysis.</title><p>The ribosome protects a 30 nt &#x2018;footprint&#x2019; of RNA centered around the A, P, and E sites (positions 6, 5, and 4). The rare Leu codon CTC has a high RRT at position 6, which is likely the A-site.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.006">http://dx.doi.org/10.7554/eLife.03735.006</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f003"/></fig></p><p><xref ref-type="fig" rid="fig4">Figure 4</xref> shows data for all 61 sense codons from one of four experiments, the &#x2018;SC-lys&#x2019; experiment. In a large majority of cases, a codon has its highest or lowest footprint abundance when the codon is in position 6. We interpret this to mean that the codon affects the rate of ribosome movement when the codon is in position 6, which we believe to be the A-site of the ribosome (see below for further support for this assignment). The behavior of the six Leu codons and the four Thr codons is highlighted in <xref ref-type="fig" rid="fig4">Figure 4B,C</xref>. Footprint frequencies also differ from the average in a specific way at positions 5 (<xref ref-type="fig" rid="fig4">Figure 4D</xref>) (see below) and 1 and 10, the two ends of the footprint. We attribute variation at positions 1 and 10 to some base-specificity for the enzymatic reactions involved in generating and analyzing ribosome footprints (<xref ref-type="bibr" rid="bib33">Lamm et al., 2011</xref>; <xref ref-type="bibr" rid="bib30">Jackson et al., 2014</xref>; <xref ref-type="bibr" rid="bib51">Raabe et al., 2014</xref>); the same variations are seen in reactions with naked RNA fragments.<fig id="fig4" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.007</object-id><label>Figure 4.</label><caption><title>Results of Ribosome Residence Time analysis.</title><p>(<bold>A</bold>) The pattern of RRTs for all codons at all positions. Most peaks are at position 6, with some at position 5. (<bold>B</bold>) The RRTs for the six leucine codons. CTC has the highest RRT of any codon at position 6. (<bold>C</bold>) The RRTs for the four threonine codons. ACC has the lowest RRT of any codon at position 6. (<bold>D</bold>) The RRTs for the four proline codons. Proline has peaks at position 5, the P-site, as well as at position 6.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.007">http://dx.doi.org/10.7554/eLife.03735.007</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f004"/></fig></p><p><xref ref-type="fig" rid="fig5">Figure 5A</xref> shows the deduced rate of ribosome movement for each codon, plotted against the frequency of codon usage. There is a good correlation (r &#x3d; &#x2013;0.52); that is, the ribosome moves faster over the more common codons.<fig id="fig5" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.008</object-id><label>Figure 5.</label><caption><title>Correlation of ribosome residence times with codon properties.</title><p>(<bold>A</bold>) Correlation of RRT with codon usage. RRT is plotted against the frequency of each codon per 1000 codons. (<bold>B</bold>) Correlation of RRT with the GC content of each codon. The codons were divided into quartiles by RRT (Fastest&#x2013;Slowest), and the GC content of those &#x223c;15 codons is shown in a violin plot.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.008">http://dx.doi.org/10.7554/eLife.03735.008</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f005"/></fig></p><p>There is also a correlation, albeit weaker, with the AT-richness of the codon. AT-rich codons are decoded somewhat faster than average, while GC-rich codons are decoded more slowly (<xref ref-type="fig" rid="fig5">Figure 5B</xref>). The mean RRT of codons with 3 or 2 GC residues was 1.23, while the mean RRT of codons with 1 or 0 GC residues was 1.01, a statistically significant difference (p &#x3c; 0.003 by a two-tailed <italic>t</italic> test).</p><p><xref ref-type="table" rid="tbl2">Table 2</xref> shows the Ribosome Residence Time at position 6 for each of the 61 sense codons. The slowest codon is the rare Leu codon CTC. Relatively, the ribosome spends about 1.9 times as long with a CTC codon in the A site as it does at the average codon. If the yeast ribosome spends 50 milliseconds (<xref ref-type="bibr" rid="bib20">Futcher et al., 1999</xref>) on an average codon in the A-site, then the RRT suggests it spends about 95 milliseconds on CTC codons. The fastest codon is the relatively abundant Thr codon ACC (<xref ref-type="fig" rid="fig4">Figure 4C</xref>, <xref ref-type="table" rid="tbl2">Table 2</xref>), where it spends 0.70 times as long as average (i.e., about 35 milliseconds).<table-wrap id="tbl2" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.009</object-id><label>Table 2.</label><caption><p>Ribosome residence time at position 6 (A) and 5 (B)</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.009">http://dx.doi.org/10.7554/eLife.03735.009</ext-link></p></caption><table frame="hsides" rules="groups"><thead><tr><th colspan="5">A</th></tr><tr><th>Codon</th><th>AA</th><th>Usage</th><th>RRT</th><th>p value</th></tr></thead><tbody><tr><td>CTC</td><td>Leu</td><td>5.4</td><td>1.89</td><td>&#x2a;0.0001</td></tr><tr><td>CCC</td><td>Pro</td><td>6.8</td><td>1.71</td><td>&#x2a;0.0001</td></tr><tr><td>GGG</td><td>Gly</td><td>6</td><td>1.61</td><td>&#x2a;0.0001</td></tr><tr><td>AGG</td><td>Arg</td><td>9.2</td><td>1.59</td><td>&#x2a;0.0001</td></tr><tr><td>ATA</td><td>Ile</td><td>17.8</td><td>1.57</td><td>&#x2a;0.0001</td></tr><tr><td>GGA</td><td>Gly</td><td>10.9</td><td>1.56</td><td>&#x2a;0.0001</td></tr><tr><td>TGG</td><td>Trp</td><td>10.4</td><td>1.53</td><td>&#x2a;0.0001</td></tr><tr><td>GTG</td><td>Val</td><td>10.8</td><td>1.52</td><td>&#x2a;0.0001</td></tr><tr><td>CGC</td><td>Arg</td><td>2.6</td><td>1.45</td><td>&#x2a;0.0001</td></tr><tr><td>CGA</td><td>Arg</td><td>3</td><td>1.45</td><td>&#x2a;0.0008</td></tr><tr><td>CGG</td><td>Arg</td><td>1.7</td><td>1.44</td><td>&#x2a;0.0010</td></tr><tr><td>TCG</td><td>Ser</td><td>8.6</td><td>1.43</td><td>&#x2a;0.0001</td></tr><tr><td>CCA</td><td>Pro</td><td>18.3</td><td>1.38</td><td>&#x2a;0.0001</td></tr><tr><td>ACA</td><td>Thr</td><td>17.8</td><td>1.35</td><td>&#x2a;0.0001</td></tr><tr><td>CCG</td><td>Pro</td><td>5.3</td><td>1.31</td><td>&#x2a;0.0001</td></tr><tr><td>GTA</td><td>Val</td><td>11.8</td><td>1.31</td><td>&#x2a;0.0001</td></tr><tr><td>GCA</td><td>Ala</td><td>16.2</td><td>1.28</td><td>&#x2a;0.0001</td></tr><tr><td>CCT</td><td>Pro</td><td>13.5</td><td>1.27</td><td>&#x2a;0.0001</td></tr><tr><td>TCA</td><td>Ser</td><td>18.7</td><td>1.26</td><td>&#x2a;0.0001</td></tr><tr><td>TAC</td><td>Tyr</td><td>14.8</td><td>1.25</td><td>&#x2a;0.0001</td></tr><tr><td>TAT</td><td>Tyr</td><td>18.8</td><td>1.25</td><td>&#x2a;0.0001</td></tr><tr><td>GAG</td><td>Glu</td><td>19.2</td><td>1.25</td><td>&#x2a;0.0001</td></tr><tr><td>CTA</td><td>Leu</td><td>13.4</td><td>1.25</td><td>&#x2a;0.0001</td></tr><tr><td>CTT</td><td>Leu</td><td>12.3</td><td>1.24</td><td>&#x2a;0.0001</td></tr><tr><td>TGC</td><td>Cys</td><td>4.8</td><td>1.23</td><td>&#x2a;0.0001</td></tr><tr><td>GGC</td><td>Gly</td><td>9.8</td><td>1.22</td><td>&#x2a;0.0001</td></tr><tr><td>CAG</td><td>Gln</td><td>12.1</td><td>1.15</td><td>&#x2a;0.0002</td></tr><tr><td>ACG</td><td>Thr</td><td>8</td><td>1.12</td><td>0.0069</td></tr><tr><td>AGT</td><td>Ser</td><td>14.2</td><td>1.10</td><td>0.0060</td></tr><tr><td>AGC</td><td>Ser</td><td>9.8</td><td>1.09</td><td>0.0213</td></tr><tr><td>CAC</td><td>His</td><td>7.8</td><td>1.08</td><td>0.0098</td></tr><tr><td>TTT</td><td>Phe</td><td>26.1</td><td>1.05</td><td>0.0529</td></tr><tr><td>GAA</td><td>Glu</td><td>45.6</td><td>1.04</td><td>0.0538</td></tr><tr><td>AGA</td><td>Arg</td><td>21.3</td><td>1.01</td><td>0.3014</td></tr><tr><td>TTC</td><td>Phe</td><td>18.4</td><td>1.00</td><td>0.4955</td></tr><tr><td>GCG</td><td>Ala</td><td>6.2</td><td>0.99</td><td>0.4650</td></tr><tr><td>TCC</td><td>Ser</td><td>14.2</td><td>0.99</td><td>0.3341</td></tr><tr><td>TTA</td><td>Leu</td><td>26.2</td><td>0.99</td><td>0.3166</td></tr><tr><td>TCC</td><td>Ser</td><td>23.5</td><td>0.98</td><td>0.2249</td></tr><tr><td>CAT</td><td>His</td><td>13.6</td><td>0.93</td><td>0.0188</td></tr><tr><td>GGT</td><td>Gly</td><td>23.9</td><td>0.93</td><td>&#x2a;0.0003</td></tr><tr><td>ATG</td><td>Met</td><td>20.9</td><td>0.92</td><td>0.0027</td></tr><tr><td>ATT</td><td>Ile</td><td>30.1</td><td>0.92</td><td>&#x2a;0.0005</td></tr><tr><td>TTG</td><td>Leu</td><td>27.2</td><td>0.92</td><td>&#x2a;0.0001</td></tr><tr><td>CTG</td><td>Leu</td><td>10.5</td><td>0.92</td><td>0.0139</td></tr><tr><td>AAT</td><td>Asn</td><td>35.7</td><td>0.88</td><td>&#x2a;0.0001</td></tr><tr><td>AAA</td><td>Lys</td><td>41.9</td><td>0.88</td><td>&#x2a;0.0003</td></tr><tr><td>CGT</td><td>Arg</td><td>6.4</td><td>0.87</td><td>&#x2a;0.0002</td></tr><tr><td>CAA</td><td>Gln</td><td>27.3</td><td>0.87</td><td>&#x2a;0.0001</td></tr><tr><td>GCC</td><td>Ala</td><td>12.6</td><td>0.86</td><td>&#x2a;0.0001</td></tr><tr><td>GAC</td><td>Asp</td><td>20.2</td><td>0.85</td><td>&#x2a;0.0001</td></tr><tr><td>TGT</td><td>Cys</td><td>8.1</td><td>0.81</td><td>&#x2a;0.0001</td></tr><tr><td>GCT</td><td>Ala</td><td>21.2</td><td>0.81</td><td>&#x2a;0.0001</td></tr><tr><td>ATC</td><td>Ile</td><td>17.2</td><td>0.80</td><td>&#x2a;0.0001</td></tr><tr><td>ACT</td><td>Thr</td><td>20.3</td><td>0.78</td><td>&#x2a;0.0001</td></tr><tr><td>GAT</td><td>Asp</td><td>37.6</td><td>0.76</td><td>&#x2a;0.0001</td></tr><tr><td>AAC</td><td>Asn</td><td>24.8</td><td>0.76</td><td>&#x2a;0.0001</td></tr><tr><td>GTT</td><td>Val</td><td>22.1</td><td>0.75</td><td>&#x2a;0.0001</td></tr><tr><td>GTC</td><td>Val</td><td>11.8</td><td>0.75</td><td>&#x2a;0.0001</td></tr><tr><td>AAG</td><td>Lys</td><td>30.8</td><td>0.74</td><td>&#x2a;0.0001</td></tr><tr><td>ACC</td><td>Thr</td><td>12.7</td><td>0.70</td><td>&#x2a;0.0001</td></tr></tbody></table><table frame="hsides" rules="groups"><thead><tr><th colspan="5">B</th></tr><tr><th>Codon</th><th>AA</th><th>Usage</th><th>RRT</th><th>p value</th></tr></thead><tbody><tr><td>CCT</td><td>Pro</td><td>13.5</td><td>1.80</td><td>&#x2a;0.0001</td></tr><tr><td>CCC</td><td>Pro</td><td>6.8</td><td>1.48</td><td>&#x2a;0.0001</td></tr><tr><td>CCA</td><td>Pro</td><td>18.3</td><td>1.48</td><td>&#x2a;0.0001</td></tr><tr><td>AAT</td><td>Asn</td><td>35.7</td><td>1.39</td><td>&#x2a;0.0001</td></tr><tr><td>CGC</td><td>Arg</td><td>1.7</td><td>1.34</td><td>0.0070</td></tr><tr><td>CCG</td><td>Pro</td><td>5.3</td><td>1.30</td><td>&#x2a;0.0001</td></tr></tbody></table><table-wrap-foot><fn><p>A. Usage of each codon per 1000 codons and the Ribosome Residence Time (RRT) at position 6 (the A-site of the ribosome). The p-value for a difference between the calculated RRT value and an RRT value of 1 is shown. p-values less than or equal to 0.001 are marked with an asterisk. B. As for A, but for the six highest values at position 5 (the P-site).</p></fn></table-wrap-foot></table-wrap></p><p>There are also peaks at position 5 (<xref ref-type="fig" rid="fig4">Figure 4A,D</xref>), which we interpret as the ribosome's P-site, where the peptide bond is formed. All four Pro codons are high at position 5: CCT, CCA, and CCC are the three slowest codons at position 5, while CCG is 6th (<xref ref-type="fig" rid="fig4">Figure 4D</xref>, <xref ref-type="table" rid="tbl2">Table 2</xref>). Proline is a unique amino acid in having a secondary rather than a primary amino group, and so it is less reactive in peptide bond formation. Proline forms peptide bonds slowly (<xref ref-type="bibr" rid="bib43">Muto and Ito, 2008</xref>; <xref ref-type="bibr" rid="bib60">Wohlgemuth et al., 2008</xref>; <xref ref-type="bibr" rid="bib45">Pavlov et al., 2009</xref>; <xref ref-type="bibr" rid="bib31">Johansson et al., 2011</xref>), and proline has been associated with slow translation in footprinting experiments (<xref ref-type="bibr" rid="bib29">Ingolia et al., 2011</xref>). Our result that the ribosome slows with proline at position 5 is consistent with this and tends to confirm our assignment of position 5 to the P-site and, therefore, position 6 to the A-site. A few other residues also seem slightly slow at position 5 (e.g., Asn, Gly, see <xref ref-type="table" rid="tbl2">Table 2</xref> and <xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1</xref>), possibly due to low reactivity in peptide bond formation (<xref ref-type="bibr" rid="bib31">Johansson et al., 2011</xref>).</p><p>All four proline codons also have high RRTs at position 6, the A-site (<xref ref-type="fig" rid="fig4">Figure 4D</xref>, <xref ref-type="table" rid="tbl2">Table 2</xref>). The dipeptide ProPro is translated very slowly (<xref ref-type="bibr" rid="bib15">Doerfel et al., 2013</xref>; <xref ref-type="bibr" rid="bib23">Gutierrez et al., 2013</xref>; <xref ref-type="bibr" rid="bib46">Peil et al., 2013</xref>; <xref ref-type="bibr" rid="bib58">Ude et al., 2013</xref>). We wondered whether the apparent slowness of proline at both positions 5 and 6 was an informatic artefact due to extreme slowness for ProPro dipeptides. We redid the original analysis after excluding all footprints encoding ProPro dipeptides. Results did not change significantly; Pro still appeared to be slow at both positions 5 and 6 (<xref ref-type="fig" rid="fig6">Figure 6A</xref>). On the other hand, when we looked specifically at footprints containing a ProPro dipeptide, there was a very large peak at position 5 (<xref ref-type="fig" rid="fig6">Figure 6B</xref>), consistent with the very slow peptide bond formation seen in studies cited above.<fig id="fig6" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.010</object-id><label>Figure 6.</label><caption><title>Analysis of ProPro dipeptides.</title><p>(<bold>A</bold>) RRT analysis of windows containing no ProPro dipeptides. (<bold>B</bold>) RRT analysis of windows containing ProPro dipeptides.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.010">http://dx.doi.org/10.7554/eLife.03735.010</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f006"/></fig></p><p>To establish repeatability, we generated and analyzed three other ribosome profiling datasets and also re-analyzed previously published data (<xref ref-type="bibr" rid="bib28">Ingolia et al., 2009</xref>). All five data sets gave qualitatively similar results; pairwise correlations for RRTs at position 6 ranged from 0.22 to 0.96 between the datasets (<xref ref-type="table" rid="tbl3">Table 3</xref>). The poorest correlation (0.22) was a correlation with the previously published dataset, which was generated using significantly different methods than our datasets. In particular, that dataset was generated by adding cycloheximide to the growing culture, then harvesting (<xref ref-type="bibr" rid="bib28">Ingolia et al., 2009</xref>), whereas our data were generated by flash-freezing first, then adding cycloheximide to the frozen cells. Complete results for all five experiments are given in <xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1</xref>. More recently, we also subjected the long footprint data of <xref ref-type="bibr" rid="bib34">Lareau et al. (2014)</xref> to RRT analysis and obtained correlations at position 6 of 0.21, 0.47, 0.23, and 0.27, respectively, for their &#x2018;untreated 1&#x2019;, &#x2018;untreated 2&#x2019;, &#x2018;untreated merge&#x2019;, and &#x2018;cycloheximide 1&#x2019; experiments to our SC-lys experiment. Again, these experiments were carried out in a significantly different way from ours and it is not surprising that the correlations are modest. It is reassuring that a positive correlation can be seen even for experiments where no cycloheximide was used.<table-wrap id="tbl3" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.011</object-id><label>Table 3.</label><caption><p>Correlations between experiments</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.011">http://dx.doi.org/10.7554/eLife.03735.011</ext-link></p></caption><table frame="hsides" rules="groups"><thead><tr><th/><th>YPD1</th><th>-His</th><th>YPD2</th><th>Ingo.</th></tr></thead><tbody><tr><td>-Lys</td><td>0.80</td><td>0.35</td><td>0.76</td><td>0.22</td></tr><tr><td>YPD1</td><td/><td>0.53</td><td>0.96</td><td>0.55</td></tr><tr><td>-His</td><td/><td/><td>0.58</td><td>0.37</td></tr><tr><td>YPD2</td><td/><td/><td/><td>0.53</td></tr></tbody></table><table-wrap-foot><fn><p>The pairwise Spearman correlations between the RRT values at position 6 are shown for five independent experiments, where the experiments are named YPD1, YPD2, SC-Lys, SC-His, and Ingolia. The SC-Lys and SC-His experiments were carried out by JG, and used flash-freezing as the initial method for stopping ribosome movement. The YPD1 and YPD2 experiments were carried out by YC (<xref ref-type="bibr" rid="bib10">Cai and Futcher, 2013</xref>), and used addition of ice and cycloheximide to the culture as the initial method for stopping ribosome movement. The &#x2018;Ingo&#x2019; experiment was that carried out by <xref ref-type="bibr" rid="bib28">Ingolia et al. (2009)</xref>. Further details are given in &#x2018;Materials and methods&#x2019;. Complete RRT values for each position in each experiment are provided in <xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1</xref>.</p></fn></table-wrap-foot></table-wrap></p><p>There are strong correlations between codon usage, the number of tRNA genes for the relevant tRNA, and tRNA abundance (<xref ref-type="bibr" rid="bib26">Ikemura, 1981</xref>, <xref ref-type="bibr" rid="bib27">1982</xref>; <xref ref-type="bibr" rid="bib16">Dong et al., 1996</xref>; <xref ref-type="bibr" rid="bib57">Tuller et al., 2010</xref>; <xref ref-type="bibr" rid="bib44">Novoa and Ribas de Pouplana, 2012</xref>). Although one cannot determine causation from this correlation (<xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>), nevertheless it is consistent with the idea that the rate of decoding in translation is at least partly limited by tRNA concentration. Most of our results are consistent with this. However, there are some interesting exceptions. In yeast, the 61 sense codons are decoded by only 42 tRNAs. There are 12 pairs of codons that share a single tRNA (e.g., Phe TTC and TTT; Tyr TAT and TAC; etc) (<xref ref-type="bibr" rid="bib54">Roth, 2012</xref>). In many but not all cases, the RRT of the two codons is similar (<xref ref-type="table" rid="tbl2">Table 2</xref>), consistent with the &#x2018;concentration&#x2019; hypothesis. However, there are also cases where the RRT appears to be significantly different for two codons sharing the same tRNA. For instance, the Cys codon TGC has an RRT of 1.23, while TGT has an RRT of 0.81 (<xref ref-type="table" rid="tbl2">Table 2</xref>). Both codons are recognized by the same tRNA, which in this case is complementary for TGC, and wobble for TGT. Similarly, the Gly codon GGC has an RRT of 1.22 (tRNA is complementary), while GGT has an RRT of 0.93 (tRNA is wobble). Both these relationships (RRT for TGC &#x3e; TGT, and RRT for GGC &#x3e; GGT) were true in all five datasets (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1</xref>). In both the cases, the perfect match is decoded more slowly than the wobble match and in both cases, the slower, complementary pairing has a G:C match at the third (i.e., wobble) position. These and other similar examples (not shown) suggest that the RRT depends on more than just the concentration of the relevant tRNA. Perhaps the long RRTs for these GC-rich codons are related to the time needed to eject incorrectly paired anti-codons of incorrect tRNAs, although this explanation is somewhat at odds with the literature (<xref ref-type="bibr" rid="bib13">Daviter et al., 2006</xref>; <xref ref-type="bibr" rid="bib21">Gromadski et al., 2006</xref>). Alternatively, it has been suggested that translocation can occur more quickly when the codon:anticodon interaction is weaker (<xref ref-type="bibr" rid="bib55">Semenkov et al., 2000</xref>; <xref ref-type="bibr" rid="bib32">Khade and Joseph, 2011</xref>).</p></sec><sec id="s2-3"><title>RRT analysis of short footprints</title><p>Recently, Lareau et al. made the exciting discovery that ribosome profiling on cells that have not been treated with any drug yields two classes of footprints, long (28&#x2013;30 nucleotides) and short (20&#x2013;22 nucleotides) (<xref ref-type="bibr" rid="bib34">Lareau et al., 2014</xref>). It is the long class that is seen in cycloheximide experiments, and which we have characterized above. The short (20&#x2013;22 nuc.) footprints seem to represent a different conformation of the ribosome, perhaps one that occurs when the ribosome translocates along the mRNA. Furthermore, Lareau et al. found that treatment of cells with the elongation inhibitor anisomycin efficiently generates short footprints. Lareau et al. suggest that the long and short footprints are reporting on two different states of translation (<xref ref-type="bibr" rid="bib34">Lareau et al., 2014</xref>).</p><p>We applied RRT analysis to the short footprints generated by Lareau et al., with special focus on the footprints after anisomycin treatment. All three of their anisomycin datasets were studied, and the pairwise correlations between the RRT results for these three datasets were very high, ranging from 0.89 to 0.998. Partial results are shown in <xref ref-type="fig" rid="fig7">Figure 7</xref> and <xref ref-type="table" rid="tbl4">Table 4</xref>, and complete results are shown in <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2</xref>. RRT analysis showed a series of peaks at different positions along the 7-codon footprint. The RRT values for the short footprints did not significantly correlate with RRT values for the long footprints, even when the phases of the footprints were shifted. This suggests, in agreement with Lareau et al., that the short and long footprints are indeed reporting on different translational processes. Furthermore, for the short footprints the RRT values are amino acid-specific, while for the long footprints at position 6, the RRT values are codon-specific (<xref ref-type="table" rid="tbl2">Table 2</xref>; <xref ref-type="table" rid="tbl4">Table 4</xref>; <xref ref-type="fig" rid="fig4">Figure 4</xref>, <xref ref-type="fig" rid="fig7">Figure 7</xref>, <xref ref-type="fig" rid="fig8">Figure 8</xref>). This again indicates that the two kinds of footprints are reporting on different translational processes. The amino acids in the peaks at positions 3, 5, and 6 are shown in <xref ref-type="table" rid="tbl4">Table 4</xref>: the peak at position 3 contains glycine; the peak at position 5 contains smallish hydrophobic amino acids (Leu, Val, Ile, and to some extent Phe), and the peak at position 6 is dominated by the two basic amino acids, Arg and Lys. It has previously been shown that basic amino acids can cause a pause in elongation by interacting with the ribosome exit tunnel (<xref ref-type="bibr" rid="bib40">Lu et al., 2007</xref>; <xref ref-type="bibr" rid="bib38">Lu and Deutsch, 2008</xref>; <xref ref-type="bibr" rid="bib7">Brandman et al., 2012</xref>; <xref ref-type="bibr" rid="bib61">Wu et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>). The basis of the anisomycin arrest is partly but not fully understood (<xref ref-type="bibr" rid="bib24">Hansen et al., 2003</xref>; <xref ref-type="bibr" rid="bib6">Blaha et al., 2008</xref>), and so it is difficult to clearly interpret these results (but see &#x2018;Discussion&#x2019;). Nevertheless, the application of RRT analysis to the anisomycin-generated footprints gives strong specific signals that are unlikely to be explained by a random process. We note, however, that results from the short footprints from untreated (no anisomycin) cells are only modestly correlated (0.23) with results from short footprints from the anisomycin-treated cells (data not shown).<fig id="fig7" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.012</object-id><label>Figure 7.</label><caption><title>RRT analysis of short footprints from anisomycin treatment.</title><p>The short, seven-codon footprints from anisomycin treatment (dataset 1b) from <xref ref-type="bibr" rid="bib34">Lareau et al. (2014)</xref> were analyzed for RRT. All 61 sense codons are shown; codons for selected amino acids are color-coded by amino acid. Position along the footprint is shown on the x-axis.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.012">http://dx.doi.org/10.7554/eLife.03735.012</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f007"/></fig><table-wrap id="tbl4" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.013</object-id><label>Table 4.</label><caption><p>Top 10 RRTs at positions 3 through 6 of the anisomycin-generated short footprints</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.013">http://dx.doi.org/10.7554/eLife.03735.013</ext-link></p></caption><table frame="hsides" rules="groups"><thead><tr><th>Pos 3</th><th>Pos 4</th><th>Pos 5</th><th>Pos 6</th></tr></thead><tbody><tr><td>Gly GGG 2.64</td><td>Pro CCC 2.36</td><td>Leu TTA 2.75</td><td>Arg CGA 3.72</td></tr><tr><td>Gly GGC 2.52</td><td>Pro CCA 2.34</td><td>Leu CTC 2.73</td><td>Arg CGG 3.50</td></tr><tr><td>Gly GGT 2.36</td><td>Met ATG 2.25</td><td>Val GTA 2.43</td><td>Pro CCG 2.74</td></tr><tr><td>Gly GGA 2.32</td><td>Pro CCT 2.17</td><td>Leu CTA 2.36</td><td>Lys AAA 2.59</td></tr><tr><td>Asp GAC 1.80</td><td>Ala GCC 2.13</td><td>Leu TTG 2.29</td><td>Lys AAG 2.49</td></tr><tr><td>Ala GCC 1.79</td><td>Phe TTC 2.03</td><td>Val GTG 2.21</td><td>Arg CGC 2.46</td></tr><tr><td>Ala GCA 1.70</td><td>Ala GCA 2.01</td><td>Leu CTT 2.16</td><td>Arg CGT 2.34</td></tr><tr><td>Ala GCT 1.65</td><td>Ala GCT 1.98</td><td>Val GTC 2.12</td><td>Arg AGG 2.32</td></tr><tr><td>Ala GCG 1.59</td><td>Tyr TAC 1.98</td><td>Val GTT 2.11</td><td>Arg AGA 2.21</td></tr><tr><td>Blu GAG 1.58</td><td>Ser TCC 1.97</td><td>Ile ATA 2.03</td><td>Asp GAT 2.12</td></tr></tbody></table></table-wrap><fig id="fig8" position="float"><object-id pub-id-type="doi">10.7554/eLife.03735.014</object-id><label>Figure 8.</label><caption><title>Short footprints are amino acid-specific; long footprints are codon-specific.</title><p>For the set of codons corresponding to each amino acid (x-axis), a test was done to see if all the codons behaved similarly or not. For the short footprints (left, panel <bold>A</bold>), p-values (y-axis) are generally small, showing that each codon for a particular amino acid behaves similarly (&#x2018;Materials and methods&#x2019;). For the long footprints (right, panel <bold>B</bold>), p-values are generally large, showing that the codons for each particular amino acid behave differently (&#x2018;Materials and methods&#x2019;).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.014">http://dx.doi.org/10.7554/eLife.03735.014</ext-link></p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="elife03735f008"/></fig></p><p>It appeared that the RRT values at position 6 for the long footprints were codon-specific (<xref ref-type="fig" rid="fig4">Figure 4</xref>, <xref ref-type="table" rid="tbl2">Table 2</xref>), while the RRT values for the short footprints were amino acid-specific (<xref ref-type="fig" rid="fig7">Figure 7</xref>, <xref ref-type="table" rid="tbl4">Table 4</xref>). To confirm this, we developed a statistical test for the coherence of the results for a particular amino acid (&#x2018;Materials and methods&#x2019;). Briefly, this method tests whether every codon for a particular amino acid behaves similarly, and it yields a small p-value if it does. Indeed, this analysis confirms that the short footprints give results specific to the amino acid, while the long footprints generally do not (i.e., the long footprints are codon-specific) (<xref ref-type="fig" rid="fig8">Figure 8</xref>). This suggests that the long footprints are reporting on the process of decoding (which depends on specific codons), while the short footprints are reporting on events after decoding.</p></sec></sec><sec sec-type="discussion" id="s3"><title>Discussion</title><p>To our knowledge, this is the first measurement of the differential rate of translation of all 61 codons in vivo. There is a correlation between a high codon usage and a high rate of decoding. Although this is a correlation that has been widely expected, there has been little evidence for it; indeed, the most recent experiments suggested that all codons were decoded at the same rate (<xref ref-type="bibr" rid="bib50">Qian et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>). Some workers have had other expectations for decoding rates. For instance, an important theory was that the more common codons were common because their translation might be more accurate (<xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>) (and this still might be correct).</p><p>Translation is optimized for both speed and accuracy (<xref ref-type="bibr" rid="bib5">Bieling et al., 2006</xref>). During translation, the ribosome must sample many incorrect tRNAs at the A-site before finding a correct tRNA. It must match the anti-codon of that correct tRNA with the codon; after such matching, there is a conformational change around the codon&#x2013;anticodon interaction at the decoding center (<xref ref-type="bibr" rid="bib14">Demeshkina et al., 2012</xref>; <xref ref-type="bibr" rid="bib62">Zeng et al., 2014</xref>). The ribosome must form the peptide bond (<xref ref-type="bibr" rid="bib53">Rodnina, 2013</xref>; <xref ref-type="bibr" rid="bib49">Polikanov et al., 2014</xref>), translocate (<xref ref-type="bibr" rid="bib55">Semenkov et al., 2000</xref>; <xref ref-type="bibr" rid="bib32">Khade and Joseph, 2011</xref>; <xref ref-type="bibr" rid="bib63">Zhou et al., 2014</xref>), and eject the empty tRNA. The nascent peptide must make its way through the ribosome exit tunnel (<xref ref-type="bibr" rid="bib38">Lu and Deutsch, 2008</xref>; <xref ref-type="bibr" rid="bib47">Petrone et al., 2008</xref>; <xref ref-type="bibr" rid="bib39">Lu et al., 2011</xref>; <xref ref-type="bibr" rid="bib59">Wilson and Beckmann, 2011</xref>). Depending on the rate of each of these events, the concentration of the various tRNAs might or might not have a detectable effect on the overall rate of translation. Our findings that (i) the more frequent codons (i.e., the ones with the highest tRNA concentrations) are decoded rapidly; and (ii) GC-rich codons are decoded slowly; and (iii) proline is slow in the P-site, suggest that there are at least three processes that happen somewhat slowly and on a similar timescale. The high rate of decoding for high concentration tRNAs may reflect the relatively short time it takes for the ribosome to find a high-concentration correct tRNA among many incorrect tRNAs. The fact that we detect proline-specific delays of a similar magnitude to the rare-codon specific delays suggest that peptide bond formation and identification of the correct tRNA are happening on similar time scales. In general, this is what one might expect from the evolution of such an important process as protein synthesis&#x2014;if one process was entirely rate-limiting, there would be very strong selection for greater speed in that process, until a point is reached where it &#x2018;catches up&#x2019; with other processes, and several processes together are then rate-limiting.</p><p>Even though these data establish that common codons are translated relatively rapidly, this does not on its own explain the success of codon optimization for increasing protein expression, since the rate of translation is primarily limited by the rate of initiation, not elongation (<xref ref-type="bibr" rid="bib1">Andersson and Kurland, 1990</xref>; <xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>) (although one recent study identifies a mechanism whereby rapid elongation causes rapid initiation [<xref ref-type="bibr" rid="bib12">Chu et al., 2014</xref>]). Nevertheless, on a genome-wide (and not gene-specific) scale, the use of faster codons would mean that a given genomic set of mRNAs would require (or titrate out) fewer ribosomes to make a given amount of protein than the same set of mRNAs using slower codons (<xref ref-type="bibr" rid="bib1">Andersson and Kurland, 1990</xref>; <xref ref-type="bibr" rid="bib48">Plotkin and Kudla, 2011</xref>). Based on our RRT measurements, and taking into account the different copy numbers of different mRNAs (<xref ref-type="bibr" rid="bib37">Lipson et al., 2009</xref>), we roughly estimate that yeast requires about 5% fewer ribosomes than if they were to make protein at the same overall rate but using each synonymous codon at an equal frequency (&#x2018;Materials and methods&#x2019;). This provides at least a sufficient reason for the bias towards faster synonymous codons.</p><p>We applied RRT analysis to the short footprints identified by Lareau et al. (<xref ref-type="fig" rid="fig7">Figure 7</xref>). These short footprints seem to report on a different translational process than the long footprints seen in cycloheximide experiments. We see that the basic amino acids Arg and Lys are slow at position 6; small hydrophobic amino acids are slow at position 5; and glycine is slow at position 3. While we know too little about the nature of the short footprints to reliably interpret these results, one speculative possibility is that the results report on the interaction of amino acids in the nascent peptide chain with the exit tunnel of the ribosome (<xref ref-type="bibr" rid="bib52">Raue et al., 2007</xref>; <xref ref-type="bibr" rid="bib47">Petrone et al., 2008</xref>; <xref ref-type="bibr" rid="bib3">Berndt et al., 2009</xref>; <xref ref-type="bibr" rid="bib4">Bhushan et al., 2010</xref>; <xref ref-type="bibr" rid="bib39">Lu et al., 2011</xref>; <xref ref-type="bibr" rid="bib59">Wilson and Beckmann, 2011</xref>; <xref ref-type="bibr" rid="bib22">Gumbart et al., 2012</xref>). We find Arg and Lys slow at position 6, and this correlates with the fact that these basic amino acids cause a pause by interacting with the exit tunnel (<xref ref-type="bibr" rid="bib40">Lu et al., 2007</xref>; <xref ref-type="bibr" rid="bib38">Lu and Deutsch, 2008</xref>; <xref ref-type="bibr" rid="bib7">Brandman et al., 2012</xref>; <xref ref-type="bibr" rid="bib61">Wu et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Charneski and Hurst, 2013</xref>). This would then suggest that small hydrophobic amino acids, and then glycine, might similarly cause pauses by interacting with positions one or three amino acids further out in the exit tunnel.</p><p>In summary, we believe that RRT analysis is a sensitive high-resolution method that can characterize the interaction of codons and amino acids with the ribosome. It can be applied to ribosome profiling data of many types, from many organisms. In this study, we show that frequent codons are decoded more quickly than rare codons; that codons high in AT are decoded somewhat quickly; that proline forms peptide bonds slowly; and that short footprints from anisomycin treated cells have an interesting RRT profile that may reflect interaction of amino acids with the ribosome exit tunnel.</p></sec><sec sec-type="materials|methods" id="s4"><title>Materials and methods</title><p>Experiments were done with yeast strain background BY4741. Ribosome profiling was based on the method of Ingolia (<xref ref-type="bibr" rid="bib28">Ingolia et al., 2009</xref>), but with modifications (see below). Programs for analysis of ribosome residence time were written by the authors, primarily RY and AY. The Perl code for ribosome residence time analysis is given in <xref ref-type="supplementary-material" rid="SD3-data SD4-data">Source code 1 and 2</xref>.</p><sec id="s4-1"><title>Ribosome profiling</title><p>Informatic analysis was conducted on four ribosome profiling experiments (YPD1, YPD2, SC-lys, and SC-his) done for other reasons in the Futcher lab. The strains and methods used varied slightly from experiment to experiment; nevertheless similar results were obtained for the RRT analysis (<xref ref-type="table" rid="tbl2">Table 2</xref>). The ribosome profiling experiments YPD1 and YPD2 have been reported previously (<xref ref-type="bibr" rid="bib10">Cai and Futcher, 2013</xref>) as the &#x2018;WT&#x2019; and &#x2018;whi3&#x2019; experiments, respectively.</p><p>All experiments used <italic>S. cerevisiae</italic> strain background BY4741. Two biologically independent ribosome-profiling libraries and mRNA-seq libraries were obtained from YPD rich media (the YPD1 and YPD2 experiments), and two biologically independent ribosome-profiling libraries and mRNA-seq libraries were prepared in synthetic media (the SC-lys and SC-his experiments). Two methods for harvesting cells were used. After harvesting and footprint size selection, footprints from all four experiments were processed identically into sequencing libraries using the ARTseq Yeast Ribosome Profiling kit, following the manufacture's instructions beginning with step B3 in the protocol.</p><sec id="s4-1-1"><title>Harvesting method 1 (YPD1 and YPD2 experiments)</title><p>1 liter of cells in YPD were grown to a density of 2.0 &#xd7; 10<sup>7</sup> cells/ml. Medium was cooled to 0&#xb0;C by adding ice (stored at &#x2212;20&#xb0;C) and simultaneously cycloheximide was added to a concentration of 100 &#xb5;g/ml to quickly halt translation and freeze translating ribosomes in place. Cells were centrifuged using a Sorvall Evolution RC centrifuge at 3000 rpm for 2 min at 4&#xb0;C. The resulting cell pellet was washed with ice-cold RNase-free water containing 100 &#xb5;g/ml cycloheximide by gentle vortexing and repelleted. Supernatant was aspirated, and cells were resuspended in polysome lysis buffer prepared according to the ARTseq ribosome profiling kit instructions. Cell lysis buffer slurry was slowly dripped into an RNase-free 50 ml conical tube containing liquid nitrogen. Resulting frozen pellets of cell slurry were lysed using a TissueLyser II and 50 ml grinding jars at liquid nitrogen temperature for six 3 min cycles at 15 hertz. Frozen cell lysate was scraped from the grinding jar into a new RNase-free 50 ml conical tube followed by reheating the slurry in a 30&#xb0;C water bath with constant swirling. Immediately after complete thawing (&#x223c;3&#x2013;5 min), cell lysate was centrifuged for 5 min at 3000&#xd7;<italic>g</italic>. Supernatant was moved to a 1.5 ml RNase-free centrifuge tube and centrifuged for 10 min at 20,000&#xd7;<italic>g</italic>. Clarified lysate total RNA content was estimated using a Nanodrop at A260 nm, and polysome complexes were digested using ARTseq ribonuclease mix according to the manufacture's instructions. Ribosome-protected mRNA footprints were purified using an Illustra Microspin S-400HR column prepared according to ARTseq manufacture's instructions. All following library generation steps were performed according to the ARTseq protocol starting at step 4 (PAGE purification). Following the end repair step in the protocol, a biotinylated oligonucleotide antisense to a specific rRNA fragment was used to reduce rRNA contamination using a protocol from the Jonathan Weissman lab (personal communication from Gloria Brar).</p></sec><sec id="s4-1-2"><title>Harvesting method 2 (SC-lys and SC-his experiments)</title><p>Synthetic media lacking lysine or lacking histidine was used to prepare 1 liter of cells at 2.0 &#xd7; 10<sup>7</sup> cells/ml. The strains were prototropic for Lys or His (<italic>HIS3</italic> gap1 frame1), respectively. Cells were harvested by vacuum filtration using Whatman 7184&#x2013;009 membrane filters at 30&#xb0;C. A liquid nitrogen cooled spatula was used to scrap cells from the membrane followed by immediate flash freezing in an RNase-free 50 ml conical tube containing liquid nitrogen. Special care was taken to ensure cells were exposed to air for as little time as possible, between vacuum filtration and flash freezing (2&#x2013;3 s), to prevent the loss of ribosome footprints at the 5&#x2032; ends of mRNAs (personal communication, Gloria Brar). ARTseq polysome lysis buffer containing cycloheximide at 50 &#xb5;g/ml was slowly dripped into the liquid nitrogen filled cell pellet conical tube. Cells were lysed using a TissueLyser II and 50 ml grinding jars at liquid nitrogen temperature for six 3 min cycles at 15 hertz. Frozen cell lysate was scraped from the grinding jar into a new RNase-free 50 ml conical tube followed by reheating the slurry in a 30&#xb0;C water bath with constant swirling. Immediately after complete thawing (&#x223c;3&#x2013;5 min), cell lysate was centrifuged for 5 min at 3000&#xd7;<italic>g</italic>. Supernatant was moved to a 1.5 ml RNase-free centrifuge tube and centrifuged for 10 min at 20,000&#xd7;<italic>g</italic>. Clarified lysate total RNA content was estimated using a Nanodrop at A260 nm, and polysome complexes were digested using ARTseq ribonuclease mix according to the manufacture's instructions.</p></sec><sec id="s4-1-3"><title>SC-lys Dataset</title><p>Digested monosomes were purified using sucrose cushion ultracentrifugation for 3 hr at 35,000 rpm using a SW-41 rotor. The sucrose cushion contained 9 ml of 10% sucrose polysome lysis buffer lacking triton detergent layered over 3 ml of 60% sucrose polysome lysis buffer lacking triton detergent. Gradient fractionation was carried out using a BioRad EM-1 UV absorbance monitor and a peristaltic pump. Efficiency of RNase digestion was monitored in tandem using an undigested control lysate on an identically prepared 10&#x2013;60% sucrose cushion and a digested control centrifuged on a 10&#x2013;60% sucrose gradient. Following fractionation, the monosome containing fraction was mixed 1:1 with 4 M guanidine thiocyanate and was precipitated overnight using a 1:1 vol of 100% isopropanol chilled to &#x2212;20&#xb0;C. The RNA pellet was aspirated and resuspended in 400 &#x3bc;l RNase-free water, and protein was removed by two acid phenol&#x2013;chloroform purifications followed by one chloroform purification. Recovered supernatant was brought to 0.3 M ammonium acetate and precipitated with 3 vol of 100% ethanol. All following library generation steps were performed according to the ARTseq protocol starting at step 4 (PAGE purification). Following the end repair step in the protocol, a biotinylated oligonucleotide antisense to a specific rRNA fragment was used to reduce rRNA contamination using a protocol from the Jonathan Weissman lab (personal communication Gloria Brar).</p></sec><sec id="s4-1-4"><title>SC-his Dataset</title><p>Digested monosomes were purified using an Illustra Microspin S-400HR column according to ARTseq manufacture's instruction. All following library generation steps were performed according to the ARTseq protocol starting at step 4 (PAGE purification). Following the end repair step in the protocol, a biotinylated oligonucleotide antisense to a specific rRNA fragment was used to reduce rRNA contamination using a protocol from the Jonathan Weissman lab (personal communication Gloria Brar).</p></sec></sec><sec id="s4-2"><title>Data analysis</title><p>Unless indicated, data processing and analysis were performed using a collection of custom programs written in Perl.</p></sec><sec id="s4-3"><title>Sequence processing and alignment</title><p>Primary data were generated using Illumina HiSeq2000. Data were processed using Fastq clipper from the FASTX Toolkit 0.0.13 to remove the adaptor sequence and all reads shorter than 25 nucleotides were discarded. Alignment to the reference was done using bowtie2 2.1.0 in local alignment mode.</p><p>Before performing our analysis on the <xref ref-type="bibr" rid="bib28">Ingolia et al. (2009)</xref> data, in order to adhere to the processing guidelines of that paper, we used bowtie 0.12.8, reporting all alignments with at most three mismatches, and a seed length of 21. We then processed the multiple alignments, removing the poly-A tails and picking the one with the greatest number of bases matching to the reference.</p></sec><sec id="s4-4"><title>Ribosome residence time analysis</title><p>This analysis uses the general idea that many different mRNA sequences should get an independent and equal vote on decoding speed. We opted to analyze select regions where the effects of codon usage become particularly easy to assay. First, we discounted all reads with more than two mismatches or quality less than 10. We identified the first in-frame codon of each read and discarded those less than 30 nucleotides long to exclude fragments that may have been over digested by RNAase I. We then examined the coding regions of the genome, ignoring those overlapping with other genes, rRNAs, and tRNAs, in order to maximize our confidence in unique mapping. Each of the footprint reads that fully fit into a coding region that it aligned to was considered for further analysis.</p><p>For each particular codon, we identified all instances in our coding regions where this codon (say CTC) occurs uniquely within a window of 10 codons upstream and 10 codons downstream (i.e., a window of 19 codons with the target CTC in the center of the window). For footprints that are 10 codons long, there will be 10 classes of footprints where this particular CTC can appear&#x2014;position 1, position 2, ..., position 10. Thus, all footprints where the first codon of the footprint aligns to this particular CTC will belong to the position 1 class, all footprints where the second codon of the footprint aligns to this particular CTC will belong to position 2 class, etc.</p><p>In the absence of any codon preference of the ribosome, we would expect to see a uniform distribution of reads across these 10 classes. In general, the codon-positional preference is described by the relative frequency of reads in each of these classes. These relative frequency distributions can be fairly averaged over all target regions over all genes centered on a specific codon. This average we call the &#x2018;Ribosome Residence Time&#x2019; (RRT); it is intended as a statistical estimate of the relative time spent by the ribosome at a particular codon at a particular position. Typically we discuss the RRT at position 6 (the A-site), but we also discuss the RRT at position 5 (the P-site). Regions on highly expressed genes can be fairly compared with similar regions on genes with lower expression, because we are dealing with relative frequency distributions (i.e., percentage instead of read counts). Each region represents an independent trial of any positional preference of the given central codon. Averaging over the 100s or 1000s of occurrences on the genome provides for a statistically rigorous analysis.</p><p>Relative frequency distributions will only be representative if the observed number of reads in the window is high enough that no single position dominates the distribution. For this reason, we restricted our analysis to windows with at least 20 total reads with at least 3 non-empty classes.</p><p>The frequency distributions are not normally distributed; this is in part because the number of reads is limited, so many windows have zero footprints at many positions, so the mode of the distribution is often 0. Nevertheless we believe that the mean is a good summary statistic. Maximum values are less than 1, so the mean cannot be skewed by extremely high values. We have also calculated the RRTs using the median of the windows instead of the mean, but the results are almost indistinguishable. The Spearman rank correlation between the RRTs as calculated by the mean, and by the median, is 0.97, while the Kendall Tau correlation is 0.89.</p><p>For each codon, we obtain the two-tailed p-value by comparing the experimentally determined relative frequency to the distribution of 10,000 relative frequencies based on permuted results. For each of the 10,000 instances, for each considered window, we permute the footprint counts of the 10 position classes.</p><p>We performed our RRT analysis on the <xref ref-type="bibr" rid="bib28">Ingolia et al. (2009)</xref> data, with small modifications. We did not perform the checks of read quality and the number of mismatches, as this was taken care of in pre-processing steps (See Sequence Processing and Alignment). We also considered all reads with at least 24 nucleotides and performed our relative frequency calculations on the eight codons, because the majority of the reads were shorter than the reported size selection of RNA fragments &#x223c;27&#x2013;31 nucleotides in length.</p><p>The statistical significances shown in <xref ref-type="table" rid="tbl1">Table 1</xref> were obtained by constructing 10,000 simulated frequency distributions by randomly and independently permuting each region's frequency distribution prior to averaging. The rank of each observed positional peak among these simulated distributions established the p-value.</p></sec><sec id="s4-5"><title>Codon coherence analysis</title><p>We developed a p-value computation to assess whether the codons for a given amino acid behave similar to one another (i.e., are coherent) or not. Each codon's RRT values along the positions of a footprint may be considered as a k-dimensional vector, where k is the number of positions in the footprint (10 for long reads vs 7 for short reads). We consider the position in k-dimensional space of the end-point of this vector. For the set of synonymous codons for a particular amino acid, we consider the set of endpoints. For any given set of c such endpoints, we can compute the average pairwise distance d between them over all c(c-1)/2 pairs of points. If all codons for an amino acid behave similarly, then the endpoints are close together, and the distance d is relatively small, indicating codon coherence (amino-acid specific behavior), whereas if the various codons for a given amino acid behave differently (non-coherence, codon-specific behavior), then the distance d is relatively large.</p><p>To judge the sizes of these distances for a particular set of points, S, containing c codons (c ranges from 2 to 6) for a particular amino acid, we use a p-value. We construct 10,000 random samples of c codons drawn from the 61 possible sense codons. For each sample, we compute the average pairwise distance and compare this to the average pair distance of S. The rank of S in this distribution provides a p-value, which is significant if the vast bulk of random samples have greater pairwise distance than S. Results are shown in <xref ref-type="fig" rid="fig8">Figure 8</xref>.</p></sec><sec id="s4-6"><title>Estimates of ribosomes needed for differently-encoded transcriptomes</title><p>An mRNA encoding a given protein could use only the fastest codon for each amino acid or only the slowest or it could use a mixture. In each case, the mRNA would occupy, or titrate out, a different number of ribosomes. A transcriptome of mRNAs using only the slowest codons would require more ribosomes to make a given amount of total protein in a given time than a transcriptome of mRNAs using only the fastest codons. We roughly estimated the size of this effect for the range of codon decoding speeds we observed. We generated <italic>in silico</italic> a yeast transcriptome using only the fastest codon for each amino acid at position 6 (from <xref ref-type="table" rid="tbl1">Table 1</xref>) or only the slowest codon or a random mixture of codons. Furthermore, we weighted the abundance of each mRNA according to its actual abundance as measured by <xref ref-type="bibr" rid="bib37">Lipson et al. (2009)</xref>. We then compared the relative time required to translate each of these <italic>in silico</italic> transcriptomes by a set number of ribosomes based on the RRT values for each codon at position 5 and 6, and also assuming that the relevant delay is the delay at position 5 plus the delay at position 6 (since these two reactions must occur sequentially and not simultaneously before the ribosome can shift along the mRNA). In doing this, we noted that the RRT values for position 5 are negatively correlated with those at position 6. Results are as follows: the random encoding requires 1.050 as long as WT; the slowest encoding requires 1.168 as long as WT; and the fastest encoding requires 0.930 as long as WT. Note that this estimate uses the simplification that each species of mRNA will initiate translation at the same rate. A more accurate calculation in which the more abundant mRNAs initiate more rapidly than average would increase the difference between the WT and the random encodings.</p></sec><sec id="s4-7"><title>Note added in proof</title><p>When the accepted manuscript was published, RRT values from an earlier version of the algorithm were erroneously used for <xref ref-type="fig" rid="fig5">Figure 5</xref> (but not for other figures), giving a correlation of &#x2013;0.7 between RRT and codon usage. The current algorithm, used here, gives a corrected version of <xref ref-type="fig" rid="fig5">Figure 5</xref>, shown here, with a correlation of &#x2013;0.52.</p></sec></sec></body><back><ack id="ack"><title>Acknowledgements</title><p>We thank J Weissman and G Brar for their generosity in helping us learn ribosome profiling and for providing protocols and advice. Three anonymous reviewers provided insightful comments that greatly improved the final manuscript. This work was supported by NIH grant R01 GM098400 to BF and NSF grants DBI-1060572 and IIS-1017181 to SS.</p></ack><sec sec-type="additional-information"><title>Additional information</title><fn-group content-type="competing-interest"><title>Competing interests</title><fn fn-type="conflict" id="conf1"><p>The authors declare that no competing interests exist.</p></fn></fn-group><fn-group content-type="author-contribution"><title>Author contributions</title><fn fn-type="con" id="con1"><p>JG, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con2"><p>RY, Wrote code., Conception and design, Analysis and interpretation of data</p></fn><fn fn-type="con" id="con3"><p>AY, Wrote code., Conception and design, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con4"><p>YC, Acquisition of data</p></fn><fn fn-type="con" id="con5"><p>SS, Designed algorithm., Conception and design, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con6"><p>BF, Conception and design, Analysis and interpretation of data, Drafting or revising the article</p></fn></fn-group></sec><sec sec-type="supplementary-material"><title>Additional files</title><supplementary-material id="SD1-data"><object-id pub-id-type="doi">10.7554/eLife.03735.015</object-id><label>Supplementary file 1.</label><caption><p>Complete Ribosome Residence Times for each codon at each of the 10 possible codon positions in a 30 nt (or, for Ingolia data, 24 nt) ribosome footprint. Each Excel spreadsheet is based on data from an independent biological experiment. Four of these experiments were done during the course of this work, two experiments by JG and two experiments by YC, while the fifth experiment was published by <xref ref-type="bibr" rid="bib28">Ingolia et al. (2009)</xref>. (<bold>A</bold>) Ribosome Residence Time analysis for all codons from the SC-lys expt. (<bold>B</bold>) Ribosome Residence Time analysis from the YPD1(WT) expt. (<bold>C</bold>) Ribosome Residence Time analysis from the YPD2 (whi3) expt. (<bold>D</bold>) Ribosome Residence Time analysis from the SC-his expt. (<bold>E</bold>) Ribosome Residence Time analysis from the Ingolia expt.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.015">http://dx.doi.org/10.7554/eLife.03735.015</ext-link></p></caption><media xlink:href="elife03735s001.xls" mimetype="application" mime-subtype="xls"/></supplementary-material><supplementary-material id="SD2-data"><object-id pub-id-type="doi">10.7554/eLife.03735.016</object-id><label>Supplementary file 2.</label><caption><p>Complete Ribosome Residence Times for each codon at each of the 7 possible codon positions in a 21 nt ribosome footprint. Each Table is based on one of the three anisomycin datasets of <xref ref-type="bibr" rid="bib34">Lareau et al. (2014)</xref>. (<bold>A</bold>) RRT for short footprints; aniso2 dataset. (<bold>B</bold>) RRT for short footprints; aniso1B dataset. (<bold>C</bold>) RRT for short footprints; aniso1A dataset.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.016">http://dx.doi.org/10.7554/eLife.03735.016</ext-link></p></caption><media xlink:href="elife03735s002.xls" mimetype="application" mime-subtype="xls"/></supplementary-material><supplementary-material id="SD3-data"><object-id pub-id-type="doi">10.7554/eLife.03735.017</object-id><label>Source code 1.</label><caption><p>Source code 1 is a plain text file containing stage 1 of the Perl code for Ribosome Residence Time analysis.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.017">http://dx.doi.org/10.7554/eLife.03735.017</ext-link></p></caption><media xlink:href="elife03735s003.txt" mimetype="text" mime-subtype="txt"/></supplementary-material><supplementary-material id="SD4-data"><object-id pub-id-type="doi">10.7554/eLife.03735.018</object-id><label>Source code 2.</label><caption><p>Source code 2 is a plain text file containing stage 2 of the Perl code for Ribosome Residence Time analysis.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.03735.018">http://dx.doi.org/10.7554/eLife.03735.018</ext-link></p></caption><media xlink:href="elife03735s004.txt" mimetype="text" mime-subtype="txt"/></supplementary-material><sec sec-type="datasets"><title>Major datasets</title><p>The following dataset was generated:</p><p><related-object content-type="generated-dataset" id="dataro1"><collab collab-type="author">Gardin</collab>, <etal/>, <year>2014</year><x>, </x><source>Measurement of average decoding rates of the 61 sense codons in vivo</source><x>, </x><object-id pub-id-type="art-access-id">SRP044053</object-id><x>; </x><comment>NCBI SRA database.</comment></related-object></p><p>The following previously published dataset was used:</p><p><related-object content-type="existing-dataset" source-id="http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE51164" source-id-type="uri" id="dataro2"><collab collab-type="author">Ying Cai</collab>, <year>2013</year><x>, </x><source>Ribosome profiling of whi3 mutant yeast</source><x>, </x><ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE51164">www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc&#x3d;GSE51164</ext-link><x>, </x><comment>Publicly available at NCBI Gene Expression Omnibus.</comment></related-object></p></sec></sec><ref-list><title>References</title><ref id="bib1"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Andersson</surname><given-names>SG</given-names></name><name><surname>Kurland</surname><given-names>CG</given-names></name></person-group><year>1990</year><article-title>Codon preferences in free-living microorganisms</article-title><source>Microbiological Reviews</source><volume>54</volume><fpage>198</fpage><lpage>210</lpage></element-citation></ref><ref id="bib2"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bennetzen</surname><given-names>JL</given-names></name><name><surname>Hall</surname><given-names>BD</given-names></name></person-group><year>1982</year><article-title>Codon selection in yeast</article-title><source>The Journal of Biological Chemistry</source><volume>257</volume><fpage>3026</fpage><lpage>3031</lpage></element-citation></ref><ref id="bib3"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Berndt</surname><given-names>U</given-names></name><name><surname>Oellerer</surname><given-names>S</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name><name><surname>Johnson</surname><given-names>AE</given-names></name><name><surname>Rospert</surname><given-names>S</given-names></name></person-group><year>2009</year><article-title>A signal-anchor sequence stimulates signal recognition particle binding to ribosomes from inside the exit tunnel</article-title><source>Proceedings of the National Academy of Sciences of USA</source><volume>106</volume><fpage>1398</fpage><lpage>1403</lpage><pub-id pub-id-type="doi">10.1073/pnas.0808584106</pub-id></element-citation></ref><ref id="bib4"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bhushan</surname><given-names>S</given-names></name><name><surname>Gartmann</surname><given-names>M</given-names></name><name><surname>Halic</surname><given-names>M</given-names></name><name><surname>Armache</surname><given-names>JP</given-names></name><name><surname>Jarasch</surname><given-names>A</given-names></name><name><surname>Mielke</surname><given-names>T</given-names></name><name><surname>Berninghausen</surname><given-names>O</given-names></name><name><surname>Wilson</surname><given-names>DN</given-names></name><name><surname>Beckmann</surname><given-names>R</given-names></name></person-group><year>2010</year><article-title>alpha-Helical nascent polypeptide chains visualized within distinct regions of the ribosomal exit tunnel</article-title><source>Nature Structural &#x26; Molecular Biology</source><volume>17</volume><fpage>313</fpage><lpage>317</lpage><pub-id pub-id-type="doi">10.1038/nsmb.1756</pub-id></element-citation></ref><ref id="bib5"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bieling</surname><given-names>P</given-names></name><name><surname>Beringer</surname><given-names>M</given-names></name><name><surname>Adio</surname><given-names>S</given-names></name><name><surname>Rodnina</surname><given-names>MV</given-names></name></person-group><year>2006</year><article-title>Peptide bond formation does not involve acid-base catalysis by ribosomal residues</article-title><source>Nature Structural &#x26; Molecular Biology</source><volume>13</volume><fpage>423</fpage><lpage>428</lpage><pub-id pub-id-type="doi">10.1038/nsmb1091</pub-id></element-citation></ref><ref id="bib6"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Blaha</surname><given-names>G</given-names></name><name><surname>Gurel</surname><given-names>G</given-names></name><name><surname>Schroeder</surname><given-names>SJ</given-names></name><name><surname>Moore</surname><given-names>PB</given-names></name><name><surname>Steitz</surname><given-names>TA</given-names></name></person-group><year>2008</year><article-title>Mutations outside the anisomycin-binding site can make ribosomes drug-resistant</article-title><source>Journal of Molecular Biology</source><volume>379</volume><fpage>505</fpage><lpage>519</lpage><pub-id pub-id-type="doi">10.1016/j.jmb.2008.03.075</pub-id></element-citation></ref><ref id="bib7"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brandman</surname><given-names>O</given-names></name><name><surname>Stewart-Ornstein</surname><given-names>J</given-names></name><name><surname>Wong</surname><given-names>D</given-names></name><name><surname>Larson</surname><given-names>A</given-names></name><name><surname>Williams</surname><given-names>CC</given-names></name><name><surname>Li</surname><given-names>GW</given-names></name><name><surname>Zhou</surname><given-names>S</given-names></name><name><surname>King</surname><given-names>D</given-names></name><name><surname>Shen</surname><given-names>PS</given-names></name><name><surname>Weibezahn</surname><given-names>J</given-names></name><name><surname>Dunn</surname><given-names>JG</given-names></name><name><surname>Rouskin</surname><given-names>S</given-names></name><name><surname>Inada</surname><given-names>T</given-names></name><name><surname>Frost</surname><given-names>A</given-names></name><name><surname>Weissman</surname><given-names>JS</given-names></name></person-group><year>2012</year><article-title>A ribosome-bound quality control complex triggers degradation of nascent peptides and signals translation stress</article-title><source>Cell</source><volume>151</volume><fpage>1042</fpage><lpage>1054</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2012.10.044</pub-id></element-citation></ref><ref id="bib8"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bulmer</surname><given-names>M</given-names></name></person-group><year>1987</year><article-title>Coevolution of codon usage and transfer RNA abundance</article-title><source>Nature</source><volume>325</volume><fpage>728</fpage><lpage>730</lpage><pub-id pub-id-type="doi">10.1038/325728a0</pub-id></element-citation></ref><ref id="bib9"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Burgess-Brown</surname><given-names>NA</given-names></name><name><surname>Sharma</surname><given-names>S</given-names></name><name><surname>Sobott</surname><given-names>F</given-names></name><name><surname>Loenarz</surname><given-names>C</given-names></name><name><surname>Oppermann</surname><given-names>U</given-names></name><name><surname>Gileadi</surname><given-names>O</given-names></name></person-group><year>2008</year><article-title>Codon optimization can improve expression of human genes in <italic>Escherichia coli</italic>: a multi-gene study</article-title><source>Protein Expression and Purification</source><volume>59</volume><fpage>94</fpage><lpage>102</lpage><pub-id pub-id-type="doi">10.1016/j.pep.2008.01.008</pub-id></element-citation></ref><ref id="bib10"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname><given-names>Y</given-names></name><name><surname>Futcher</surname><given-names>B</given-names></name></person-group><year>2013</year><article-title>Effects of the yeast RNA-binding protein Whi3 on the half-life and abundance of CLN3 mRNA and other targets</article-title><source>PLOS ONE</source><volume>8</volume><fpage>e84630</fpage><pub-id pub-id-type="doi">10.1371/journal.pone.0084630</pub-id></element-citation></ref><ref id="bib11"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Charneski</surname><given-names>CA</given-names></name><name><surname>Hurst</surname><given-names>LD</given-names></name></person-group><year>2013</year><article-title>Positively charged residues are the major determinants of ribosomal velocity</article-title><source>PLOS Biology</source><volume>11</volume><fpage>e1001508</fpage><pub-id pub-id-type="doi">10.1371/journal.pbio.1001508</pub-id></element-citation></ref><ref id="bib12"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chu</surname><given-names>D</given-names></name><name><surname>Kazana</surname><given-names>E</given-names></name><name><surname>Bellanger</surname><given-names>N</given-names></name><name><surname>Singh</surname><given-names>T</given-names></name><name><surname>Tuite</surname><given-names>MF</given-names></name><name><surname>von der Haar</surname><given-names>T</given-names></name></person-group><year>2014</year><article-title>Translation elongation can control translation initiation on eukaryotic mRNAs</article-title><source>The EMBO Journal</source><volume>33</volume><fpage>21</fpage><lpage>34</lpage><pub-id pub-id-type="doi">10.1002/embj.201385651</pub-id></element-citation></ref><ref id="bib13"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Daviter</surname><given-names>T</given-names></name><name><surname>Gromadski</surname><given-names>KB</given-names></name><name><surname>Rodnina</surname><given-names>MV</given-names></name></person-group><year>2006</year><article-title>The ribosome's response to codon-anticodon mismatches</article-title><source>Biochimie</source><volume>88</volume><fpage>1001</fpage><lpage>1011</lpage><pub-id pub-id-type="doi">10.1016/j.biochi.2006.04.013</pub-id></element-citation></ref><ref id="bib14"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Demeshkina</surname><given-names>N</given-names></name><name><surname>Jenner</surname><given-names>L</given-names></name><name><surname>Westhof</surname><given-names>E</given-names></name><name><surname>Yusupov</surname><given-names>M</given-names></name><name><surname>Yusupova</surname><given-names>G</given-names></name></person-group><year>2012</year><article-title>A new understanding of the decoding principle on the ribosome</article-title><source>Nature</source><volume>484</volume><fpage>256</fpage><lpage>259</lpage><pub-id pub-id-type="doi">10.1038/nature10913</pub-id></element-citation></ref><ref id="bib15"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Doerfel</surname><given-names>LK</given-names></name><name><surname>Wohlgemuth</surname><given-names>I</given-names></name><name><surname>Kothe</surname><given-names>C</given-names></name><name><surname>Peske</surname><given-names>F</given-names></name><name><surname>Urlaub</surname><given-names>H</given-names></name><name><surname>Rodnina</surname><given-names>MV</given-names></name></person-group><year>2013</year><article-title>EF-P is essential for rapid synthesis of proteins containing consecutive proline residues</article-title><source>Science</source><volume>339</volume><fpage>85</fpage><lpage>88</lpage><pub-id pub-id-type="doi">10.1126/science.1229017</pub-id></element-citation></ref><ref id="bib16"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dong</surname><given-names>H</given-names></name><name><surname>Nilsson</surname><given-names>L</given-names></name><name><surname>Kurland</surname><given-names>CG</given-names></name></person-group><year>1996</year><article-title>Co-variation of tRNA abundance and codon usage in <italic>Escherichia coli</italic> at different growth rates</article-title><source>Journal of Molecular Biology</source><volume>260</volume><fpage>649</fpage><lpage>663</lpage><pub-id pub-id-type="doi">10.1006/jmbi.1996.0428</pub-id></element-citation></ref><ref id="bib17"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Drummond</surname><given-names>DA</given-names></name><name><surname>Wilke</surname><given-names>CO</given-names></name></person-group><year>2008</year><article-title>Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution</article-title><source>Cell</source><volume>134</volume><fpage>341</fpage><lpage>352</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2008.05.042</pub-id></element-citation></ref><ref id="bib18"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fitch</surname><given-names>WM</given-names></name></person-group><year>1976</year><article-title>Is there selection against wobble in codon-anticodon pairing?</article-title><source>Science</source><volume>194</volume><fpage>1173</fpage><lpage>1174</lpage><pub-id pub-id-type="doi">10.1126/science.996548</pub-id></element-citation></ref><ref id="bib19"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Forster</surname><given-names>AC</given-names></name></person-group><year>2012</year><article-title>Synthetic biology challenges long-held hypotheses in translation, codon bias and transcription</article-title><source>Biotechnology Journal</source><volume>7</volume><fpage>835</fpage><lpage>845</lpage><pub-id pub-id-type="doi">10.1002/biot.201200002</pub-id></element-citation></ref><ref id="bib20"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Futcher</surname><given-names>B</given-names></name><name><surname>Latter</surname><given-names>GI</given-names></name><name><surname>Monardo</surname><given-names>P</given-names></name><name><surname>Mclaughlin</surname><given-names>CS</given-names></name><name><surname>Garrels</surname><given-names>JI</given-names></name></person-group><year>1999</year><article-title>A sampling of the yeast proteome</article-title><source>Molecular and Cellular Biology</source><volume>19</volume><fpage>7357</fpage><lpage>7368</lpage></element-citation></ref><ref id="bib21"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gromadski</surname><given-names>KB</given-names></name><name><surname>Daviter</surname><given-names>T</given-names></name><name><surname>Rodnina</surname><given-names>MV</given-names></name></person-group><year>2006</year><article-title>A uniform response to mismatches in codon-anticodon complexes ensures ribosomal fidelity</article-title><source>Molecular Cell</source><volume>21</volume><fpage>369</fpage><lpage>377</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2005.12.018</pub-id></element-citation></ref><ref id="bib22"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gumbart</surname><given-names>J</given-names></name><name><surname>Schreiner</surname><given-names>E</given-names></name><name><surname>Wilson</surname><given-names>DN</given-names></name><name><surname>Beckmann</surname><given-names>R</given-names></name><name><surname>Schulten</surname><given-names>K</given-names></name></person-group><year>2012</year><article-title>Mechanisms of SecM-mediated stalling in the ribosome</article-title><source>Biophysical Journal</source><volume>103</volume><fpage>331</fpage><lpage>341</lpage><pub-id pub-id-type="doi">10.1016/j.bpj.2012.06.005</pub-id></element-citation></ref><ref id="bib23"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gutierrez</surname><given-names>E</given-names></name><name><surname>Shin</surname><given-names>BS</given-names></name><name><surname>Woolstenhulme</surname><given-names>CJ</given-names></name><name><surname>Kim</surname><given-names>JR</given-names></name><name><surname>Saini</surname><given-names>P</given-names></name><name><surname>Buskirk</surname><given-names>AR</given-names></name><name><surname>Dever</surname><given-names>TE</given-names></name></person-group><year>2013</year><article-title>eIF5A promotes translation of polyproline motifs</article-title><source>Molecular Cell</source><volume>51</volume><fpage>35</fpage><lpage>45</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2013.04.021</pub-id></element-citation></ref><ref id="bib24"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hansen</surname><given-names>JL</given-names></name><name><surname>Moore</surname><given-names>PB</given-names></name><name><surname>Steitz</surname><given-names>TA</given-names></name></person-group><year>2003</year><article-title>Structures of five antibiotics bound at the peptidyl transferase center of the large ribosomal subunit</article-title><source>Journal of Molecular Biology</source><volume>330</volume><fpage>1061</fpage><lpage>1075</lpage><pub-id pub-id-type="doi">10.1016/S0022-2836(03)00668-5</pub-id></element-citation></ref><ref id="bib25"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hasegawa</surname><given-names>M</given-names></name><name><surname>Yasunaga</surname><given-names>T</given-names></name><name><surname>Miyata</surname><given-names>T</given-names></name></person-group><year>1979</year><article-title>Secondary structure of MS2 phage RNA and bias in code word usage</article-title><source>Nucleic Acids Research</source><volume>7</volume><fpage>2073</fpage><lpage>2079</lpage><pub-id pub-id-type="doi">10.1093/nar/7.7.2073</pub-id></element-citation></ref><ref id="bib26"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ikemura</surname><given-names>T</given-names></name></person-group><year>1981</year><article-title>Correlation between the abundance of <italic>Escherichia coli</italic> transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the <italic>E. coli</italic> translational system</article-title><source>Journal of Molecular Biology</source><volume>151</volume><fpage>389</fpage><lpage>409</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(81)90003-6</pub-id></element-citation></ref><ref id="bib27"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ikemura</surname><given-names>T</given-names></name></person-group><year>1982</year><article-title>Correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes. Differences in synonymous codon choice patterns of yeast and <italic>Escherichia coli</italic> with reference to the abundance of isoaccepting transfer RNAs</article-title><source>Journal of Molecular Biology</source><volume>158</volume><fpage>573</fpage><lpage>597</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(82)90250-9</pub-id></element-citation></ref><ref id="bib28"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ingolia</surname><given-names>NT</given-names></name><name><surname>Ghaemmaghami</surname><given-names>S</given-names></name><name><surname>Newman</surname><given-names>JR</given-names></name><name><surname>Weissman</surname><given-names>JS</given-names></name></person-group><year>2009</year><article-title>Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling</article-title><source>Science</source><volume>324</volume><fpage>218</fpage><lpage>223</lpage><pub-id pub-id-type="doi">10.1126/science.1168978</pub-id></element-citation></ref><ref id="bib29"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ingolia</surname><given-names>NT</given-names></name><name><surname>Lareau</surname><given-names>LF</given-names></name><name><surname>Weissman</surname><given-names>JS</given-names></name></person-group><year>2011</year><article-title>Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes</article-title><source>Cell</source><volume>147</volume><fpage>789</fpage><lpage>802</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2011.10.002</pub-id></element-citation></ref><ref id="bib30"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jackson</surname><given-names>TJ</given-names></name><name><surname>Spriggs</surname><given-names>RV</given-names></name><name><surname>Burgoyne</surname><given-names>NJ</given-names></name><name><surname>Jones</surname><given-names>C</given-names></name><name><surname>Willis</surname><given-names>AE</given-names></name></person-group><year>2014</year><article-title>Evaluating bias-reducing protocols for RNA sequencing library preparation</article-title><source>BMC Genomics</source><volume>15</volume><fpage>569</fpage><pub-id pub-id-type="doi">10.1186/1471-2164-15-569</pub-id></element-citation></ref><ref id="bib31"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Johansson</surname><given-names>M</given-names></name><name><surname>Ieong</surname><given-names>KW</given-names></name><name><surname>Trobro</surname><given-names>S</given-names></name><name><surname>Strazewski</surname><given-names>P</given-names></name><name><surname>Aqvist</surname><given-names>J</given-names></name><name><surname>Pavlov</surname><given-names>MY</given-names></name><name><surname>Ehrenberg</surname><given-names>M</given-names></name></person-group><year>2011</year><article-title>pH-sensitivity of the ribosomal peptidyl transfer reaction dependent on the identity of the A-site aminoacyl-tRNA</article-title><source>Proceedings of the National Academy of Sciences of USA</source><volume>108</volume><fpage>79</fpage><lpage>84</lpage><pub-id pub-id-type="doi">10.1073/pnas.1012612107</pub-id></element-citation></ref><ref id="bib32"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Khade</surname><given-names>PK</given-names></name><name><surname>Joseph</surname><given-names>S</given-names></name></person-group><year>2011</year><article-title>Messenger RNA interactions in the decoding center control the rate of translocation</article-title><source>Nature Structural &#x26; Molecular Biology</source><volume>18</volume><fpage>1300</fpage><lpage>1302</lpage><pub-id pub-id-type="doi">10.1038/nsmb.2140</pub-id></element-citation></ref><ref id="bib33"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lamm</surname><given-names>AT</given-names></name><name><surname>Stadler</surname><given-names>MR</given-names></name><name><surname>Zhang</surname><given-names>H</given-names></name><name><surname>Gent</surname><given-names>JI</given-names></name><name><surname>Fire</surname><given-names>AZ</given-names></name></person-group><year>2011</year><article-title>Multimodal RNA-seq using single-strand, double-strand, and CircLigase-based capture yields a refined and extended description of the <italic>C. elegans</italic> transcriptome</article-title><source>Genome Research</source><volume>21</volume><fpage>265</fpage><lpage>275</lpage><pub-id pub-id-type="doi">10.1101/gr.108845.110</pub-id></element-citation></ref><ref id="bib34"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lareau</surname><given-names>LF</given-names></name><name><surname>Hite</surname><given-names>DH</given-names></name><name><surname>Hogan</surname><given-names>GJ</given-names></name><name><surname>Brown</surname><given-names>PO</given-names></name></person-group><year>2014</year><article-title>Distinct stages of the translation elongation cycle revealed by sequencing ribosome-protected mRNA fragments</article-title><source>eLife</source><volume>3</volume><fpage>e01257</fpage><pub-id pub-id-type="doi">10.7554/eLife.01257</pub-id></element-citation></ref><ref id="bib35"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>GW</given-names></name><name><surname>Oh</surname><given-names>E</given-names></name><name><surname>Weissman</surname><given-names>JS</given-names></name></person-group><year>2012</year><article-title>The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria</article-title><source>Nature</source><volume>484</volume><fpage>538</fpage><lpage>541</lpage><pub-id pub-id-type="doi">10.1038/nature10965</pub-id></element-citation></ref><ref id="bib36"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lipman</surname><given-names>DJ</given-names></name><name><surname>Wilbur</surname><given-names>WJ</given-names></name></person-group><year>1983</year><article-title>Contextual constraints on synonymous codon choice</article-title><source>Journal of Molecular Biology</source><volume>163</volume><fpage>363</fpage><lpage>376</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(83)90063-3</pub-id></element-citation></ref><ref id="bib37"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lipson</surname><given-names>D</given-names></name><name><surname>Raz</surname><given-names>T</given-names></name><name><surname>Kieu</surname><given-names>A</given-names></name><name><surname>Jones</surname><given-names>DR</given-names></name><name><surname>Giladi</surname><given-names>E</given-names></name><name><surname>Thayer</surname><given-names>E</given-names></name><name><surname>Thompson</surname><given-names>JF</given-names></name><name><surname>Letovsky</surname><given-names>S</given-names></name><name><surname>Milos</surname><given-names>P</given-names></name><name><surname>Causey</surname><given-names>M</given-names></name></person-group><year>2009</year><article-title>Quantification of the yeast transcriptome by single-molecule sequencing</article-title><source>Nature Biotechnology</source><volume>27</volume><fpage>652</fpage><lpage>658</lpage><pub-id pub-id-type="doi">10.1038/nbt.1551</pub-id></element-citation></ref><ref id="bib38"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname><given-names>J</given-names></name><name><surname>Deutsch</surname><given-names>C</given-names></name></person-group><year>2008</year><article-title>Electrostatics in the ribosomal tunnel modulate chain elongation rates</article-title><source>Journal of Molecular Biology</source><volume>384</volume><fpage>73</fpage><lpage>86</lpage><pub-id pub-id-type="doi">10.1016/j.jmb.2008.08.089</pub-id></element-citation></ref><ref id="bib39"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname><given-names>J</given-names></name><name><surname>Hua</surname><given-names>Z</given-names></name><name><surname>Kobertz</surname><given-names>WR</given-names></name><name><surname>Deutsch</surname><given-names>C</given-names></name></person-group><year>2011</year><article-title>Nascent peptide side chains induce rearrangements in distinct locations of the ribosomal tunnel</article-title><source>Journal of Molecular Biology</source><volume>411</volume><fpage>499</fpage><lpage>510</lpage><pub-id pub-id-type="doi">10.1016/j.jmb.2011.05.038</pub-id></element-citation></ref><ref id="bib40"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname><given-names>J</given-names></name><name><surname>Kobertz</surname><given-names>WR</given-names></name><name><surname>Deutsch</surname><given-names>C</given-names></name></person-group><year>2007</year><article-title>Mapping the electrostatic potential within the ribosomal exit tunnel</article-title><source>Journal of Molecular Biology</source><volume>371</volume><fpage>1378</fpage><lpage>1391</lpage><pub-id pub-id-type="doi">10.1016/j.jmb.2007.06.038</pub-id></element-citation></ref><ref id="bib41"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Maertens</surname><given-names>B</given-names></name><name><surname>Spriestersbach</surname><given-names>A</given-names></name><name><surname>von Groll</surname><given-names>U</given-names></name><name><surname>Roth</surname><given-names>U</given-names></name><name><surname>Kubicek</surname><given-names>J</given-names></name><name><surname>Gerrits</surname><given-names>M</given-names></name><name><surname>Graf</surname><given-names>M</given-names></name><name><surname>Liss</surname><given-names>M</given-names></name><name><surname>Daubert</surname><given-names>D</given-names></name><name><surname>Wagner</surname><given-names>R</given-names></name><name><surname>Schafer</surname><given-names>F</given-names></name></person-group><year>2010</year><article-title>Gene optimization mechanisms: a multi-gene study reveals a high success rate of full-length human proteins expressed in <italic>Escherichia coli</italic></article-title><source>Protein Science</source><volume>19</volume><fpage>1312</fpage><lpage>1326</lpage><pub-id pub-id-type="doi">10.1002/pro.408</pub-id></element-citation></ref><ref id="bib42"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Miyata</surname><given-names>T</given-names></name><name><surname>Hayashida</surname><given-names>H</given-names></name><name><surname>Yasunaga</surname><given-names>T</given-names></name><name><surname>Hasegawa</surname><given-names>M</given-names></name></person-group><year>1979</year><article-title>The preferential codon usages in variable and constant regions of immunoglobulin genes are quite distinct from each other</article-title><source>Nucleic Acids Research</source><volume>7</volume><fpage>2431</fpage><lpage>2438</lpage><pub-id pub-id-type="doi">10.1093/nar/7.8.2431</pub-id></element-citation></ref><ref id="bib43"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Muto</surname><given-names>H</given-names></name><name><surname>Ito</surname><given-names>K</given-names></name></person-group><year>2008</year><article-title>Peptidyl-prolyl-tRNA at the ribosomal P-site reacts poorly with puromycin</article-title><source>Biochemical and Biophysical Research Communications</source><volume>366</volume><fpage>1043</fpage><lpage>1047</lpage><pub-id pub-id-type="doi">10.1016/j.bbrc.2007.12.072</pub-id></element-citation></ref><ref id="bib44"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Novoa</surname><given-names>EM</given-names></name><name><surname>Ribas de Pouplana</surname><given-names>L</given-names></name></person-group><year>2012</year><article-title>Speeding with control: codon usage, tRNAs, and ribosomes</article-title><source>Trends in Genetics</source><volume>28</volume><fpage>574</fpage><lpage>581</lpage><pub-id pub-id-type="doi">10.1016/j.tig.2012.07.006</pub-id></element-citation></ref><ref id="bib45"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pavlov</surname><given-names>MY</given-names></name><name><surname>Watts</surname><given-names>RE</given-names></name><name><surname>Tan</surname><given-names>Z</given-names></name><name><surname>Cornish</surname><given-names>VW</given-names></name><name><surname>Ehrenberg</surname><given-names>M</given-names></name><name><surname>Forster</surname><given-names>AC</given-names></name></person-group><year>2009</year><article-title>Slow peptide bond formation by proline and other N-alkylamino acids in translation</article-title><source>Proceedings of the National Academy of Sciences of USA</source><volume>106</volume><fpage>50</fpage><lpage>54</lpage><pub-id pub-id-type="doi">10.1073/pnas.0809211106</pub-id></element-citation></ref><ref id="bib46"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Peil</surname><given-names>L</given-names></name><name><surname>Starosta</surname><given-names>AL</given-names></name><name><surname>Lassak</surname><given-names>J</given-names></name><name><surname>Atkinson</surname><given-names>GC</given-names></name><name><surname>Virumae</surname><given-names>K</given-names></name><name><surname>Spitzer</surname><given-names>M</given-names></name><name><surname>Tenson</surname><given-names>T</given-names></name><name><surname>Jung</surname><given-names>K</given-names></name><name><surname>Remme</surname><given-names>J</given-names></name><name><surname>Wilson</surname><given-names>DN</given-names></name></person-group><year>2013</year><article-title>Distinct XPPX sequence motifs induce ribosome stalling, which is rescued by the translation elongation factor EF-P</article-title><source>Proceedings of the National Academy of Sciences of USA</source><volume>110</volume><fpage>15265</fpage><lpage>15270</lpage><pub-id pub-id-type="doi">10.1073/pnas.1310642110</pub-id></element-citation></ref><ref id="bib47"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Petrone</surname><given-names>PM</given-names></name><name><surname>Snow</surname><given-names>CD</given-names></name><name><surname>Lucent</surname><given-names>D</given-names></name><name><surname>Pande</surname><given-names>VS</given-names></name></person-group><year>2008</year><article-title>Side-chain recognition and gating in the ribosome exit tunnel</article-title><source>Proceedings of the National Academy of Sciences of USA</source><volume>105</volume><fpage>16549</fpage><lpage>16554</lpage><pub-id pub-id-type="doi">10.1073/pnas.0801795105</pub-id></element-citation></ref><ref id="bib48"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Plotkin</surname><given-names>JB</given-names></name><name><surname>Kudla</surname><given-names>G</given-names></name></person-group><year>2011</year><article-title>Synonymous but not the same: the causes and consequences of codon bias</article-title><source>Nature Reviews Genetics</source><volume>12</volume><fpage>32</fpage><lpage>42</lpage><pub-id pub-id-type="doi">10.1038/nrg2899</pub-id></element-citation></ref><ref id="bib49"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Polikanov</surname><given-names>YS</given-names></name><name><surname>Steitz</surname><given-names>TA</given-names></name><name><surname>Innis</surname><given-names>CA</given-names></name></person-group><year>2014</year><article-title>A proton wire to couple aminoacyl-tRNA accommodation and peptide-bond formation on the ribosome</article-title><source>Nature Structural &#x26; Molecular Biology</source><volume>21</volume><fpage>787</fpage><lpage>793</lpage><pub-id pub-id-type="doi">10.1038/nsmb.2871</pub-id></element-citation></ref><ref id="bib50"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Qian</surname><given-names>W</given-names></name><name><surname>Yang</surname><given-names>JR</given-names></name><name><surname>Pearson</surname><given-names>NM</given-names></name><name><surname>Maclean</surname><given-names>C</given-names></name><name><surname>Zhang</surname><given-names>J</given-names></name></person-group><year>2012</year><article-title>Balanced codon usage optimizes eukaryotic translational efficiency</article-title><source>PLOS Genetics</source><volume>8</volume><fpage>e1002603</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1002603</pub-id></element-citation></ref><ref id="bib51"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Raabe</surname><given-names>CA</given-names></name><name><surname>Tang</surname><given-names>TH</given-names></name><name><surname>Brosius</surname><given-names>J</given-names></name><name><surname>Rozhdestvensky</surname><given-names>TS</given-names></name></person-group><year>2014</year><article-title>Biases in small RNA deep sequencing data</article-title><source>Nucleic Acids Research</source><volume>42</volume><fpage>1414</fpage><lpage>1426</lpage><pub-id pub-id-type="doi">10.1093/nar/gkt1021</pub-id></element-citation></ref><ref id="bib52"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Raue</surname><given-names>U</given-names></name><name><surname>Oellerer</surname><given-names>S</given-names></name><name><surname>Rospert</surname><given-names>S</given-names></name></person-group><year>2007</year><article-title>Association of protein biogenesis factors at the yeast ribosomal tunnel exit is affected by the translational status and nascent polypeptide sequence</article-title><source>The Journal of Biological Chemistry</source><volume>282</volume><fpage>7809</fpage><lpage>7816</lpage><pub-id pub-id-type="doi">10.1074/jbc.M611436200</pub-id></element-citation></ref><ref id="bib53"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rodnina</surname><given-names>MV</given-names></name></person-group><year>2013</year><article-title>The ribosome as a versatile catalyst: reactions at the peptidyl transferase center</article-title><source>Current Opinion in Structural Biology</source><volume>23</volume><fpage>595</fpage><lpage>602</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2013.04.012</pub-id></element-citation></ref><ref id="bib54"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Roth</surname><given-names>AC</given-names></name></person-group><year>2012</year><article-title>Decoding properties of tRNA leave a detectable signal in codon usage bias</article-title><source>Bioinformatics</source><volume>28</volume><fpage>i340</fpage><lpage>i348</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/bts403</pub-id></element-citation></ref><ref id="bib55"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Semenkov</surname><given-names>YP</given-names></name><name><surname>Rodnina</surname><given-names>MV</given-names></name><name><surname>Wintermeyer</surname><given-names>W</given-names></name></person-group><year>2000</year><article-title>Energetic contribution of tRNA hybrid state formation to translocation catalysis on the ribosome</article-title><source>Nature Structural Biology</source><volume>7</volume><fpage>1027</fpage><lpage>1031</lpage><pub-id pub-id-type="doi">10.1038/80938</pub-id></element-citation></ref><ref id="bib56"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname><given-names>PM</given-names></name><name><surname>Li</surname><given-names>WH</given-names></name></person-group><year>1986</year><article-title>An evolutionary perspective on synonymous codon usage in unicellular organisms</article-title><source>Journal of Molecular Evolution</source><volume>24</volume><fpage>28</fpage><lpage>38</lpage><pub-id pub-id-type="doi">10.1007/BF02099948</pub-id></element-citation></ref><ref id="bib57"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tuller</surname><given-names>T</given-names></name><name><surname>Carmi</surname><given-names>A</given-names></name><name><surname>Vestsigian</surname><given-names>K</given-names></name><name><surname>Navon</surname><given-names>S</given-names></name><name><surname>Dorfan</surname><given-names>Y</given-names></name><name><surname>Zaborske</surname><given-names>J</given-names></name><name><surname>Pan</surname><given-names>T</given-names></name><name><surname>Dahan</surname><given-names>O</given-names></name><name><surname>Furman</surname><given-names>I</given-names></name><name><surname>Pilpel</surname><given-names>Y</given-names></name></person-group><year>2010</year><article-title>An evolutionarily conserved mechanism for controlling the efficiency of protein translation</article-title><source>Cell</source><volume>141</volume><fpage>344</fpage><lpage>354</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2010.03.031</pub-id></element-citation></ref><ref id="bib58"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ude</surname><given-names>S</given-names></name><name><surname>Lassak</surname><given-names>J</given-names></name><name><surname>Starosta</surname><given-names>AL</given-names></name><name><surname>Kraxenberger</surname><given-names>T</given-names></name><name><surname>Wilson</surname><given-names>DN</given-names></name><name><surname>Jung</surname><given-names>K</given-names></name></person-group><year>2013</year><article-title>Translation elongation factor EF-P alleviates ribosome stalling at polyproline stretches</article-title><source>Science</source><volume>339</volume><fpage>82</fpage><lpage>85</lpage><pub-id pub-id-type="doi">10.1126/science.1228985</pub-id></element-citation></ref><ref id="bib59"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wilson</surname><given-names>DN</given-names></name><name><surname>Beckmann</surname><given-names>R</given-names></name></person-group><year>2011</year><article-title>The ribosomal tunnel as a functional environment for nascent polypeptide folding and translational stalling</article-title><source>Current Opinion in Structural Biology</source><volume>21</volume><fpage>274</fpage><lpage>282</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2011.01.007</pub-id></element-citation></ref><ref id="bib60"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wohlgemuth</surname><given-names>I</given-names></name><name><surname>Brenner</surname><given-names>S</given-names></name><name><surname>Beringer</surname><given-names>M</given-names></name><name><surname>Rodnina</surname><given-names>MV</given-names></name></person-group><year>2008</year><article-title>Modulation of the rate of peptidyl transfer on the ribosome by the nature of substrates</article-title><source>The Journal of Biological Chemistry</source><volume>283</volume><fpage>32229</fpage><lpage>32235</lpage><pub-id pub-id-type="doi">10.1074/jbc.M805316200</pub-id></element-citation></ref><ref id="bib61"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>C</given-names></name><name><surname>Wei</surname><given-names>J</given-names></name><name><surname>Lin</surname><given-names>PJ</given-names></name><name><surname>Tu</surname><given-names>L</given-names></name><name><surname>Deutsch</surname><given-names>C</given-names></name><name><surname>Johnson</surname><given-names>AE</given-names></name><name><surname>Sachs</surname><given-names>MS</given-names></name></person-group><year>2012</year><article-title>Arginine changes the conformation of the arginine attenuator peptide relative to the ribosome tunnel</article-title><source>Journal of Molecular Biology</source><volume>416</volume><fpage>518</fpage><lpage>533</lpage><pub-id pub-id-type="doi">10.1016/j.jmb.2011.12.064</pub-id></element-citation></ref><ref id="bib62"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname><given-names>X</given-names></name><name><surname>Chugh</surname><given-names>J</given-names></name><name><surname>Casiano-Negroni</surname><given-names>A</given-names></name><name><surname>Al-Hashimi</surname><given-names>HM</given-names></name><name><surname>Brooks</surname><given-names>CL</given-names><suffix>III</suffix></name></person-group><year>2014</year><article-title>Flipping of the ribosomal A-Site adenines provides a basis for tRNA selection</article-title><source>Journal of Molecular Biology</source><volume>426</volume><fpage>3201</fpage><lpage>3213</lpage><pub-id pub-id-type="doi">10.1016/j.jmb.2014.04.029</pub-id></element-citation></ref><ref id="bib63"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>J</given-names></name><name><surname>Lancaster</surname><given-names>L</given-names></name><name><surname>Donohue</surname><given-names>JP</given-names></name><name><surname>Noller</surname><given-names>HF</given-names></name></person-group><year>2014</year><article-title>How the ribosome hands the A-site tRNA to the P site during EF-G-catalyzed translocation</article-title><source>Science</source><volume>345</volume><fpage>1188</fpage><lpage>1191</lpage><pub-id pub-id-type="doi">10.1126/science.1255030</pub-id></element-citation></ref></ref-list></back><sub-article article-type="article-commentary" id="SA1"><front-stub><article-id pub-id-type="doi">10.7554/eLife.03735.019</article-id><title-group><article-title>Decision letter</article-title></title-group><contrib-group content-type="section"><contrib contrib-type="editor"><name><surname>Sonenberg</surname><given-names>Nahum</given-names></name><role>Reviewing editor</role><aff><institution>McGill University</institution>, <country>Canada</country></aff></contrib></contrib-group></front-stub><body><boxed-text><p>eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see <ext-link ext-link-type="uri" xlink:href="http://elifesciences.org/review-process">review process</ext-link>). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.</p></boxed-text><p>Thank you for sending your work entitled &#x201c;Measurement of decoding rates of all individual codons in vivo&#x201d; for consideration at <italic>eLife</italic>. Your article has been favorably evaluated by Aviv Regev (Senior editor), a Reviewing editor, and 3 reviewers.</p><p>The Reviewing editor and the reviewers discussed their comments before we reached this decision, and the Reviewing editor has assembled the following comments to help you prepare a revised submission.</p><p>Your manuscript addresses the question of variation in average decoding time for all the tRNAs in the yeast <italic>Saccharomyces cerevisiae</italic>, by using ribosome footprinting at codon resolution. It describes a novel statistics (Ribosome Residence Times, RRT) that was used for four ribosome profiling datasets obtained from <italic>S. cerevisiae</italic>. You show that the RRTs correlate with codon usage and therefore suggest that RRT could be used to characterize the decoding rates of codons that have the same sequence.</p><p>Two of the reviewers, who are experts in protein synthesis, but not in statistics/bioinformatics, wrote favorable reviews on your work. However, the third reviewer who is more knowledgeable in statistics/bioinformatics was rather critical, as detailed below. A major appeal of your work is that you arrive at a different conclusion than previous work Qian et al and Charneski &#x26; Hurst, using ribosome-profiling data, which has concluded that there is little or no difference in the rates of decoding by tRNAs. The latter conclusions contradict a large body of previous work in genetics, molecular biology and biochemistry of translation, which clearly showed that there are significant differences in the rates of decoding. This naturally led to a significant amount of confusion in the field. You suggest that the abundant transient pausing on mRNAs, in the previous published study, caused by other effects, probably made it impossible to see the relatively more subtle differences in ribosome residence time derived from differences in codon recognition.</p><p>The major comments you need to address are as follows:</p><p>1) The authors suggest that AT rich codons are decoded more rapidly than GC rich codons but this is not clearly shown in the manuscript. At first this seemed counterintuitive but the authors suggest an interesting possibility that for GC rich codons incorrect tRNAs might dwell in the A site longer than they would at weaker codons and the aggregate of those non-productive interactions might increase the step time at these codons. The authors do not cite any biochemical studies that would support this conclusion and in fact they do not cite previous biochemical studies in the manuscript in many places where that would be appropriate.</p><p>2) The manuscript at many places simply states conclusions without providing any reference (where that would be appropriate) or argument for the conclusion. Substantiation of the conclusions must be included.</p><p>3) The major critique of the more critical reviewer is as follows: &#x201c;Two of the datasets used in the manuscript were previously published by the same group and two have not been published before, however the authors state that the other two were also obtained &#x201d;for other reasons in the Futcher lab&#x201c;. Therefore while the manuscript provides some newly generated experimental data, the primary focus is on computational analysis and in particular on RRT. How valuable and useful is RRT? This, unfortunately, is not very clear. The very fact that it correlates with codon usage merits further investigation as it may indeed provide potentially useful characteristic for decoding rates distributions of the codons with the same sequence. Unfortunately RRT is poorly characterized in the manuscript. Its relationship to the decoding is not explored beyond correlations with codon usage. Also the manuscript misleadingly treats RRT as a measure (rather than potentially related statistics) of codon decoding rates.&#x201d;</p><p>4) The title &#x201c;Measurement of decoding rates of all individual codons in vivo&#x201d; is highly misleading.</p><p>Lys codon AAA at a position i of mRNA x is not the same as Lys codon AAA at a position j of mRNA y. AAAix and AAAjy likely would have different decoding rates. It would be wonderful to be able to measure decoding rates of individual codons, but this is not the case here. RRT is a relative footprint density of individual codons within a window of 10 neighbor codons averaged over all codons that share the same sequence.</p><p>5) Before averaging, it makes sense to show that the average is a useful characteristic of the distribution. It would be the case if the distributions were normal or at least could be approximated to normal, as this might not be the case.</p><p>If the distributions are not normal, the authors may explore other descriptive statistics of the distribution (for example median) for their relationship with codon usage, tRNA gene copy number or tAI. But it is highly important to obtain descriptive statistics of the distributions first. Even if the distributions are normal, the dispersion of these distributions may not be the same.</p><p>6) The procedure for RRT may have a hidden relationship with the codon usage, thus explaining the observed correlation. This could be the case because of intrinsic non-randomness of codon sequences, which effect codon distributions within the 10-codon window. Say a codon X may appear more frequently in the windows centered around the codon Y than in the windows centered around the codon Z. To explore whether there is a hidden relationship to codon usage, the authors should assign experimental footprint densities to codons randomly. Then calculate RRTs and explore how obtained RRT values relate to the codon usage.</p><p>In addition to that it would make sense to carry out RRT calculations for naked mRNA controls and explore obtained distributions in a similar manner.</p><p>7) Does RRT really relate to codon decoding rates? Intuitively it should, but this needs to be shown. We know from a number of experiments, including ribosome profiling that concentration of tRNAs effect the decoding rates. There are publicly available datasets where particular aminoacylated tRNAs were depleted, see for example <xref ref-type="bibr" rid="bib34">Lareau et al 2014</xref> eLife [PMID: 24842990] for S. cerevisiae or <xref ref-type="bibr" rid="bib35">Li et al 2012</xref> Nature [PMID:22456704] for bacterial organisms. There might be more. The authors should calculate RRT for these datasets and characterize RRTs between these datasets.</p><p>8) If RRT is indeed a good statistics it should work well not only in yeast, but also in other organisms, thus it is important to carry similar analysis on other publicly available datasets. The authors did that for the very first ribosomal profiling dataset <xref ref-type="bibr" rid="bib28">Ingolia et al 2009</xref> dataset, which was generated with cycloheximide pretreatment. It also has a relatively poor coverage. For yeast it would be advisable to use <xref ref-type="bibr" rid="bib34">Lareau et al 2014</xref> Elife [PMID: 24842990] data.</p><p>RRT should also correlate with other ways of measuring elongation speed. For example, the authors could calculate RRT for <xref ref-type="bibr" rid="bib29">Ingolia et al 2011</xref> Cell [PMID: 22056041] and explore whether an aggregated RRTs for the codons of individual coding sequences could predict differences in the speed of ribosomes over individual mRNAs estimated with the pulse chase experiment described in this work.</p><p>9) The critical reviewer is skeptical regarding the use of ribosomal profiling data for estimating decoding rates for several reasons. The time that ribosome spends at a particular codon is only one of the factors affecting the number of footprints aligning to that codon. The others are sequence coverage (i), initiation rate for the start codon of corresponding ORF (ii) and concentration of corresponding mRNA (iii). These factors can be estimated (e.g. mRNA-seq can estimate relative mRNA levels) or the data could be normalized in the ways that would minimize the contribution of these factors, e.g. a footprint density at a particular location can be normalized over the cumulative density for the entire dataset or for individual mRNAs. Such normalization procedures aren&#x27;t perfect and may generate certain artifacts and Gardin et al do discuss it rightfully to some extent. However, there are other factors that are more difficult to take into account, such as the effect of antibiotics on capture of the ribosomes at particular locations or even at specific ribosomal conformations leading to differing length of footprints, see <xref ref-type="bibr" rid="bib34">Lareau et al 2014</xref> <italic>eLife</italic> [PMID: 24842990] for more information. The reviewer is also surprised and disappointed that this very important and highly relevant article is not mentioned in the manuscript.</p><p>The other factors effecting densities are those related to the biases of cDNA library preparation. Their presence can be easily seen in the data analyzed here with RRT as well, e.g. <xref ref-type="fig" rid="fig2">Figure 2B</xref>; codon 10 corresponds to the 5&#x27; ends of the footprints and show a variability comparable to that of position 6 (btw plotting 64 curves on the same diagram is not very effective, the authors should explore other statistics for measuring variability within a distribution). Most likely this variability is due to sequence specificity of RNAse cleavage and/or adapter ligation. The other factor that is highly relevant to measuring decoding rates is PCR amplification of cDNA libraries. PCR amplifies fragments non-linearly, and the ratio between a low abounded fragment and a highly abounded fragment would likely increase after PCR. To control for that the initial step of cDNA amplification should be carried out with RT primers containing random indexes. So that only sequences corresponding to unique ribosome protected fragments are counted. The reviewer understands that doing ribosome profiling this way would require a new experiment and he does not expect the authors doing it for this work, but the issue needs to be at least caveated.</p></body></sub-article><sub-article article-type="reply" id="SA2"><front-stub><article-id pub-id-type="doi">10.7554/eLife.03735.020</article-id><title-group><article-title>Author response</article-title></title-group></front-stub><body><p>There seemed to be two general concerns, and some specific concerns. The two general concerns were, first, that there were not enough control experiments validating the &#x201c;Ribosome Residence Time&#x201d; method; and second, that we did not deal with a relevant, recently&#x2010;published paper by Lareau et al.</p><p>With regard to the manuscript&#x2019;s lack of control experiments validating the approach, we agree. When we first conceived the approach, we tested and validated it several ways, but then, when it became clear that the method worked, we lost interest in validation, moved on to getting results, and neglected to put the validation experiments into the manuscript. This was a mistake. Readers need to know what the evidence is that the approach really works. Reviewer 3 asked for some specific experiments, which we had previously done, but didn&#x2019;t show. We now have a new Figure (new <xref ref-type="fig" rid="fig2">Figure 2</xref>) with four panels, and a new Table (<xref ref-type="table" rid="tbl1">Table 1</xref>) showing some of our validation experiments, and these include most of what reviewer 3 requested. We think these experiments are very convincing. Inclusion of this validation material makes the manuscript somewhat longer. If this is a critical concern, this material could be moved to the supplement.</p><p>The second major concern was that we did not deal with a highly&#x2010;relevant paper, Lareau et al., May, 2014. We had finished writing our manuscript in April, before the Lareau paper came out. For various reasons, we did not submit the manuscript to <italic>eLife</italic> until June, and in the meantime we did not follow the relevant literature as closely as we should have. We were not aware of the Lareau paper until the reviewers pointed it out. Of course, it is a highly relevant paper that needs to be addressed. We have addressed it in this revised manuscript in a fair amount of detail. The biggest impact is that the Lareau paper provides us with short footprints from anisomycin arrest, and we have applied our method to these short footprints. The new results obtained are in our opinion really interesting, and they are described in a new section of text, a new Figure, and a new Table. We agree with Lareau et al that these short footprints are reporting on a different translational event than the long footprints, so the analysis of the short footprints does not in any way conflict with any of our previous conclusions, but does give us significant new conclusions. We also think it is striking that, although we think the Lareau et al. analysis was very good, nevertheless our analysis got quite a bit more out of the short footprint data, showing the value of our new method. Finally, Lareau et al. got some long footprints without use of cycloheximide, and when we analyzed these by our methods, we got a reasonable correlation (0.47) with our results, which argue that cycloheximide is not introducing severe artefacts.</p><p><italic>1) The authors suggest that AT rich codons are decoded more rapidly than GC rich codons but this is not clearly shown in the manuscript. At first this seemed counterintuitive but the authors suggest an interesting possibility that for GC rich codons incorrect tRNAs might dwell in the A site longer than they would at weaker codons and the aggregate of those non-productive interactions might increase the step time at these codons. The authors do not cite any biochemical studies that would support this conclusion and in fact they do not cite previous biochemical studies in the manuscript in many places where that would be appropriate</italic>.</p><p>We have cited more biochemical studies, including four relevant to the AT vs GC codon issue. But we are hesitant to go too far down this path. There have been many biochemical studies, and they have come to all sorts of conclusions, often conflicting, and in some cases we are just unable to evaluate these studies. We cannot cite them all, and we are uncomfortable with picking through them and citing the ones that are, in hindsight, compatible with our conclusions. So, we are citing a few, including ones we think are highly relevant, and reviews.</p><p>We have also added further information on the relative RRTs of GC vs AT rich codons, and done a statistical test, which indeed shows a significant difference between the AT&#x2010;rich codons and the GC&#x2010;rich codons (p &#x3c; 0.003).</p><p><italic>2) The manuscript at many places simply states conclusions without providing any reference (where that would be appropriate) or argument for the conclusion. Substantiation of the conclusions must be included</italic>.</p><p>No examples were given, so we are not sure exactly what conclusions are being referred to. We have gone through the manuscript looking for such cases, and have tried hard to add citations, or otherwise provide a reason. If the revised manuscript still has this defect, we would be happy to provide further citations if the reviewers will point out the relevant statements.</p><p><italic>3) The major critique of the more critical reviewer is as follows: &#x201c;Two of the datasets used in the manuscript were previously published by the same group and two have not been published before, however the authors state that the other two were also obtained &#x201d;for other reasons in the Futcher lab&#x201c;. Therefore while the manuscript provides some newly generated experimental data, the primary focus is on computational analysis and in particular on RRT. How valuable and useful is RRT? This, unfortunately, is not very clear. The very fact that it correlates with codon usage merits further investigation as it may indeed provide potentially useful characteristic for decoding rates distributions of the codons with the same sequence. Unfortunately RRT is poorly characterized in the manuscript. Its relationship to the decoding is not explored beyond correlations with codon usage. Also the manuscript misleadingly treats RRT as a measure (rather than potentially related statistics) of codon decoding rates</italic>.<italic>&#x201d;</italic></p><p>Yes. As we said above, we had originally done validation experiments, but (wrongly) neglected to put them in the manuscript. The new <xref ref-type="fig" rid="fig2">Figure 2</xref> now shows the results of four of our validation experiments. There is a positive and a negative control with simulated data; of these, the negative control with simulated data is the experiment reviewer 3 requested. There is also a positive and a negative control with real data (positive was a serine starvation experiment; negative was the RRT analysis of RNA seq data (i.e., 30 bp RNA fragments, but no footprinting). These two were also essentially experiments the reviewer asked for. The reviewer also asked about analysis of the histidine starvation data in the Lareau paper as another positive control, and we analyzed these data and give the result in the text. We will not go through these experiments in detail here, as we hope they are clear in the revised manuscript. Additional points:</p><p><italic>4) The title &#x201c;Measurement of decoding rates of all individual codons in vivo&#x201d; is highly misleading</italic>.</p><p><italic>Lys codon AAA at a position i of mRNA x is not the same as Lys codon AAA at a position j of mRNA y. AAAix and AAAjy likely would have different decoding rates. It would be wonderful to be able to measure decoding rates of individual codons, but this is not the case here. RRT is a relative footprint density of individual codons within a window of 10 neighbor codons averaged over all codons that share the same sequence</italic>.</p><p>We have changed the title.</p><p><italic>5) Before averaging, it makes sense to show that the average is a useful characteristic of the distribution. It would be the case if the distributions were normal or at least could be approximated to normal, as this might not be the case</italic>.</p><p><italic>If the distributions are not normal, the authors may explore other descriptive statistics of the distribution (for example median) for their relationship with codon usage, tRNA gene copy number or tAI. But it is highly important to obtain descriptive statistics of the distributions first. Even if the distributions are normal, the dispersion of these distributions may not be the same</italic>.</p><p>Reviewer 3 asks about the distributions of the window frequencies, and whether the mean is a good summary statistic or not. This was an interesting question. We have now provided information on this point in Materials and methods. To summarize, the distributions are not normal. But the reason is, perhaps, innocuous: we required windows that have at least 20 reads, and three non&#x2010;zero positions. But still, quite often, a frequency at a particular position in a particular window is zero, because there are no reads at that position, and so the distributions often have a mode at zero. We think that as the number of reads goes up and up, and the number of positions with zero reads goes down, the distributions would approach normal. The distribution is not one that makes the mean an inappropriate statistic.</p><p>In any case, we explored summary statistics other than the mean. We repeated all the analysis using the median instead of the mean. As now reported in Materials and Methods, the Spearman rank correlation between the results using the median and the results using the mean is 0.97. That is, results are essentially identical. We had a conversation amongst ourselves as to whether the mean was slightly better, or the median was slightly better, but with a correlation of 0.97 it didn&#x2019;t really matter, and the mean contains slightly more information. So we have stayed with the mean. We feel that when two summary statistics, the mean and the median, give the same answer, the results must be robust.</p><p><italic>6) The procedure for RRT may have a hidden relationship with the codon usage, thus explaining the observed correlation. This could be the case because of intrinsic non-randomness of codon sequences, which effect codon distributions within the 10-codon window. Say a codon X may appear more frequently in the windows centered around the codon Y than in the windows centered around the codon Z. To explore whether there is a hidden relationship to codon usage, the authors should assign experimental footprint densities to codons randomly. Then calculate RRTs and explore how obtained RRT values relate to the codon usage</italic>.</p><p><italic>In addition to that it would make sense to carry out RRT calculations for naked mRNA controls and explore obtained distributions in a similar manner</italic>.</p><p>We went through the negative control procedure the reviewer suggests, and there is no signal; all the RRT values come out to essentially 1. This is what is shown in <xref ref-type="fig" rid="fig2">Figure 2A</xref>. But also, this procedure is implicit in our method for calculating p&#x2010;values for the RRT scores (<xref ref-type="table" rid="tbl2">Table 2</xref>, Methods and Materials). That is, the small p&#x2010;values imply that in the randomization experiment, there is not any strong signal. Also, as suggested, we did the RRT calculations for naked mRNA controls, and again there is no signal (except at the termini due to enzyme base specificity); this is shown in <xref ref-type="fig" rid="fig2">Figure 2C</xref>.</p><p><italic>7) Does RRT really relate to codon decoding rates? Intuitively it should, but this needs to be shown. We know from a number of experiments, including ribosome profiling that concentration of tRNAs effect the decoding rates. There are publicly available datasets where particular aminoacylated tRNAs were depleted, see for example</italic> <xref ref-type="bibr" rid="bib34"><italic>Lareau et al 2014</italic></xref> <italic>eLife [PMID: 24842990] for S. cerevisiae or</italic> <xref ref-type="bibr" rid="bib35"><italic>Li et al 2012</italic></xref> <italic>Nature [PMID:22456704] for bacterial organisms. There might be more. The authors should calculate RRT for these datasets and characterize RRTs between these datasets</italic>.</p><p>The reviewer suggests we look at the dataset for serine&#x2010;starved <italic>E. coli</italic>. We had previously done this but not shown it; it was one of our first tests of the method. The RRT analysis shows big peaks for the serine codons, and this is now shown in <xref ref-type="fig" rid="fig2">Figure 2D</xref> and the accompanying Table. We also looked, less successfully, at the Lareau data for histidine starvation. This is now described in the text. It was less successful because the Lareau dataset for histidine starvation was actually rather small, too small for RRT analysis as described here. Nevertheless, with relaxed quality filters, we did see good&#x2010;sized peaks for the two His codons in the 3&#x2010;AT treated cultures, but, importantly, not in the non&#x2010;starved cultures. We are not putting the figure in the paper, because it is not real RRT analysis, because we had to relax the quality filters to get enough windows.</p><p><italic>8) If RRT is indeed a good statistics it should work well not only in yeast, but also in other organisms, thus it is important to carry similar analysis on other publicly available datasets. The authors did that for the very first ribosomal profiling dataset</italic> <xref ref-type="bibr" rid="bib28"><italic>Ingolia et al 2009</italic></xref> <italic>dataset, which was generated with cycloheximide pretreatment. It also has a relatively poor coverage. For yeast it would be advisable to use</italic> <xref ref-type="bibr" rid="bib34"><italic>Lareau et al 2014</italic></xref> <italic>Elife [PMID: 24842990] data</italic>.</p><p><italic>RRT should also correlate with other ways of measuring elongation speed. For example, the authors could calculate RRT for</italic> <xref ref-type="bibr" rid="bib29"><italic>Ingolia et al 2011</italic></xref> <italic>Cell [PMID: 22056041] and explore whether an aggregated RRTs for the codons of individual coding sequences could predict differences in the speed of ribosomes over individual mRNAs estimated with the pulse chase experiment described in this work</italic>.</p><p>Yes, the method works well in other organisms. You now see some evidence of that here. We have analysed a lot of existing ribosome profiling data for various organisms from databases. Obviously we cannot put it all in this manuscript. Several quite interesting things have come out of this analysis, and we are in the early stages of planning additional manuscripts. However, we did apply the method to the Lareau et al cycloheximide datasets, and do now report the results here in this manuscript. The correlations with our results are on the low side; 0.2 to 0.5, but still, they are all positive correlations, and we strongly believe that they are relatively modest because Lareau, like Ingolia, add cycloheximide first, then grow the cells a bit, then harvest, whereas we flash&#x2010;freeze first, then add cycloheximide to the frozen cells.</p><p>The fact that it works well with data in databases from other organisms is of course a reason to publish the paper, so that others will have access to these methods and can also do this analysis.</p><p><italic>9) The critical reviewer is skeptical regarding the use of ribosomal profiling data for estimating decoding rates for several reasons. The time that ribosome spends at a particular codon is only one of the factors affecting the number of footprints aligning to that codon. The others are sequence coverage (i), initiation rate for the start codon of corresponding ORF (ii) and concentration of corresponding mRNA (iii). These factors can be estimated (e.g. mRNA-seq can estimate relative mRNA levels) or the data could be normalized in the ways that would minimize the contribution of these factors, e.g. a footprint density at a particular location can be normalized over the cumulative density for the entire dataset or for individual mRNAs. Such normalization procedures aren&#x27;t perfect and may generate certain artifacts and Gardin et al do discuss it rightfully to some extent. However, there are other factors that are more difficult to take into account, such as the effect of antibiotics on capture of the ribosomes at particular locations or even at specific ribosomal conformations leading to differing length of footprints, see</italic> <xref ref-type="bibr" rid="bib34"><italic>Lareau et al 2014</italic></xref> <italic>eLife [PMID: 24842990] for more information. The reviewer is also surprised and disappointed that this very important and highly relevant article is not mentioned in the manuscript</italic>.</p><p><italic>The other factors effecting densities are those related to the biases of cDNA library preparation. Their presence can be easily seen in the data analyzed here with RRT as well, e.g.</italic> <xref ref-type="fig" rid="fig2"><italic>Figure 2B</italic></xref><italic>; codon 10 corresponds to the 5&#x27; ends of the footprints and show a variability comparable to that of position 6 (btw plotting 64 curves on the same diagram is not very effective, the authors should explore other statistics for measuring variability within a distribution). Most likely this variability is due to sequence specificity of RNAse cleavage and/or adapter ligation. The other factor that is highly relevant to measuring decoding rates is PCR amplification of cDNA libraries. PCR amplifies fragments non-linearly, and the ratio between a low abounded fragment and a highly abounded fragment would likely increase after PCR. To control for that the initial step of cDNA amplification should be carried out with RT primers containing random indexes. So that only sequences corresponding to unique ribosome protected fragments are counted. The reviewer understands that doing ribosome profiling this way would require a new experiment and he does not expect the authors doing it for this work, but the issue needs to be at least caveated</italic>.</p><p>The thrust of point 9 seems to be that we should be able to get similar results by other methods, by estimating the sizes of various effects. Well, maybe. But we are not pursuing other methods; we are trying to describe this one. Also, as we say at the beginning of this manuscript, we think the estimates and guesses involved in making the calculations the reviewer suggests are problematic, and quite likely to lead to the wrong answer. That our approach by&#x2010;passes all this guessing and estimating is a lot of the point.</p><p>The reviewer also mentions the well&#x2010;known fact that PCR can create sampling artefacts. But if any method can defeat PCR sampling problems, it is this one, because we simply consider each and every window as an independent experiment, no matter the frequency of reads in that window. Relatively rare PCR sampling artefacts will affect one window at a time (out of thousands) and so have a negligible impact on our approach. The fact that we have correlations up to 0.96 between our experiments demonstrates that random noise such as introduced by PCR sampling artefacts cannot be a big issue.</p><p>Yes, we need to cite and write about Lareau et al. We are sorry to have missed this paper, and thank the reviewers for pointing it out. The revised manuscript talks extensively about the Lareau et al. results, which are really interesting.</p></body></sub-article></article>