Permalink
Switch branches/tags
Nothing to show
Find file
Fetching contributors…
Cannot retrieve contributors at this time
1 lines (1 sloc) 174 KB
<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.1d1 20130915//EN" "JATS-archivearticle1.dtd"><article article-type="research-article" dtd-version="1.1d1" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="nlm-ta">elife</journal-id><journal-id journal-id-type="hwp">eLife</journal-id><journal-id journal-id-type="publisher-id">eLife</journal-id><journal-title-group><journal-title>eLife</journal-title></journal-title-group><issn publication-format="electronic">2050-084X</issn><publisher><publisher-name>eLife Sciences Publications, Ltd</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">04126</article-id><article-id pub-id-type="doi">10.7554/eLife.04126</article-id><article-categories><subj-group subj-group-type="display-channel"><subject>Research article</subject></subj-group><subj-group subj-group-type="heading"><subject>Biochemistry</subject></subj-group><subj-group subj-group-type="heading"><subject>Genomics and evolutionary biology</subject></subj-group></article-categories><title-group><article-title>Ancestral resurrection reveals evolutionary mechanisms of kinase plasticity</article-title></title-group><contrib-group><contrib contrib-type="author" equal-contrib="yes" id="author-16837"><name><surname>Howard</surname><given-names>Conor J</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="fn" rid="equal-contrib">†</xref><xref ref-type="other" rid="par-1"/><xref ref-type="fn" rid="con1"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" equal-contrib="yes" id="author-7135"><name><surname>Hanson-Smith</surname><given-names>Victor</given-names></name><xref ref-type="aff" rid="aff2"/><xref ref-type="fn" rid="equal-contrib">†</xref><xref ref-type="other" rid="par-4"/><xref ref-type="fn" rid="con2"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-16838"><name><surname>Kennedy</surname><given-names>Kristopher J</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="other" rid="par-1"/><xref ref-type="fn" rid="con3"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-16839"><name><surname>Miller</surname><given-names>Chad J</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="other" rid="par-2"/><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con6"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-16840"><name><surname>Lou</surname><given-names>Hua Jane</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="other" rid="par-2"/><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con7"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" id="author-8311"><name><surname>Johnson</surname><given-names>Alexander D</given-names></name><xref ref-type="aff" rid="aff2"/><xref ref-type="other" rid="par-4"/><xref ref-type="fn" rid="con8"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" corresp="yes" id="author-7040"><name><surname>Turk</surname><given-names>Benjamin E</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="corresp" rid="cor1">*</xref><xref ref-type="other" rid="par-2"/><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con4"/><xref ref-type="fn" rid="conf1"/></contrib><contrib contrib-type="author" corresp="yes" id="author-16735"><name><surname>Holt</surname><given-names>Liam J</given-names></name><contrib-id contrib-id-type="orcid">http://orcid.org/0000-0002-4002-0861</contrib-id><xref ref-type="aff" rid="aff1"/><xref ref-type="corresp" rid="cor2">*</xref><xref ref-type="other" rid="par-1"/><xref ref-type="fn" rid="con5"/><xref ref-type="fn" rid="conf1"/></contrib><aff id="aff1"><institution content-type="dept">Department of Molecular and Cell Biology</institution>, <institution>University of California, Berkeley</institution>, <addr-line><named-content content-type="city">Berkeley</named-content></addr-line>, <country>United States</country></aff><aff id="aff2"><institution content-type="dept">Department of Microbiology and Immunology</institution>, <institution>University of California, San Francisco</institution>, <addr-line><named-content content-type="city">San Francisco</named-content></addr-line>, <country>United States</country></aff><aff id="aff3"><institution content-type="dept">Department of Pharmacology</institution>, <institution>Yale University School of Medicine</institution>, <addr-line><named-content content-type="city">New Haven</named-content></addr-line>, <country>United States</country></aff></contrib-group><contrib-group content-type="section"><contrib contrib-type="editor"><name><surname>Ferrell</surname><given-names>James</given-names></name><role>Reviewing editor</role><aff><institution>Stanford University</institution>, <country>United States</country></aff></contrib></contrib-group><author-notes><corresp id="cor1"><label>*</label>For correspondence: <email>ben.turk@yale.edu</email> (BET);</corresp><corresp id="cor2"><label>*</label>For correspondence: <email>liamholt@berkeley.edu</email> (LJH)</corresp><fn fn-type="con" id="equal-contrib"><label>†</label><p>These authors contributed equally to this work</p></fn></author-notes><pub-date date-type="pub" publication-format="electronic"><day>13</day><month>10</month><year>2014</year></pub-date><pub-date pub-type="collection"><year>2014</year></pub-date><volume>3</volume><elocation-id>e04126</elocation-id><history><date date-type="received"><day>23</day><month>07</month><year>2014</year></date><date date-type="accepted"><day>09</day><month>10</month><year>2014</year></date></history><permissions><copyright-statement>© 2014, Howard et al</copyright-statement><copyright-year>2014</copyright-year><copyright-holder>Howard et al</copyright-holder><license xlink:href="http://creativecommons.org/licenses/by/4.0/"><license-p>This article is distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License</ext-link>, which permits unrestricted use and redistribution provided that the original author and source are credited.</license-p></license></permissions><self-uri content-type="pdf" xlink:href="elife04126.pdf"/><abstract><object-id pub-id-type="doi">10.7554/eLife.04126.001</object-id><p>Protein kinases have evolved diverse specificities to enable cellular information processing. To gain insight into the mechanisms underlying kinase diversification, we studied the CMGC protein kinases using ancestral reconstruction. Within this group, the cyclin dependent kinases (CDKs) and mitogen activated protein kinases (MAPKs) require proline at the +1 position of their substrates, while Ime2 prefers arginine. The resurrected common ancestor of CDKs, MAPKs, and Ime2 could phosphorylate substrates with +1 proline or arginine, with preference for proline. This specificity changed to a strong preference for +1 arginine in the lineage leading to Ime2 via an intermediate with equal specificity for proline and arginine. Mutant analysis revealed that a variable residue within the kinase catalytic cleft, DFGx, modulates +1 specificity. Expansion of Ime2 kinase specificity by mutation of this residue did not cause dominant deleterious effects in vivo. Tolerance of cells to new specificities likely enabled the evolutionary divergence of kinases.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.001">http://dx.doi.org/10.7554/eLife.04126.001</ext-link></p></abstract><abstract abstract-type="executive-summary"><object-id pub-id-type="doi">10.7554/eLife.04126.002</object-id><title>eLife digest</title><p>All living things have enzymes called protein kinases that transfer chemical tags called phosphates onto other proteins. Adding a phosphate to a protein can change the protein's activity—for example, by switching it on or off—and many biological processes involve large networks of kinases that phosphorylate hundreds of proteins.</p><p>Humans have approximately 500 different protein kinases, which can each phosphorylate many proteins, and so human cells are regulated by tens of thousands of different phosphorylation sites. This raises a number of questions: how have these different kinases evolved over evolutionary history? And how have they come to recognize, and phosphorylate, so many different sites?</p><p>Howard, Hanson-Smith et al. studied members of a large family of protein kinases called the CMGC group. These enzymes are found in all organisms that have a cell nucleus, including animals, plants, and fungi. All proteins, including kinases, are built up of a chain of smaller molecules called amino acids, and the ability of a kinase to phosphorylate a protein depends on the kinase recognizing a short string of amino acids known as a motif. The phosphate is added in the middle of this motif at the so-called ‘0’ position. Howard, Hanson-Smith et al. found that all of the CMGC protein kinases tested ‘preferred’ an arginine or proline amino acid at the ‘+1’ position of this motif. However, some kinases preferred motifs with an arginine amino acid at this position, and others preferred a proline instead.</p><p>Howard, Hanson-Smith et al. predicted how the ancestors of a number of CMGC protein kinases might have looked and then ‘resurrected’ them by producing them in yeast cells. When the preference of these ancestral enzymes was tested, the oldest ancestor was found to slightly prefer motifs that had a proline amino acid at the +1 position. Testing six more recent ancestors showed that, over a billion years of evolution, this amino acid preference became broader to include both proline and arginine—and that some modern protein kinases subsequently evolved and specialized to prefer arginine at the +1 position, thus creating a new specificity.</p><p>Kinases are sometimes likened to microchips in complex electronic networks. In this analogy, expanding the specificity of a kinase could be like creating many ‘loose wires’ and cause short-circuits. From their evolutionary analysis, Howard, Hanson-Smith et al. were able to identify a structural change in the enzyme that causes an expansion of kinase specificity, which allowed them to directly test this idea in cells. Expanding the specificity of a protein kinase that controls sexual cell division in yeast cells did not stop the yeast from dividing to produce spores, suggesting that these changes are more readily tolerated than was expected. Howard, Hanson-Smith et al. suggest that this unexpected robustness of cellular circuits enabled the evolution of the wide variety of protein kinases seen today.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.002">http://dx.doi.org/10.7554/eLife.04126.002</ext-link></p></abstract><kwd-group kwd-group-type="author-keywords"><title>Author keywords</title><kwd>protein kinase</kwd><kwd>evolution</kwd><kwd>ancestral reconstruction</kwd><kwd>cyclin-dependent kinase</kwd><kwd>Ime2</kwd><kwd>phosphoregulatory networks</kwd></kwd-group><kwd-group kwd-group-type="research-organism"><title>Research organism</title><kwd>human</kwd><kwd>mouse</kwd><kwd>Neurospora</kwd><kwd><italic>S. cerevisiae</italic></kwd><kwd>other</kwd></kwd-group><funding-group><award-group id="par-1"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100006978</institution-id><institution>University of California Berkeley</institution></institution-wrap></funding-source><award-id>Bowes Research Fellows Program</award-id><principal-award-recipient><name><surname>Howard</surname><given-names>Conor J</given-names></name><name><surname>Kennedy</surname><given-names>Kristopher J</given-names></name><name><surname>Holt</surname><given-names>Liam J</given-names></name></principal-award-recipient></award-group><award-group id="par-2"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100000057</institution-id><institution>National Institute of General Medical Sciences</institution></institution-wrap></funding-source><award-id>R01 GM105947</award-id><principal-award-recipient><name><surname>Miller</surname><given-names>Chad J</given-names></name><name><surname>Lou</surname><given-names>Hua Jane</given-names></name><name><surname>Turk</surname><given-names>Benjamin E</given-names></name></principal-award-recipient></award-group><award-group id="par-3"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100000057</institution-id><institution>National Institute of General Medical Sciences</institution></institution-wrap></funding-source><award-id>R01 GM104047</award-id><principal-award-recipient><name><surname>Miller</surname><given-names>Chad J</given-names></name><name><surname>Lou</surname><given-names>Hua Jane</given-names></name><name><surname>Turk</surname><given-names>Benjamin E</given-names></name></principal-award-recipient></award-group><award-group id="par-4"><funding-source><institution-wrap><institution-id institution-id-type="FundRef">http://dx.doi.org/10.13039/100000057</institution-id><institution>National Institute of General Medical Sciences</institution></institution-wrap></funding-source><award-id>GM037049</award-id><principal-award-recipient><name><surname>Hanson-Smith</surname><given-names>Victor</given-names></name><name><surname>Johnson</surname><given-names>Alexander D</given-names></name></principal-award-recipient></award-group><funding-statement>The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.</funding-statement></funding-group><custom-meta-group><custom-meta><meta-name>elife-xml-version</meta-name><meta-value>2</meta-value></custom-meta><custom-meta specific-use="meta-only"><meta-name>Author impact statement</meta-name><meta-value>Reconstructing ancestral enzymes has revealed that a switch in kinase substrate preference evolved via an expanded specificity intermediate that is tolerated in vivo, thus providing a path for kinase diversification.</meta-value></custom-meta></custom-meta-group></article-meta></front><body><sec id="s1" sec-type="intro"><title>Introduction</title><p>Phosphorylation networks coordinate many cellular processes. Their importance is underscored by the prevalence of kinases: the human genome encodes &gt;500 kinases (<xref ref-type="bibr" rid="bib52a">Manning et al., 2002b</xref>) and over 100,000 phosphorylation sites have been identified (<xref ref-type="bibr" rid="bib38">Hornbeck et al., 2012</xref>). The number and diversity of kinases expanded with increasing numbers of cell types during the evolution of metazoa (<xref ref-type="bibr" rid="bib52">Manning et al., 2002a</xref>). The addition of new kinase families with new specificities presumably increases the information processing capacity of the cell, thus enabling the emergence of more complex biological processes (<xref ref-type="bibr" rid="bib8">Beltrao et al., 2009</xref>; <xref ref-type="bibr" rid="bib49">Lim and Pawson, 2010</xref>).</p><p>To achieve precise regulation, kinases have evolved mechanisms to selectively phosphorylate specific substrates. This specificity is encoded at multiple levels. The active site of some kinases is optimized to bind to a defined peptide sequence, referred to as the primary specificity. Kinases may have additional peptide interaction surfaces that bind to docking motifs on the substrate distinct from the site of phosphate transfer. Non-substrate proteins called scaffolds can also form tertiary complexes to direct the interaction between kinase and substrate. The sub-cellular localization of a kinase can limit access to a subset of proteins. Finally, systems-level effects such as substrate competition and the opposing activities of phosphatases all affect the degree to which substrates are phosphorylated in the context of the cell (Reviewed in <xref ref-type="bibr" rid="bib63">Remenyi et al., 2005</xref>; <xref ref-type="bibr" rid="bib74">Ubersax and Ferrell, 2007</xref>).</p><p>Phosphoregulatory networks are well suited to rapid information processing because phosphorylation reactions act on time scales of minutes (<xref ref-type="bibr" rid="bib60">Olsen et al., 2006</xref>). For this reason, kinase networks are crucial for processes that require a high degree of temporal control, such as the cellular division programs, mitosis, and meiosis. Taking mitosis as an example, kinase networks control a wide range of cell sizes (2 μm to several mm) and cell biology (from a single to a thousand chromosomes, closed vs open mitosis). Thus the phosphorylation networks that underlie these processes must adapt to enable these changes in cell biology. There has been considerable progress in the understanding of transcription-factor network evolution in recent years, and these studies have helped understand the generation of morphological diversity (<xref ref-type="bibr" rid="bib15">Carroll, 2008</xref>) and key principles of transcriptional rewiring (<xref ref-type="bibr" rid="bib73">Tuch et al., 2008</xref>). Despite recent progress (<xref ref-type="bibr" rid="bib37">Holt et al., 2009</xref>; <xref ref-type="bibr" rid="bib70">Tan et al., 2009</xref>; <xref ref-type="bibr" rid="bib2">Alexander et al., 2011</xref>; <xref ref-type="bibr" rid="bib21">Cross et al., 2011</xref>; <xref ref-type="bibr" rid="bib26">Freschi et al., 2011</xref>; <xref ref-type="bibr" rid="bib62">Pearlman et al., 2011</xref>; <xref ref-type="bibr" rid="bib14">Capra et al., 2012</xref>; <xref ref-type="bibr" rid="bib48">Lee et al., 2012</xref>; <xref ref-type="bibr" rid="bib7">Beltrao et al., 2013</xref>; <xref ref-type="bibr" rid="bib20">Coyle et al., 2013</xref>; <xref ref-type="bibr" rid="bib31">Goldman et al., 2014</xref>), there is still relatively little known about the evolution of kinase signaling networks.</p><p>Phosphoregulatory networks evolve by the gain or loss of protein–protein interactions, either by changes to substrates or by changes to kinase specificity. Within substrates, the gain or loss of kinase interaction motifs and phosphorylation sites have occurred relatively rapidly (changes occur within a few millions of years of divergence, <xref ref-type="bibr" rid="bib37">Holt et al., 2009</xref>; <xref ref-type="bibr" rid="bib8">Beltrao et al., 2009</xref>). These substrate mutations affect only one protein at a time; therefore detrimental pleiotropic effects are avoided. Alternatively, networks can evolve by changing kinase specificity. Kinases act as hubs of phosphoregulatory networks and can coordinate the activities of hundreds or even thousands of substrates (<xref ref-type="bibr" rid="bib52">Manning et al., 2002a</xref>; <xref ref-type="bibr" rid="bib53">Matsuoka et al., 2007</xref>; <xref ref-type="bibr" rid="bib37">Holt et al., 2009</xref>; <xref ref-type="bibr" rid="bib38">Hornbeck et al., 2012</xref>). Changing the specificity of a kinase, therefore, can destroy many network connections, while also potentially creating a large number of new connections. It might be expected that there is a strong negative selection pressure against such drastic remodeling of kinase networks, but it is nevertheless clear that kinases have evolved diverse specificities, particularly following gene duplication (<xref ref-type="bibr" rid="bib55">Mok et al., 2010</xref>). The mechanisms underlying this diversification are poorly understood, and it is unknown how kinases successfully evolve significant changes to their biochemistry and network biology.</p><p>To learn how kinase specificity evolves, we studied the evolutionary history of the CMGC (Cyclin Dependent Kinase [CDK], Mitogen Activated Protein Kinase [MAPK], Glycogen Synthase Kinase [GSK], and Casein Kinase [CK]) group of kinases. The CMGC group also contains the CDK-Like kinases (CDKL), SR-kinases, Homeodomain-Interacting Kinases (HIPKs), CDC-Like Kinases (CLKs), Dual-Specificity Tyrosine Regulated Kinases (DYRKs), and a paralogous superfamily of kinases including LF4, the mammalian RCK kinases (ICK, MOK, and MAK), and the fungal IME2 kinases. CMGC kinases coordinate a wide range of cellular functions in different species. CDKs are the major coordinators of cell division in both mitosis and meiosis (<xref ref-type="bibr" rid="bib56">Morgan, 2007</xref>). MAPKs are crucial for many cellular decisions, including proliferation, differentiation, and stress responses (<xref ref-type="bibr" rid="bib17">Chen and Thorner, 2007</xref>; <xref ref-type="bibr" rid="bib57">Morrison, 2012</xref>). The Ime2 kinase is crucial for meiosis in <italic>S. cerevisiae</italic> (<xref ref-type="bibr" rid="bib22">Dirick et al., 1998</xref>; <xref ref-type="bibr" rid="bib9">Benjamin, 2003</xref>), while its orthologs in other <italic>Ascomycetes</italic> control distinct processes including mating (<xref ref-type="bibr" rid="bib65">Sherwood et al., 2014</xref>), differentiation (<xref ref-type="bibr" rid="bib40">Hutchison and Glass, 2010</xref>), and response to light (<xref ref-type="bibr" rid="bib6">Bayram et al., 2009</xref>, for a review see <xref ref-type="bibr" rid="bib42">Irniger, 2011</xref>). The Ime2 paralogs in mammals (the RCK kinases) control diverse processes including spermatogenesis and control of retinal cilia-length (MAK), as well as intestinal cell biology, control of cell proliferation, organogenesis, and cellular differentiation (MOK and ICK) (<xref ref-type="bibr" rid="bib28">Fu, 2012</xref>).</p><p>Within the evolutionary history of CMGC kinases, gene duplications followed by diversification resulted in multiple paralogous kinases with distinct specificities that coordinate diverse biological functions. For example, the specificities of Cdk1 and Ime2 are mostly non-overlapping (<xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>). In addition to acquiring distinct modes of regulation, it is likely that the divergence of the biological functions of this kinase family is, in part, due to evolution of their primary specificities. Therefore, understanding the mechanisms that drive specificity change and the consequences of these changes is crucial to rationalize the structures of modern phosphoregulatory networks. The shared evolutionary history of CMGC kinases, combined with their diverse specificities, make them an ideal gene family for studying the evolution of kinase specificity.</p><p>In this study, we determined the primary substrate specificity of eight extant kinases from the IME2/RCK/LF4 group of kinases and found variation in the amino acid that is preferred immediately C-terminal to the phosphoacceptor (the +1 position). To determine the mechanisms by which these specificities evolved, we used maximum likelihood phylogenetic models to reconstruct sequences for all ancestors of the CMGC kinases. We then resurrected seven ancestral kinases in the lineage starting with <bold>AncCMGI</bold>, which is the last common ancestor of the <bold>C</bold>DK, CDKL, <bold>M</bold>APK, <bold>G</bold>SK, CLK, and <bold>I</bold>ME2/RCK/LF4 kinases, up until the modern LF4, RCK, and IME2 kinases. Biochemical characterization of these resurrected kinases allowed us to trace the evolution of primary specificity in this lineage. In addition, we determined a key residue that modulates primary specificity at the +1 position. By mutating this residue in modern IME2 we showed that, at least in some circumstances, the cell can readily tolerate changes that expand kinase specificity.</p></sec><sec id="s2" sec-type="results"><title>Results</title><sec id="s2-1"><title>The Ime2/RCK/LF4 kinase family has variable +1 specificity</title><p>To understand how kinase specificity changes over a long evolutionary timescale, we determined the phosphorylation site specificities of eight kinases from the superfamily of kinase paralogs that includes fungal Ime2, the mammalian RCK kinases (ICK, MOK and MAK), and the LF4 kinases in algae and protists. This superfamily controls diverse biological processes, and we hypothesized that differences in primary specificity may underlie some of this functional divergence. In addition, previous work has shown that <italic>S. cerevisiae</italic> Ime2 and mouse ICK differ in their +1 specificities (<xref ref-type="bibr" rid="bib29">Fu et al., 2006</xref>; <xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>).</p><p>We used a positional scanning peptide library (PSPL, <xref ref-type="bibr" rid="bib41">Hutti et al., 2004</xref>) to characterize the full primary specificity of these kinases (<xref ref-type="fig" rid="fig1">Figure 1A</xref>). Briefly, we used a set of 182 peptide mixtures, in which a central phosphoacceptor position (an equal mixture of serine and threonine) was surrounded by random sequence. Within each mixture, one of nine positions was fixed to a single amino acid residue (see schematic, <xref ref-type="fig" rid="fig1">Figure 1A</xref>, top). Peptides were subjected in parallel to a radiolabelled kinase assay, and the extent of radiolabel incorporation indicates which residues are preferred or disallowed by the kinase at each position within the peptide sequence.<fig-group><fig id="fig1" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.003</object-id><label>Figure 1.</label><caption><title>The IME2/RCK/LF4 superfamily of kinases has variable specificity at the +1 position.</title><p>(<bold>A</bold>) Positional scanning peptide libraries were used to profile the specificity of various kinases: left, <italic>M. musculus</italic> MOK; middle, <italic>N. gruberi</italic> LF4; right, <italic>S. cerevisiae</italic> Ime2. Yellow indicates preference for a given amino acid while blue indicates counter selection. A schematic of the peptide library is shown above (see text for details). Data show the average of two replicates for each kinase. Raw data for these kinases and four other superfamily members are shown in <xref ref-type="fig" rid="fig1s1">Figure 1—figure supplement 1</xref>. Data shown here exclude peptides containing fixed Ser and Thr residues that typically produce an artificially increased signal due to the presence of an additional phosphoacceptor residue; heat maps of full peptide array results for all extant kinases are shown in <xref ref-type="fig" rid="fig1s2">Figure 1—figure supplement 2</xref>. (<bold>B</bold>) Michaelis–Menten plots for individual 8-mer IME2/RCK/LF4 consensus peptides (schematic, below) in which the +1 position is varied to be either proline (red) or arginine (blue). (<bold>C</bold>) Positional scanning peptide library data showing the average +1 preference of 8 kinases from the IME2/RCK/LF4 subgroup of CMGC kinases.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.003">http://dx.doi.org/10.7554/eLife.04126.003</ext-link></p></caption><graphic xlink:href="elife04126f001"/></fig><fig id="fig1s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.004</object-id><label>Figure 1—figure supplement 1.</label><caption><title>Raw data for positional scanning peptide library arrays.</title><p>A set of 182 peptides with the general sequence shown at bottom, but with the indicated position in the array fixed to the indicated amino acid residue, were phosphorylated with the indicated kinase with radiolabeled ATP. Aliquots of each reaction were transferred to a streptavidin-coated membrane, which was washed and dried for autoradiography. Spot intensities reflect the extent of radiolabel incorporation into the indicated peptide mixture. Note: Peptide mixtures having fixed Ser and Thr residues generally give spuriously higher signals than other components of the library due to the presence of an additional phosphoacceptor site. This artifact is particularly evident at the +4 position for most kinases (red asterisks) as an artifact that arises from the nature of the linker sequence that follows the +4 position. This linker contains two residues (+1 Ala and +2 Gly relative to the +4 S/T) that are fixed to preferred amino acids, and so the +4 S/T gets heavily phosphorylated. We have previously verified that there is actually no preference for S/T at the +4 position (<xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.004">http://dx.doi.org/10.7554/eLife.04126.004</ext-link></p></caption><graphic xlink:href="elife04126fs001"/></fig><fig id="fig1s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.005</object-id><label>Figure 1—figure supplement 2.</label><caption><title>Quantified positional scanning peptide library data for all extant kinases analyzed.</title><p>Spot intensities shown in <xref ref-type="fig" rid="fig1s1">Figure 1—figure supplement 1</xref> and a replicate run of the same kinase were quantified (using BioRad QuantityOne software) and normalized so that the average signal in a given row was equal to unity. Data from the two replicates were averaged, log<sub>2</sub> transformed, and used to generate heat maps in Microsoft Excel. Positively selected residues are shown in yellow and negatively selected residues are shown in blue according to the scale at bottom right. Data for ICK were previously published (Fu et al., 2006) and are shown here in heat map form for comparison.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.005">http://dx.doi.org/10.7554/eLife.04126.005</ext-link></p></caption><graphic xlink:href="elife04126fs002"/></fig></fig-group></p><p>As previously reported for <italic>S. cerevisiae</italic> Ime2 and mouse ICK, PSPL analysis revealed that all kinases assayed share a strong preference for arginine at the −3 position and proline at the −2 position (<xref ref-type="fig" rid="fig1s1 fig1s2">Figure 1—figure supplements 1 and 2</xref>). However, we found that selectivity for residues C-terminal to the phosphoacceptor was more variable. Specifically, the preferred residue at the +1 position varied between arginine (+1R) for the <italic>S. cerevisiae</italic>, <italic>Candida glabrata,</italic> and <italic>Yarrowia lipolytica</italic> Ime2 homologs and proline (+1P) for the three mammalian RCK kinases. <italic>Neurospora crassa</italic> Ime2 and <italic>Naegleria gruberi</italic> LF4 phosphorylated peptides with +1R and +1P relatively equally (<xref ref-type="fig" rid="fig1">Figure 1C</xref>). All kinases also tolerated alanine (+1A) relatively equally.</p><p>Additional biochemical characterization using four consensus peptides that were varied at the phosphoacceptor (0) and +1 positions (acetyl-R-P-R-S/T-R/P-R-amide) revealed differences in steady-state kinetics underlying the +1 specificity switch. <xref ref-type="fig" rid="fig1">Figure 1B</xref> shows Michaelis–Menten curves for a pair of peptide substrates with identical sequence except for having either proline (red) or arginine (blue) at the +1 position. <italic>S. cerevisiae</italic> Ime2 phosphorylated the +1R peptide with 65-fold greater efficiency (<italic>k</italic><sub>cat</sub>/<italic>K</italic><sub><italic>M</italic></sub>) than the corresponding +1P peptide, and these differences were attributable to differences in both the <italic>k</italic><sub>cat</sub> and the <italic>K</italic><sub><italic>M</italic></sub> values. As anticipated, the <italic>N. gruberi</italic> homolog LF4 phosphorylated the +1R and +1P peptides with similar kinetics, while mouse MOK showed a twofold preference for the +1P peptide (<xref ref-type="table" rid="tbl1">Table 1</xref>).<table-wrap id="tbl1" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.006</object-id><label>Table 1.</label><caption><p>Michaelis-Menten kinetic parameters for three members of the IME2/RCK/LF4 subgroup of kinases</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.006">http://dx.doi.org/10.7554/eLife.04126.006</ext-link></p></caption><table frame="hsides" rules="groups"><thead><tr><th>Kinase</th><th>+1 residue</th><th>K<sub>M</sub> (μM)</th><th>k<sub>cat</sub> (min<sup>-1</sup>)</th><th>K<sub>cat</sub>/K<sub>M</sub> (min<sup>-1</sup>μM<sup>-1</sup>)</th><th/></tr></thead><tbody><tr><td><italic>S. cerevisiae Ime2</italic></td><td><bold>R</bold></td><td>2.2 ± 0.3</td><td>10.8 ± 0.4</td><td>4.84 ± 0.67</td><td><italic>n = 4</italic></td></tr><tr><td/><td><bold>P</bold></td><td>44.5 ± 12.2</td><td>3.3 ± 0.3</td><td>0.07 ± 0.02</td><td><italic>n = 4</italic></td></tr><tr><td><italic>N. gruberi LF4</italic></td><td><bold>R</bold></td><td>1.2 ± 0.4</td><td>5.8 ± 0.5</td><td>4.76 ± 1.6</td><td><italic>n = 3</italic></td></tr><tr><td/><td><bold>P</bold></td><td>2.7 ± 0.6</td><td>7.8 ± 0.5</td><td>2.95 ± 0.69</td><td><italic>n = 3</italic></td></tr><tr><td><italic>M. musculus Mok</italic></td><td><bold>R</bold></td><td>41.9 ± 10.8</td><td>5.9 ± 0.5</td><td>0.14 ± 0.04</td><td><italic>n = 4</italic></td></tr><tr><td/><td><bold>P</bold></td><td>37 ± 2.7</td><td>11.7 ± 0.3</td><td>0.32 ± 0.03</td><td><italic>n = 4</italic></td></tr></tbody></table></table-wrap></p><p>In summary, primary specificity at the +1 position is relatively plastic among IME2/RCK/LF4 kinases, while the other major specificity determinants remained strongly conserved. Taken together with previous characterization of other CMGC kinases (<xref ref-type="bibr" rid="bib67">Songyang et al., 1996</xref>, <xref ref-type="bibr" rid="bib66">1994</xref>; <xref ref-type="bibr" rid="bib29">Fu et al., 2006</xref>; <xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>; <xref ref-type="bibr" rid="bib64">Sheridan et al., 2008</xref>; <xref ref-type="bibr" rid="bib55">Mok et al., 2010</xref>; <xref ref-type="bibr" rid="bib2">Alexander et al., 2011</xref>; <xref ref-type="bibr" rid="bib13">Bullock et al., 2009</xref>; <xref ref-type="bibr" rid="bib44">Kettenbach et al., 2012</xref>), our results suggest two evolutionary hypotheses. One possibility is that the ancestor of modern CMGC kinases had dual specificity for arginine and proline and then lost either proline or arginine to specialize extant kinases. Alternatively, the +1 specificity for arginine could have evolved as a switch from proline to arginine.</p></sec><sec id="s2-2"><title>Ancestral reconstruction of the CMGC group of kinases</title><p>We sought to reconstruct the evolutionary events that led to the modern diversity of IME2/RCK/LF4 specificities. To achieve this goal, we curated a library of 329 amino acid sequences sampled broadly from across the CMGC group and then reconstructed their evolutionary history using maximum likelihood phylogenetic methods (<xref ref-type="bibr" rid="bib72">Thornton, 2004</xref>; see ‘Materials and methods’). The resulting phylogeny and reconstructed ancestral sequences were strongly supported by the evolutionary model (<xref ref-type="fig" rid="fig2">Figure 2</xref>, <xref ref-type="fig" rid="fig2s1">Figure 2—figure supplement 1</xref>). The full library of CMGC kinase ancestors is available at <ext-link ext-link-type="uri" xlink:href="http://www.phylobox.com/cmgc.10.2013/">http://www.phylobox.com/cmgc.10.2013/</ext-link>.<fig-group><fig id="fig2" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.007</object-id><label>Figure 2.</label><caption><title>The common ancestor of CMGI kinases had a slight preference for proline at the +1 position of the substrate peptide.</title><p>(<bold>A</bold>) Summary of current knowledge about CMGC group kinase specificity in the context of the maximum likelihood phylogeny of protein sequences. Major groups, such as IME2, MAK, ICK, etc, have been collapsed for simplicity. Branch lengths express the number of amino acid substitutions per protein sequence site. Branch support values are approximate likelihood ratios (aLRs), expressing the ratio of the likelihood of the maximum likelihood phylogeny to the next-best phylogeny lacking the indicated branch. For example, an aLR value of 10 indicates that the branch is 10 times more likely than the next-best phylogenetic hypothesis. The position of the common ancestor of CDK, MAPK, CDKL, GSK, and the IME2/LF4/RCK superfamily (AncCMGI), is indicated by a purple circle. (<bold>B</bold>) Position scanning peptide libraries were used to determine the primary specificity of the maximum likelihood resurrected AncCMGI kinase. Raw peptide data are shown in <xref ref-type="fig" rid="fig2s1">Figure 2—figure supplement 1</xref>. A complete repeat of the PSPL for a Bayesian sampled alternative reconstruction of AncCMGI (AncCMGI-B2) is shown in <xref ref-type="fig" rid="fig2s2">Figure 2—figure supplement 2</xref>. A structural model of AncCMGI is shown in <xref ref-type="fig" rid="fig2s3">Figure 2—figure supplement 3</xref> in phylogenetic context along with structures and models for extant kinases that were derived from this ancestor.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.007">http://dx.doi.org/10.7554/eLife.04126.007</ext-link></p></caption><graphic xlink:href="elife04126f002"/></fig><fig id="fig2s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.008</object-id><label>Figure 2—figure supplement 1.</label><caption><title>Support for reconstrutctions.</title><p>We characterized the support for each ancestor by binning their posterior probabilities of states into 5%-sized bins and counting the proportion of ancestral sites within each bin. (<bold>A</bold>) Posterior probability of each maximum likelihood amino acid (P(ML)) is shown as a function of position within the kinase primary sequence. Sites with lower support generally correspond to loop regions. (<bold>B</bold>) Histogram showing distribution of posterior probabilities of maximum likelihood amino acids. Mean (x) and standard deviation (σ2) values are indicated. Complete reconstruction data with complete probabilities at every position is available at <ext-link ext-link-type="uri" xlink:href="http://www.phylobox.com/cmgc.10.2013/">http://www.phylobox.com/cmgc.10.2013/</ext-link>.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.008">http://dx.doi.org/10.7554/eLife.04126.008</ext-link></p></caption><graphic xlink:href="elife04126fs003"/></fig><fig id="fig2s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.009</object-id><label>Figure 2—figure supplement 2.</label><caption><title>Raw data and selectivity values for a positional scanning peptide library array of an alternate reconstruction of AncCMGI.</title><p>(<bold>A</bold>) Raw PSPL result for an alternative reconstruction of AncCMGI. (<bold>B</bold>) Averaged quantified data from (<bold>A</bold>) and a replicate analysis with AncCMGI. Data were collected and quantified as in <xref ref-type="fig" rid="fig1s1 fig1s2">Figure 1—figure supplements 1 and 2</xref>.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.009">http://dx.doi.org/10.7554/eLife.04126.009</ext-link></p></caption><graphic xlink:href="elife04126fs004"/></fig><fig id="fig2s3" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.010</object-id><label>Figure 2—figure supplement 3.</label><caption><title>Structural evolution in the CMGC kinase group.</title><p>A model of AncCMGI was generated from the CDK2 structure using Phyre2. This structure is compared to the structures of extant CMGC kinases in their phylogenetic context. The core structure present in AncCMGI is colored grey. Additional domains in modern kinases are indicated in various colors: e.g. C-terminal extension of CDKL (cyan); C-terminal extension of MAPK (red); Cyclin (beige).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.010">http://dx.doi.org/10.7554/eLife.04126.010</ext-link></p></caption><graphic xlink:href="elife04126fs005"/></fig></fig-group></p></sec><sec id="s2-3"><title>The common ancestor of CDK, CDKL, MAPK, GSK, and IME2/RCK/LF4 (AncCMGI) had a modest +1 proline preference and was cyclin-independent</title><p>We synthesized DNA encoding the maximum likelihood common ancestor of <bold>C</bold>DK, CDKL, <bold>M</bold>APK, <bold>G</bold>SK, and <bold>I</bold>ME2/RCK/LF4 (referred to as AncCMGI). We expressed and purified the kinase from both <italic>S. cerevisiae</italic> and <italic>E. coli.</italic> We obtained active kinase in both cases. We employed positional scanning arrays to determine the primary specificity of AncCMGI, as described above and in <xref ref-type="fig" rid="fig1">Figure 1</xref>. Many kinases have fairly degenerate primary specificities (<xref ref-type="bibr" rid="bib55">Mok et al., 2010</xref>) and rely on docking, scaffolding, and localization to discriminate their correct substrates (<xref ref-type="bibr" rid="bib74">Ubersax and Ferrell, 2007</xref>). We therefore reasoned that an ancestral kinase might have lower specificity than the extant enzymes derived from this ancestor. Indeed, there have been several studies in which enzymes have become sub-specialized from broad-specificity ancestors following gene duplication during evolution (<xref ref-type="bibr" rid="bib71">Thornton, 2003</xref>). However, we observed that AncCMGI had a well-defined primary specificity, with a strong preference for arginine at the −3 position and proline at the −2 position (<xref ref-type="fig" rid="fig2">Figure 2B</xref>). These N-terminal determinants correspond to the conserved motif found in the IME2/RCK/LF4 kinases as well as the DYRK kinases (<xref ref-type="fig" rid="fig2">Figure 2A</xref>). Interestingly, AncCMGI could phosphorylate peptides having either a proline or an arginine residue at the +1 position, though it displayed a 5.6-fold preference for proline (<xref ref-type="fig" rid="fig2">Figure 2B</xref>). Thus AncCMGI appeared to have a modest +1 proline preference, in contrast to the more stringent proline requirement of the extant CDK and MAPK families (<xref ref-type="bibr" rid="bib35">Himpel et al., 2000</xref>; <xref ref-type="bibr" rid="bib29">Fu et al., 2006</xref>; <xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>; <xref ref-type="bibr" rid="bib55">Mok et al., 2010</xref>; <xref ref-type="bibr" rid="bib2">Alexander et al., 2011</xref>). Thus, the specificity of AncCMGI contains elements of the diverged specificities of many major sub-families of the CMGC group.</p><p>Notably, the domain architecture of AncCMGI is most similar to IME2/RCK/LF4 kinases. That is, AncCMGI contains the canonical CMGC insert loop, but it lacks any C-terminal extension (found in MAPKs). Furthermore, AncCMGI does not appear to require cyclin for activity (as with CDKs): we observed no significant co-purifying proteins, and <italic>E. coli</italic> does not encode any cyclin orthologs (<xref ref-type="fig" rid="fig2">Figure 2</xref>, <xref ref-type="fig" rid="fig2s3">Figure2—figure supplement 3</xref>). These data indicate that the cyclin dependence of CDKs and the requirement for an additional C-terminal extension to stabilize the Cα helix in MAPKs are characteristics that arose later during evolution (<italic>E. coli</italic> does not encode any cyclin orthologs [<xref ref-type="fig" rid="fig2">Figure 2</xref>, <xref ref-type="fig" rid="fig2s3">Figure2—figure supplement 3</xref>]). In addition, AncCMGI contains a MAPK-like TXY motif in the activation loop. Phosphorylation of this motif is required for the activation of MAPKs. Because <italic>E. coli</italic> lack endogenous kinases capable of phosphorylating this TXY motif, AncCMGI is likely activated through autophosphorylation, similar to extant mammalian DYRK and GSK family kinases (<xref ref-type="bibr" rid="bib18">Cole et al., 2004</xref>; <xref ref-type="bibr" rid="bib51">Lochhead et al., 2005</xref>).</p></sec><sec id="s2-4"><title>+1 specificity evolved from modest proline preference to strong arginine preference via an expanded specificity intermediate</title><p>In order to learn the trajectory by which kinase specificity at the +1 position evolved, we reconstructed ancestral kinases within the CMGC group at multiple evolutionary time points before and after the +1 specificity change from proline to arginine was presumed to have occurred (<xref ref-type="fig" rid="fig3">Figure 3A</xref>). These kinases were assayed using consensus peptide substrates with identical sequence except for having either proline (red) or arginine (blue) at the +1 position. The log-ratio of arginine/proline preference from this assay is plotted in <xref ref-type="fig" rid="fig3">Figure 3B</xref>. Because the +1 specificity could be dependent on the surrounding sequence context present in the consensus peptide substrates, we also characterized the full primary specificities of AncLF4 and AncICK kinases by PSPL arrays (<xref ref-type="fig" rid="fig3">Figure 3</xref>, <xref ref-type="fig" rid="fig3s1">Figure 3—figure supplement 1</xref>). While the specificity for arginine at the −3 position and proline at the −2 position was conserved among these ancestors, we observed significant variation in their relative preference for +1R vs +1P (<xref ref-type="fig" rid="fig3">Figure 3B,C</xref>; <xref ref-type="fig" rid="fig3">Figure 3</xref>, <xref ref-type="fig" rid="fig3s1">Figure 3—figure supplement 1</xref>). As described above, AncCMGI appears to have preferred +1P substrates. On the phylogenetic branch leading to the common ancestor of the IME2/RCK/LF4 group (i.e., AncNgru), the +1 specificity relaxed to equally accommodate both +1R and +1P (+1PR). This hybrid specificity was conserved in the LF4 lineage and also on the branch leading to AncICK. Evolution after AncICK, however, proceeded along two divergent evolutionary paths. Namely, the specificity reverted to the ancestral +1P-preferring state along the branch leading to the mammalian RCK kinases (ICK and MAK). In contrast, the specificity shifted to +1R in the fungal lineage leading to the ancestor of the IME2 kinases (i.e., AncIME2). AncIME2 had a moderate preference for +1R, and this preference is maintained in other fungal IME2 ancestors, becoming more pronounced in the ancestor of <italic>Yarrowia lipolytica</italic> (AncYlip). These results are summarized in their phylogenetic context in <xref ref-type="fig" rid="fig3">Figure 3A</xref> and are robust to statistical uncertainties about the reconstructed ancestral sequences, although the degree of +1 proline selectivity was slightly lower in AncCMGI-B1 and higher in AncCMGI-B2 (see ‘Materials and methods’, <xref ref-type="fig" rid="fig3">Figure 3</xref>, <xref ref-type="fig" rid="fig3s2">Figure 3—figure supplement 2</xref>).<fig-group><fig id="fig3" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.011</object-id><label>Figure 3.</label><caption><title>The substrate peptide +1 specificity evolved from proline in AncCMGI to arginine in <italic>S. cerevisiae</italic> Ime2 via an expanded specificity intermediate.</title><p>(<bold>A</bold>) Phylogenetic tree for the IME2/LF4/RCK superfamily, also showing the positions of other major CMGC group families. The positions of ancestral nodes resurrected in this study are indicated by circles. The tree is color-coded: red indicates +1 proline preference, blue indicates +1 arginine preference, and purple indicates equal tolerance of both Arg and Pro at the +1 position. (<bold>B</bold>) Seven resurrected kinases were incubated with 45 μM peptide (see schematic, below). Bars show the log<sub>2</sub> ratio of +1R and +1P initial velocities (V<sub>0</sub>R/V<sub>0</sub>P). Black and white bars indicate Ser and Thr respectively as phosphoacceptor. Error bars are standard error of three assays. (<bold>C</bold>) Peptide phosphorylation rates for the same resurrected kinases shown in (<bold>B</bold>) using the peptides from the positional scanning peptide library having the indicated residue fixed at the +1 position. Data show the average of two replicates and are normalized and log<sub>2</sub> transformed so that the average value for a given kinase is zero. The heat map follows the same color scheme as in <xref ref-type="fig" rid="fig1">Figure 1A</xref>.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.011">http://dx.doi.org/10.7554/eLife.04126.011</ext-link></p></caption><graphic xlink:href="elife04126f003"/></fig><fig id="fig3s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.012</object-id><label>Figure 3—figure supplement 1.</label><caption><title>Raw data and selectivity values for full positional scanning peptide arrays of AncLF4 and AncICK.</title><p>(<bold>A</bold>) Raw PSPL result for maximum likelihood reconstructions of AncNgru and AncICK. (<bold>B</bold>) Averaged, quantified selectivity values for two replicate runs of the kinases shown in (<bold>A</bold>). Data were collected and processed as in <xref ref-type="fig" rid="fig1s1 fig1s2">Figure 1—figure supplements 1 and 2</xref>.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.012">http://dx.doi.org/10.7554/eLife.04126.012</ext-link></p></caption><graphic xlink:href="elife04126fs006"/></fig><fig id="fig3s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.013</object-id><label>Figure 3—figure supplement 2.</label><caption><title>The substrate peptide +1 specificity evolved from proline in AncCMGI to arginine in <italic>S. cerevisiae</italic> Ime2 via an expanded specificity intermediate – robustness to uncertainty in reconstructions.</title><p>(<bold>A</bold>) Alternative reconstructions of each of the seven resurrected kinases (‘Materials and methods’) were incubated with 45 μM peptide (see schematic, bottom). Bars show the log<sub>2</sub> ratio of +1R and +1P initial velocities (V<sub>0</sub>R/V<sub>0</sub>P). Black and white bars indicate Ser and Thr respectively as phosphoacceptor. Error bars are standard error of three assays. (<bold>B</bold>) Peptide phosphorylation rates for the same resurrected kinases shown in (<bold>A</bold>), using the peptides from the positional scanning peptide library having the indicated residue fixed at the +1 position. Data show the average of two to three replicates and are normalized and log<sub>2</sub> transformed so that the average value for a given kinase is zero. The heat map follows the same color scheme as in <xref ref-type="fig" rid="fig1">Figure 1A</xref>.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.013">http://dx.doi.org/10.7554/eLife.04126.013</ext-link></p></caption><graphic xlink:href="elife04126fs007"/></fig></fig-group></p></sec><sec id="s2-5"><title>The phosphoacceptor influences +1 specificity</title><p>In initial results comparing +1 specificities obtained from our positional scanning peptide library arrays to those from our ratiometric peptide assays, we noticed that ICK was an outlier among the mammalian RCKs: this kinase phosphorylated peptides with +1R and +1P equally in our ratiometric assay, while the arrays showed a clear +1P preference (albeit with detectable phosphorylation of the +1R peptide mixture, Figure 4, <xref ref-type="fig" rid="fig4s1">Figure 4—figure supplement 1</xref>). We had initially used peptides with only serine as a phosphoacceptor in our ratiometric assay, while the PSPL array peptides contained equal mixtures of serine and threonine. We therefore reasoned that the nature of the phosphoacceptor might influence the +1 specificity of kinases. To test this hypothesis, we analyzed additional peptide sets with equal mixes of serine and threonine, or with only threonine as the phosphoacceptor in our ratiometric assay. From these experiments, we found that, indeed, the phosphoacceptor affects +1 specificity: serine causes a shift towards +1R preference, while threonine causes a shift towards +1P preference (Figure 4, <xref ref-type="fig" rid="fig4s1">Figure 4—figure supplement 1</xref>). This dependence of +1 specificity on the phosphoacceptor is present in AncCMGI and is maintained in all ancestors and extant members of the IME2/RCK/LF4 family (Figures 3B and 4C, <xref ref-type="fig" rid="fig3s2">Figure 3—figure supplement 2</xref>).</p></sec><sec id="s2-6"><title>The DFGx residue in the kinase activation loop is a determinant of +1 specificity</title><p>To understand how kinase phosphorylation site specificity changes in evolution from a structural standpoint, we sought to identify specific residues in the kinase catalytic domain that mediate +1 specificity. Kinase substrate co-crystallography and biochemical analysis of large numbers of kinases have revealed some of the rules connecting kinase sequence to specificity (<xref ref-type="bibr" rid="bib80">Zhu et al., 2005</xref>; <xref ref-type="bibr" rid="bib32">Goldsmith et al., 2007</xref>; <xref ref-type="bibr" rid="bib55">Mok et al., 2010</xref>). The peptide-binding groove is formed from a number of structural elements within the kinase catalytic domain. One key point of kinase-substrate interaction is the activation loop, a conformationally flexible region that extends between two highly conserved amino acid motifs, DFG, and APE, connecting the N- and C-terminal kinase lobes (<xref ref-type="bibr" rid="bib39">Huse and Kuriyan, 2002</xref>). Previous work revealed that the amino acid immediately C-terminal to the conserved DFG motif contributes to preference for serine vs threonine at the phosphoacceptor (<xref ref-type="bibr" rid="bib16">Chen et al., 2014</xref>). This residue was previously referred to as the DFG+1 residue, but here we will refer to this amino acid as the <bold>DFGx</bold> residue to avoid confusion with the +1 amino acid position of the substrate peptide. Since the DFGx residue communicates with the phosphoacceptor, and the phosphoacceptor in turn influences +1 specificity, we hypothesized that the identity of the DFGx residue may be a determinant for +1P vs +1R specificity. This hypothesis is consistent with X-ray crystal structures of kinase peptide complexes, in which the +1 residue in the substrate is in close proximity to the DFGx residue (<xref ref-type="bibr" rid="bib68">Soundararajan et al., 2013</xref>) (<xref ref-type="fig" rid="fig4">Figure 4A</xref>). Our ancestral reconstructions indicate that the DFGx residue was a leucine in AncCMGI and then mutated to serine multiple times in the CMGC family (<xref ref-type="fig" rid="fig4">Figure 4B</xref>). Further, the presence of leucine vs serine at DFGx in present-day kinases correlates with specificity for +1P vs +1R. Taken together, this suggests that Leu vs Ser at DFGx could affect kinase specificity at the +1 site. To test this hypothesis, we examined the effect of mutating Leu to Ser in mammalian ICK (L146S). This single mutation at DFGx made ICK approximately threefold more proline specific, such that its specificity resembled that of MOK (<xref ref-type="fig" rid="fig4">Figure 4C</xref>). Conversely, mutating the DFGx residue in the opposite direction in MOK (S148L) had the reverse effect of converting MOK from +1P preference to a non-selective kinase (+1PR). We also mutated the DFGx residue in <italic>S. cerevisae</italic> Ime2 and in the ancestors AncLF4(S152L), AncICK(L152S), and AncCMGI(L151S). In all cases, we observed that mutation from leucine to serine at DFGx shifted kinase function towards +1P specificity, while mutation from serine to leucine at DFGx shifted the kinases towards +1R specificity (<xref ref-type="fig" rid="fig4">Figure 4C</xref>). We note that though DFGx mutation was sufficient to substantially shift the +1 residue preference, other residues must also be important for +1 specificity, since proline to arginine selectivity shifts occur in the evolution of CMGC kinases without a DFGx mutation.<fig-group><fig id="fig4" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.014</object-id><label>Figure 4.</label><caption><title>The DFGx amino acid and the phosphoacceptor influence the +1 specificity of extant and ancestral kinases.</title><p>(<bold>A</bold>) Structural model of AncCMGI in complex with a consensus peptide substrate. The box shows the active site with the position of the DFGx amino acid highlighted in orange. ATP is blue and the substrate peptide is red. For clarity, sidechains are only shown for residues discussed in the text. (<bold>B</bold>) Phylogenetic tree indicating the identity of the DFGx amino acid and the transitions that occurred in the evolution of the CMGC group. Numbers indicate support for ancestral reconstructions. (<bold>C</bold>) Kinases were incubated with 45 μM peptide and initial velocities measured. In general, L to S mutations shift substrate preference towards +1P while S to L mutations shift preference towards +1 R. Bars show the log<sub>2</sub> ratio of +1R and +1P initial velocities (V<sub>0</sub>R/V<sub>0</sub>P). Black and white bars indicate wild type or maximum likelihood kinases incubated with peptides that contain serine and threonine respectively as phosphoacceptor. Dark and light orange indicate DFGx mutant kinases incubated with peptides that contain serine and threonine respectively as phosphoacceptor. Error bars are standard error of three assays. <xref ref-type="fig" rid="fig4s1">Figure 4—figure supplement 1</xref> shows data for ICK compared to PSPL results. <xref ref-type="fig" rid="fig4s2">Figure 4—figure supplement 2</xref> shows full Michaelis–Menten curves for selected kinases and DFGx mutants.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.014">http://dx.doi.org/10.7554/eLife.04126.014</ext-link></p></caption><graphic xlink:href="elife04126f004"/></fig><fig id="fig4s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.015</object-id><label>Figure 4—figure supplement 1.</label><caption><title>The phosphoacceptor affects the +1 specificity of ICK. ICK kinase was incubated with 45 μM peptide and initial velocities measured.</title><p>Bars show the log<sub>2</sub> ratio of +1R/+1P initial velocities (V<sub>0</sub>R/V<sub>0</sub>P). Black and white bars indicate that ICK was incubated with peptides that contain serine and threonine respectively as phosphoacceptor. The gray bar indicates an equal mix of serine and threonine as phosphoacceptor. The lower gray bar is ratio data taken from the PSPL array in <xref ref-type="fig" rid="fig1s1">Figure 1—figure supplement 1</xref> i.e. a ratio of phosphate incorporation into Y-A-X-X-X-X-X-S/T-<bold>R</bold>-X-X-X-A-G-K-K-biotin vs Y-A-X-X-X-X-X-S/T-<bold>P</bold>-X-X-X-A-G-K-K-biotin peptides, where S/T indicates an equal mixture of serine or threonine as phosphoacceptor and X indicates an equal mixture of all amino acids except C, S, or T at all other positions except +1 and the terminal linker amino acids.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.015">http://dx.doi.org/10.7554/eLife.04126.015</ext-link></p></caption><graphic xlink:href="elife04126fs008"/></fig><fig id="fig4s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.016</object-id><label>Figure 4—figure supplement 2.</label><caption><title>Michaelis–Menten curves for selected kinases.</title><p><italic>Saccharomyces cerevisiae</italic> Ime2, <italic>Saccharomyces cerevisiae</italic> Ime2-DFGx (L231S), <italic>Mus musculus</italic> ICK, and <italic>Mus musculus</italic> ICK-DFGx (L146S) were incubated with varying peptide concentrations (x-axis), and initial velocities for the reaction are plotted on the y-axis. The curves on the left contain serine as phosphoacceptor (S-peptides) while those on the right contain threonine (T-peptides). Peptides with Pro at the +1 position are colored red and marked with triangles, peptides with Arg at the +1 position are colored blue and marked with circles (see schematic, top).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.016">http://dx.doi.org/10.7554/eLife.04126.016</ext-link></p></caption><graphic xlink:href="elife04126fs009"/></fig><fig id="fig4s3" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.017</object-id><label>Figure 4—figure supplement 3.</label><caption><title>Variable phosphoacceptor preference for extant IME2/RCK/LF4 kinases.</title><p>Kinases were incubated with 45 μM peptide and initial velocities measured. Bars show the log<sub>2</sub> ratio of initial velocities with serine or threonine as phosphoacceptor (V<sub>0</sub>S/V<sub>0</sub>T). Red and blue bars indicate the presence of arginine and proline, respectively, at the +1 substrate position. Error bars are standard deviations of three replicates.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.017">http://dx.doi.org/10.7554/eLife.04126.017</ext-link></p></caption><graphic xlink:href="elife04126fs010"/></fig><fig id="fig4s4" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.018</object-id><label>Figure 4—figure supplement 4.</label><caption><title>Phosphoacceptor preference shows a general shift from threonine towards threonine during evolution in the IME2/RCK/LF4 lineage.</title><p>Kinases were incubated with 45 μM peptide and initial velocities measured. Bars show the log<sub>2</sub> ratio of initial velocities with serine or threonine as phosphoacceptor (V<sub>0</sub>S/V<sub>0</sub>T). Red and blue bars indicate the presence of arginine and proline, respectively, at the +1 substrate position. Error bars are standard deviations of three replicates.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.018">http://dx.doi.org/10.7554/eLife.04126.018</ext-link></p></caption><graphic xlink:href="elife04126fs011"/></fig></fig-group></p><p>In addition to affecting kinase selectivity at the +1 position, as anticipated mutation of the DFGx residue also impacted phosphoacceptor preference (see <xref ref-type="fig" rid="fig4">Figure 4</xref>, <xref ref-type="fig" rid="fig4s2">Figure 4—figure supplement 2</xref>). However, the identity of the DFGx residue appeared to only modestly affect the phosphoacceptor specificity in comparison to specificity at the +1 position, and did not follow a clear pattern. These results are in keeping with previous observations that CMGC kinases are generally less specific than other kinase groups for the phosphoacceptor residue (<xref ref-type="bibr" rid="bib16">Chen et al., 2014</xref>; <xref ref-type="fig" rid="fig4">Figure 4</xref>, <xref ref-type="fig" rid="fig4s3">Figure 4—figure supplement 3</xref>). We also examined phosphoacceptor preference in our resurrected CMGC ancestors. A general trend is observed in which AncCMGI has a slight preference for serine and this shifts towards a slight threonine preference in the lineage leading to Ime2 (<xref ref-type="fig" rid="fig4">Figure 4</xref>, <xref ref-type="fig" rid="fig4s4">Figure 4—figure supplement 4</xref>).</p></sec><sec id="s2-7"><title>The <italic>S. cerevisiae</italic> meiotic phosphoregulatory network tolerates expansion of kinase specificity</title><p>In <italic>S. cerevisiae</italic>, Ime2 is expressed exclusively during meiosis and is required for all stages of this process, including meiotic initiation, S-phase, the meiotic divisions, and gametogenesis (<xref ref-type="bibr" rid="bib79">Yoshida et al., 1990</xref>; <xref ref-type="bibr" rid="bib22">Dirick et al., 1998</xref>; <xref ref-type="bibr" rid="bib9">Benjamin, 2003</xref>; <xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>; <xref ref-type="bibr" rid="bib54">McDonald et al., 2009</xref>). This meiotic exclusive expression allows us to engineer allelic replacements of <italic>IME2</italic> without any impact on vegetative cells, and then assess the ability of strains to complete various aspects of the meiotic program.</p><p>The Ime2 DFGx mutant shifts from a strong +1R preference to an expanded specificity that tolerates both proline and arginine at the +1 position. This mutant has a comparable turnover to the wild-type kinase (<xref ref-type="fig" rid="fig4">Figure 4</xref>, <xref ref-type="fig" rid="fig4s2">Figure 4—figure supplement 2</xref>) and therefore can be used to meaningfully test the effect of changing primary specificity on phosphoregulatory networks in vivo. To this end, we replaced the endogenous <italic>IME2</italic> gene with the <italic>ime2-DFGx</italic> allele and assayed the ability of cells to undergo meiosis.</p><p>As reported previously (<xref ref-type="bibr" rid="bib9">Benjamin, 2003</xref>), a kinase-dead version of Ime2 (<italic>ime2-K97R</italic>) failed to support meiosis (not shown). However, cells with both copies of <italic>IME2</italic> replaced with a DFGx mutant (<italic>ime2-L231S</italic>) completed meiosis, but with a reduction of sporulation efficiency thus indicating that the <italic>ime2-DFGx</italic> allele has significant activity in vivo (<xref ref-type="fig" rid="fig5">Figure 5A</xref>). The homozygous <italic>ime2-DFGx</italic> cells that did correctly form tetrads had reduced spore viability (<xref ref-type="fig" rid="fig5">Figure 5B</xref>, <xref ref-type="fig" rid="fig5s1">Figure 5—figure supplement 1</xref>), and initiated S-phase (<xref ref-type="fig" rid="fig5">Figure 5C</xref>) and the meiotic divisions (<xref ref-type="fig" rid="fig5">Figure 5D</xref>) with a 2–3 hr delay. These defects may be caused by a weakening of network connections due to a reduced preference for arginine at the +1 position in the DFGx mutant.<fig-group><fig id="fig5" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.019</object-id><label>Figure 5.</label><caption><title>The <italic>S. cerevisiae</italic> meiotic phosphoregulatory network tolerates an expanded specificity DFGx mutant.</title><p>(<bold>A</bold>) Sporulation efficiency with various <italic>IME2</italic> alleles: wild-type <italic>IME2</italic> (WT), an <italic>ime2-(L231S)</italic> heterozygote (WT/DFGx), or an <italic>ime2-(L231S)</italic> homozygote (DFGx/DFGx). (<bold>B</bold>) Fraction of tetrad spores that, when dissected, gave rise to colonies (representative images shown in <xref ref-type="fig" rid="fig5s1">Figure 5—figure supplement 1</xref>). (<bold>C</bold>) Synchronous meiosis was induced and DNA content analyzed by SYTOX-Green staining and flow cytometry (representative raw data shown in <xref ref-type="fig" rid="fig5s2">Figure 5—figure supplement 2</xref>). (<bold>D</bold>) Synchronous meiosis was induced and DNA segregation events were scored by fluorescence microscopy. Error bars represent standard error of three or more biological replicates.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.019">http://dx.doi.org/10.7554/eLife.04126.019</ext-link></p></caption><graphic xlink:href="elife04126f005"/></fig><fig id="fig5s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.020</object-id><label>Figure 5—figure supplement 1.</label><caption><title>Representative tetrad dissections.</title><p>Synchronous meiosis was induced for wild-type IME2 (WT), an <italic>ime2-(L231S)</italic> heterozygote (WT/DFGx), or an <italic>ime2-(L231S)</italic> homozygote (DFGx/DFGx).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.020">http://dx.doi.org/10.7554/eLife.04126.020</ext-link></p></caption><graphic xlink:href="elife04126fs012"/></fig><fig id="fig5s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.04126.021</object-id><label>Figure 5—figure supplement 2.</label><caption><title>Representative cytometry data from meiosis experiments.</title><p>Synchronous meiosis was induced and DNA content analyzed by SYTOX-Green staining and flow cytometry. Approximate positions of 2c and 4c DNA content are indicated. Beyond 8 hr, gametogenesis occurs, leading to packaging of DNA into spores and resulting in additional peaks.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.021">http://dx.doi.org/10.7554/eLife.04126.021</ext-link></p></caption><graphic xlink:href="elife04126fs013"/></fig></fig-group></p><p>The defects in the homozygous <italic>ime2-DFGx (L231S)</italic> strain could be due to either a weakening of existing network connections, a gain of new network connections that interfere with meiotic processes or a combination of both. We reasoned that the loss of important phosphorylation events would be recessive and could be compensated for by the presence of a wild-type <italic>IME2</italic> allele, while the gain of new phosphorylation sites would be dominant. To test for dominant effects in the DFGx mutant, we generated heterozygous <italic>IME2/ime2-DFGx</italic> strains. <italic>IME2/ime2-DFGx</italic> strains had no noticeable defects in meiosis or sporulation. DNA replication and divisions occurred on schedule, spore formation was normal, and spore viability was high (<xref ref-type="fig" rid="fig5">Figure 5</xref>). Therefore, we conclude that the defects seen in the homozygous DFGx mutant strains are most likely due to weakening of network connections, rather than the creation of new deleterious phosphorylation sites. This experiment is analogous to gene duplication followed by paralog divergence in evolution and indicates that the meiotic phosphoregulatory network can tolerate divergence of specificity in a second copy of the <italic>IME2</italic> gene.</p></sec></sec><sec id="s3" sec-type="discussion"><title>Discussion</title><p>We resurrected multiple ancestral CMGC kinases and characterized their specificities in order to reveal the evolutionary trajectories by which specificity at the +1 site in present-day kinases diversified. The ancestor of all CMGC kinases had slight specificity for proline at its +1 site (+1P), and then subsequently evolved broader specificity for both proline and arginine (+1PR) along the lineage leading to mammalian RCK and fungal IME2 kinases. After the (+1PR) hybrid ancestor, evolution followed various trajectories. In mammalian RCK kinases, the specificity reverted to the ancient modest proline specificity, while in <italic>N. gruberi</italic> LF4 and <italic>N. crassa</italic> Ime2 the equal preference for proline and arginine was maintained. In <italic>S. cerevisiae, Y. liplolytica,</italic> and <italic>C. glabrata</italic>, the hybrid intermediate kinase evolved towards arginine specificity (+1R). Our molecular manipulation experiments showed that substitutions at a single residue in the active site of the kinase sequence, at the DFGx site, had a significant effect in determining +1P vs +1R specificity. Finally, by studying kinase function in vivo, we showed that cells tolerate the hybrid +1PR specificity. Taken together with the measured specificities of reconstructed CMGC kinases, these results suggest that the evolutionary trajectory from +1P to +1R passed through a historic ancestor with hybrid +1PR specificity. However, due to the modest degree of +1 specificity in AncCMGI (5.6-fold by PSPL and 2-fold in the context of an otherwise perfect consensus peptide) together with slight fluctuations in degree of specificity in alternate reconstructions, an alternative possibility is that AncCMGI was a broad specificity kinase that sub-functionalized to modern proline or arginine specific kinases.</p><p>Our results are consistent with previous phylogenetic studies suggesting that IME2 kinases are closer to the ancestral state and that CDKs are more derived (<xref ref-type="bibr" rid="bib45">Krylov et al., 2003</xref>). In fact, we observed that AncCMGI, the maximum likelihood common ancestor of <bold>C</bold>DK, CDKL, <bold>M</bold>APK, <bold>G</bold>SK, and <bold>I</bold>ME2/RCK/LF4, has essentially the same specificity as the mouse MOK kinase and does not require a cyclin or any C-terminal extension for its activation. Our phylogenetic reconstructions indicate, therefore, that cyclin dependence is a derived characteristic. Although our study focused on the evolutionary trajectory leading from AncCMGI to the IME2/RCK/LF4 superfamily, we in fact reconstructed ancestral sequences in all lineages of the CMGC tree, including the ancestors leading to mammalian CDKs and MAPKs. This library of ancestral kinases is a rich resource that can be used to elucidate the evolutionary paths by which MAPKs and CDKs acquired their unique forms of allosteric regulation.</p><p>Selectivity for proline at the +1 position is a unique characteristic of the CMGC group, as kinases from other groups appear to strongly disfavor proline (<xref ref-type="bibr" rid="bib80">Zhu et al., 2005</xref>). This proline selectivity has been attributed to an arginine residue found at the C-terminus of the activation loop (at the xAPE position) that is unique to the CMCG group. In structures of CMGC kinases, this arginine residue interacts with the backbone carbonyl group of another residue in the activation loop, termed the ‘toggle residue’, to orient it away from the bound substrate. In contrast, in other kinase groups the toggle residue carbonyl is oriented toward the substrate, forming a hydrogen bond with the amide proton of the +1 residue. Thus, the presence of arginine at the xAPE position is thought to explain the +1 preference of CMGC kinases for Pro, a secondary amino acid lacking an amide proton. However, because all CMGC kinases have arginine at this position, and not all CMGC kinases prefer proline at the +1 position, other residues must be responsible for our observed differences in substrate specificity among members of this group. We have identified the DFGx residue as one such determinant. Our ancestral reconstructions predict several additional amino acids as determinants of +1 specificity, and future work will illuminate their effects.</p><p>Computational analyses have suggested that kinases display little inter-positional dependence in their primary peptide specificity and that each substrate peptide amino acid interacts with the kinase active site more or less independently (<xref ref-type="bibr" rid="bib43">Joughin et al., 2012</xref>). Our observation that +1 specificity is dependent on the identity of the phosphoacceptor represents an exception to this model. This inter-positional dependence is ancestral and has been maintained in the entire IME2/RCK/LF4 family of kinases. The common role for the DFGx residue in influencing specificity for both the +1 and phosphoacceptor residues provides a structural basis for inter-dependence between the two positions. It is likely that similar inter-positional dependencies exist in other kinase families in cases where the two residues share an overlapping binding site. For example, crystal structures of tyrosine kinase–peptide complexes have revealed a single binding pocket accommodating both the +1 and +3 residues (<xref ref-type="bibr" rid="bib11">Bose et al., 2006</xref>). Such information should be considered when designing future phosphorylation site prediction algorithms.</p><p>Previous studies have shown that specificity-shifting mutations tend to lead to large losses in enzyme activity. As such, the evolution of new specificity often proceeds by first acquiring permissive amino acid substitutions that stabilize the protein conformation and then next acquiring substitutions that shift specificity. For example, in the case of the glucocorticoid receptors, it was inferred that neutral mutations that stabilized a new conformation must have been acquired before the specificity-shifting mutation could arise in the receptor's active site (<xref ref-type="bibr" rid="bib61">Ortlund et al., 2007</xref>). In another example, it was shown that permissive mutations were required in influenza neuraminidase prior to acquisition of drug resistance mutations that subtly altered binding specificities (<xref ref-type="bibr" rid="bib10">Bloom et al., 2010</xref>). In directed evolution experiments, additional compensating mutations were required to restore wild-type levels of activity to proteases mutated in their specificity pockets (<xref ref-type="bibr" rid="bib75">Varadarajan et al., 2008</xref>). This requirement for multiple epistatic mutations is likely to slow the evolution of specificity in these systems, while also significantly reducing the chance of convergent specificity evolution. The evolution of CMGC kinase specificity at the +1 site, however, is an outlier to this paradigm. CMGC kinases seem to be relatively tolerant to modulation of +1 specificity by mutation of the DFGx residue: this mutation did not lead to a significant loss of activity in any of the six kinases we tested. This is an unusual case where a single amino acid mutation can drive divergence of specificity without the need for additional stabilizing mutations. Perhaps this tolerance explains the repeated convergent evolution of the DFGx residue.</p><p>There has been considerable evolutionary diversification to the primary specificities of the CMGC kinases such that paralogs diverged by more than two billion years have almost no overlap in their preferences. In order to make progress studying a tractable period of evolutionary change, we focused our analysis on the sub-family rooted at the common ancestor of Cdk1 and Ime2 (i.e., AncCMGI). Following AncCMGI, a gene duplication occurred, and the specificities of the two descendant paralogs almost completely diverged. Cdk1 recognizes a [S/T*]-P-x-[K/R] motif, while its paralog Ime2 recognizes R-P-x-[S/T*]-R (<xref ref-type="fig" rid="fig2">Figure 2A</xref>; <xref ref-type="bibr" rid="bib36">Holt et al., 2007</xref>). We studied the mutational trajectory of the paralog leading to Ime2, and a similar analysis awaits for the paralog leading to Cdk1. The mechanism by which both paralogs were retained is unclear, although previous work has revealed how other gene paralogs were retained according to various evolutionary behaviors, including sub-functionalization (<xref ref-type="bibr" rid="bib25">Force et al., 1999</xref>; <xref ref-type="bibr" rid="bib23">Finnigan et al., 2012</xref>), neo-functionalization (<xref ref-type="bibr" rid="bib12">Bridgham, 2006</xref>; <xref ref-type="bibr" rid="bib19">Conant and Wolfe, 2008</xref>), and avoidance of paralog interference (<xref ref-type="bibr" rid="bib5">Baker et al., 2013</xref>). Future work studying the Cdk1 paralog will reveal to what extent these models fit the evolution of CMGC kinases.</p><p>The evolution of phosphoregulatory networks is analogous to the evolution of gene transcription regulatory networks (<xref ref-type="bibr" rid="bib58">Moses and Landry, 2010</xref>). In both cases, changing the specificity of the regulator gene—either a kinase or a transcription factor—can potentially lead to the loss of essential network connections and also potentially create a large number of new connections (<xref ref-type="fig" rid="fig6">Figure 6</xref>). The gain or loss of connections might be a barrier to evolution if the changes were to interfere with existing biology. Nevertheless, it is clear that both transcription regulators and kinases have undergone extensive diversification in their specificities and network connections. Although much has been written about the evolution of transcription regulators, there has been comparatively little work on the evolutionary patterns and processes of kinases and phosophoregulatory networks. In light of the fact that the DFGx mutation shifts specificity from +1R to an expanded +1RP without a loss of activity, this allele provides a model for us to investigate whether dominant effects might constrain phosphoregulatory network evolution. The specificity modulation in the DFGx mutant could in theory generate hundreds of new phosphorylation events and modulate the rates of many more. However, our initial genetic analysis of the DFGx mutant suggests that expansion of Ime2 specificity is not highly toxic to cells. As long as a wild-type copy of IME2 is present, the second copy can expand its activity without any noticeable reduction of meiotic efficiency.<fig id="fig6" position="float"><object-id pub-id-type="doi">10.7554/eLife.04126.022</object-id><label>Figure 6.</label><caption><title>Simplified schematic of one possible path of kinase network evolution.</title><p>Hexagons represent kinases, circles represent substrates, lines connecting kinase to substrate indicate potential phosphoregulation.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.022">http://dx.doi.org/10.7554/eLife.04126.022</ext-link></p></caption><graphic xlink:href="elife04126f006"/></fig></p><p>There are several possible explanations for the tolerance of the meiotic network to the DFGx heterozygous state. First, it could be that the modulation of primary specificity does not lead to a large number of phosphor transfer events to novel peptides. This would be the case if secondary specificity determinants such as docking interactions, scaffolding, or localization were dominant over primary specificity. For example, if the kinase was constrained to a subset of proteins by strong docking interactions, the potential to phosphorylate other proteins might be limited. Second, it could be that new phosphorylation events are occurring but that they have mild or insignificant biological consequence (<xref ref-type="bibr" rid="bib46">Landry et al., 2009</xref>). It remains an open question what proportion of the tens of thousands of phosphates detectable in the cell serve to modulate substrate function: perhaps it is a minority. Thirdly, it could be that phosphatases in the cell dominate over the novel phosphorylation events from the DFGx mutant kinase. In this case, phosphates would be transferred to new substrate peptides, but phosphatases would quickly remove them, such that steady-state substrate phosphorylation levels are low. Genetic experiments with phosphatase mutants could be informative here, and these experiments are ongoing.</p><p>In conclusion, we combined ancestral reconstruction and functional biochemistry to elucidate part of the mechanism by which the ancestor of the CDK and IME2 kinase families switched from a slight +1P to +1R specificity. This specificity change is likely to have evolved via an expanded specificity intermediate, suggesting that phosphoregulatory networks are robust enough to tolerate the introduction of new specificities. We have obtained initial evidence that the extant network that controls meiosis in <italic>S. cerevisiae</italic> can tolerate the introduction of the expanded specificity <italic>ime2-DFGx</italic> allele without a large loss of fitness. Evolution of new kinases is likely to begin with the chance generation of new specificities that give some advantage. However, to ensure an evolutionary future for new kinases, novel substrate connections need to arise and the balance with competing phosphatases must be tuned to generate advantageous networks that support the ever-elaborating diversity of life.</p><sec id="s3-1"><title>Note added in proof</title><p>During the process of submission the authors have learned that the DFGx specificity mutation described in this study is also observed in multiple cancer kinases including Aurora A, PKCgamma, Haspin, DDR1, ITK, TRKA, IRAK3 and BRAF (Creixell et al. unpublished), suggesting that amino acid changes that occur during evolution are resampled in cancer. It will be interesting to see if these mutations cause specificity changes in the context of oncogenesis.</p></sec></sec><sec id="s4" sec-type="materials|methods"><title>Materials and methods</title><sec id="s4-1"><title>Plasmids</title><p>Kinases were either cloned by PCR from respective organisms or from gifts: <italic>N. crassa</italic> Ime2 was a gift from Louise Glass, ICK and MOK cDNA were gifts from Tom Sturgill and Zheng Fu (<xref ref-type="bibr" rid="bib29">Fu et al., 2006</xref>), MAK cDNA was a gift from Alex Bullock, <italic>N. gruberi</italic> genomic DNA was a gift from Lillian Fritz-Laylin (<xref ref-type="bibr" rid="bib27">Fritz-Laylin et al., 2010</xref>). Ancestral kinases were synthesized either from gBlock gene fragments (IDT) or by Genscript. All plasmids were assembled by Gibson isothermal assembly (<xref ref-type="bibr" rid="bib30">Gibson et al., 2009</xref>), cloned in <italic>E. coli</italic> XL1-blue strains, and prepared by miniprep (Qiagen, Redwood City, CA). The list of plasmids used in this study is presented in <xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1</xref>.</p></sec><sec id="s4-2"><title>Clarification of mammalian RCK family gene names</title><p>The RCK family of kinase is identified by various synonyms in the literature. Therefore, to avoid confusion, the mammalian RCK kinases used in this study are:<list list-type="order"><list-item><p><bold>ICK</bold> <italic>Mus musculus</italic> (MGI:1934157, Mouse synonym: 2210420N10Rik; Human orthologue ICK, HGNC:21219, Human synonyms: <bold>I</bold>ntestinal <bold>C</bold>ell <bold>K</bold>inase, KIAA0936, LCK2, MGC46090, MRK);</p></list-item><list-item><p><bold>MOK</bold> <italic>Mus musculus</italic> (MGI:1336881, Mouse synonyms: <bold>M</bold>APK/MAK/MRK <bold>O</bold>verlapping <bold>K</bold>inase, MOK, Rage, Stk30; Human orthologue MOK, HGNC:9833, Human synonyms: Renal Tumor Antigen, RAGE1, STK30);</p></list-item><list-item><p><bold>MAK</bold> <italic>Homo sapiens</italic> (HGNC:6816, <bold>M</bold>ale germ cell-<bold>A</bold>ssociated <bold>K</bold>inase, Human synonyms: dJ417M14.2, RP62).</p></list-item></list></p></sec><sec id="s4-3"><title>Yeast strains</title><p>Yeast strains were generated by standard transformation and crossing protocols. Protein purification was performed from W303 strains. We initially performed meiotic experiments in SK1 strains derived from the Herskowitz collection (<xref ref-type="bibr" rid="bib9">Benjamin, 2003</xref>), but later switched to SK1 strains from Angelika Amon (a gift from Elçin Ünal). Both SK1 strains gave similar results but the Amon background was more consistent. All yeast strains were generated by standard LiAc transformation (<xref ref-type="bibr" rid="bib4">Amberg et al., 2005</xref>). SK1 and W303 strains were heat shocked at 42°C for 15 and 40 min respectively. Point mutations of <italic>IME2</italic> in the SK1 background were generated by 2-step loop-in, loop-out gene replacement technique using selection and counter-selection of the <italic>URA3</italic> marker at the <italic>IME2</italic> genomic locus. The list of yeast strains used in this study is presented in <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2</xref>.</p></sec><sec id="s4-4"><title>Synchronous sporulation timecourses</title><p>Liquid sporulation was conducted at 30°C as follows: strains were thawed on YP + 3% glycerol plates overnight, then patched on YPD plates, and grown overnight. 2 ml YPD liquid cultures were inoculated from patches and grown to saturation by shaking at 30°C, 250 rpm for 21–23 hr (OD<sub>600</sub> ≈ 7.0). Cultures were diluted in 50 mL of YP + 1% KOAc to OD<sub>600</sub> = 0.25 and grown overnight by shaking at 30°C, 250 rpm for 15–16 hr. Cells were pelleted and washed once with sterile water, then resuspended in 1% KOAc sporulation media to OD<sub>600</sub> ≈ 2.5. Sporulation cultures were shaken at 30°C, 250 rpm and samples were collected every hour for DNA staining and flow cytometry.</p></sec><sec id="s4-5"><title>DNA staining for flow cytometry and imaging</title><p>Cells were fixed by mixing 0.5 ml of sporulation culture with 1 ml of EtOH. Fixed cells were pelleted and resuspended in water, then sonicated on a Branson Sonifier Model 450 at 10% amplitude for 3 s to break up cell clumps. Cells were pelleted, then resuspended in 100 µl of 40 µg/ml RNase A + 0.05% NP-40 + 50 mM Tris pH 7.4, and incubated at 37°C for 1 hr. Finally, 50 µl of 40 µg/ml proteinase K + 1 µM SYTOX Green + 50 mM Tris pH 7.4 was added to each sample and incubated at 55°C for 1 hr prior to analysis by cytometry.</p></sec><sec id="s4-6"><title>Measurement of meiotic divisions and sporulation efficiency</title><p>Progression of meiotic divisions was measured by epifluorescence of SYTOX Green-stained cells. 100–200 cells were counted per sample. To measure sporulation efficiency, we counted the proportion of tetrad, dyad, and monad or unsporulated cells from synchronous sporulation cultures after 24–48 hr.</p></sec><sec id="s4-7"><title>Protein purification</title><p>W303 <italic>S. cerevisiae</italic> strains containing a 2 μm P<sub>GAL1</sub>-kinase-TAP plasmid (pRSAB1234 backbone, originally a gift from Erin O'Shea) were grown overnight to log phase in SC-URA media containing 2% raffinose (Sigma), and then expression of N-terminal kinase domains was induced by addition of 2% galactose (Sigma) for 4 hr at 30°C. Cells were harvested by centrifugation at 8000 rpm, cell pellet washed and resuspended in 1× cell volume of lysis buffer containing 25 mM HEPES pH 8.0, 300 mM NaCl, 0.1% NP-40, 30 mM EGTA, 1 mM EDTA, and a protease/phosphatase inhibitor set was added immediately prior to harvest including 80 mM β-glycerophosphate, 50 mM NaF, 1 mM DTT, 1 mM Na<sub>3</sub>O<sub>4</sub>V, and 1 mM PMSF. The cell slurry was slowly dripped into liquid nitrogen to produce frozen pellets. These pellets were then pulverized in a cryogenic ball mill (Retsch MM301 with 50 ml stainless steel grinding jars) by five rounds of agitation at 15 Hz for 2 min, re-cooling the grinding jars in liquid N<sub>2</sub> after each cycle. The grindate was then thawed and cell debris was cleared by centrifugation at 8000 rpm for 30 min followed by sequential filtration through 2.7 and 1.6 μm Whatman GD/X filters (GE). C-terminally TAP-tagged kinases were immobilized on IgG Sepharose 6 Fast Flow beads (GE). These beads (∼500 μl slurry per 1 l culture) had been pre-equilibrated in lysis buffer with inhibitors and were then incubated with lysate for 1 hr at 4°C. Bound beads were then loaded into a disposable Bio-Spin column (cat. #732-6008; BioRad) by pipette and washed with 20 ml total wash buffer (lysis buffer + 10% glycerol, 1 mM DTT) at 4°C. The column was then rotated for 20 min at 23°C in 700 μl wash buffer as a final wash to mimic elution conditions. The bound protein was then cleaved from the IgG beads by TEV protease in 600 μl elution buffer (0.21 mg/ml TEV protease [QB3 MacroLab, UC Berkeley], 25 mM HEPES pH 8.0, 310 mM NaCl, 0.09% NP-40, 26.9 mM EGTA, 0.9 mM EDTA, 1 mM DTT, and 10% glycerol) for 1 hr at 23°C.</p><p>For bacterial purification, proteins were expressed with an N-terminal 6× HIS tag in Rosetta2 DE3 pLysS competent cells (QB3 MacroLab, UC Berkeley) by induction with 100 µM IPTG for 18 hr at 16°C. The cell pellet was resuspended in an equal volume of bacterial wash buffer (50 mM Tris-Cl pH 8.0, 300 mM NaCl, 10% glycerol, and 0.1% Triton X-100), freeze-thawed once with liquid nitrogen, then lysed by 3× french press (American Instrument Company) cycles in the presence of 0.05 mg/ml DNAse I, 2 mM MgCl<sub>2</sub>, 5 mM β-mercaptoethanol, and 1 mM PMSF. Lysate was cleared by centrifugation and filtration as above. NiNTA-Sepharose beads (GE) (∼660 μl slurry per 1 l culture) were equilibrated in bacterial wash buffer containing 20 mM imidazole and combined at 1:1 ratio with lysate before adding a further 2.5 mM βME, 1 mM Na<sub>3</sub>O<sub>4</sub>V, 80 mM β-glycerophosphate, 50 mM NaF, and 0.5 mM PMSF. This lysate was incubated with beads for batch binding overnight at 4°C. Bound beads were loaded onto a disposable column (as above) and rinsed with the remaining unbound fraction. The column was washed 3 × with 10 ml bacterial wash buffer containing 10 mM imidazole and 2 mM βME, 3 × with 10 ml bacterial wash buffer containing 20 mM imidazole and 2 mM βME, then one final time with 20 ml bacterial wash buffer containing 20 mM imidazole, 2 mM βME, and 2 mM ATP. The column was rotated in previous ATP solution for 15 min at 4°C in an attempt to remove chaperones, before rinsing once more with 5 ml bacterial wash buffer containing 20 mM imidazole and 2 mM βME to remove ATP. The protein was released by incubation with 300 μl of the above buffer containing 250 mM imidazole twice, followed by incubation with 300 μl of the above buffer containing 500 mM imidazole twice. All purification samples were analyzed by SDS-PAGE with Coomassie Brilliant Blue R-250 stain.</p></sec><sec id="s4-8"><title>Positional scanning peptide library assays</title><p>The library consisted of 182 peptide mixtures having the general sequence Y-A-x-x-x-x-x-S/T-x-x-x-x-A-G-K-K(biotin), where X represents an equimolar mixture of the 17 proteogenic amino acid residues (excluding Ser, Thr, and Cys), S/T indicates an even mixture of Ser and Thr, and biotin is conjugated through an aminohexanoic acid spacer to the C-terminal Lys residue. In each mixture, a single residue at one of the 9 ‘x’ positions was fixed as one of the 20 amino acids. In addition, we included two peptide mixtures in which all ‘x’ positions were degenerated, but the phosphoacceptor residue was fixed as either Ser or Thr. Peptides were arrayed in 1536 well plates to a final concentration of 50 μM in 2 μl reaction buffer (50 mM HEPES, pH 7.4, 150 mM NaCl, 10 mM MgCl<sub>2</sub>, 0.1% Tween 20) per well. Reactions were initiated by adding kinase mixed with ATP (final concentration 50 μM with 0.03 μCi/μl [γ-<sup>33</sup>P]ATP). Plates were incubated at 30°C for 2 hr, and then 200 nl aliquots were transferred to streptavidin-coated membrane (Promega), which was quenched by immersion in 0.1% SDS, 10 mM Tris–HCl, pH 7.5, 140 mM NaCl. Membranes were then washed twice with the same solution, twice with 2 M NaCl, and twice with 1% H<sub>3</sub>PO<sub>4</sub>, 2 M NaCl. After briefly rinsing with ddH<sub>2</sub>O, membranes were air-dried and exposed to a phosphor imager screen. Following scanning on a phosphor imager (BioRad), radiolabel incorporation was quantified using QuantityOne software. Data were normalized so that the average signal for a given peptide position was 1. For visualization normalized data from two separate runs were averaged, log transformed, and used to generate heat maps in Microsoft Excel using the color scheme shown in the figures.</p></sec><sec id="s4-9"><title>Kinase Peptide assays</title><p>Ratiometric specificities were profiled in buffer containing 77.5 mM HEPES pH 7.5, 77.5 mM NaCl, 15.5 mM MgCl<sub>2</sub>, 250 μM ATP, 0.45 mg/ml BSA, 4.5% glycerol, and 0.2 μCi [γ-<sup>32</sup>P]ATP. Minimal kinase concentrations sufficient for signal were determined empirically and ranged from 5 to 50 nM. Peptides obtained from Tufts University Core Facility (<ext-link ext-link-type="uri" xlink:href="http://www.tucf.com/">http://www.tucf.com</ext-link>) were added to a final concentration of 45 μM to start the reaction. Comparative peptide assays were always performed in parallel. Reaction assays were aliquoted onto Whatman P81 phosphocellulose (GE) strips, which were then quenched and washed 5 × in 75 mM phosphoric acid to remove free [γ-<sup>32</sup>P] ATP. Samples were dried on a slab gel dryer (Model 1125B; BIORAD) and exposed to a phosphor screen (Molecular Dynamics) to determine the rate of [γ-<sup>32</sup>P] ATP incorporation. Phosphor screens were analyzed with a Typhoon 9400 scanner (Amersham) using ImageQuant software (GE). Final Image quantification was performed using ImageJ (<ext-link ext-link-type="uri" xlink:href="http://imagej.nih.gov/ij/">http://imagej.nih.gov/ij/</ext-link>). Michaelis–Menten curves were generated in a similar manner, except the buffer contained 50 mM HEPES pH 7.5, 50 mM NaCl, 10 mM MgCl<sub>2</sub>, 500 µM ATP, 83.3 μg/ml BSA, 0.833% glycerol, and 0.2 μCi [γ-<sup>32</sup>P]ATP. In this case, substrate concentration was varied for each kinase peptide combination. Data were fit by nonlinear regression to the Michaelis–Menten model V<sub>0</sub> = V<sub>max</sub>*[S]/K<sub>M</sub>+[S] using Prism (GraphPad software) and Matlab (MathWorks) and this fit was used to determine values for V<sub>max</sub> and K<sub>M</sub>.</p></sec><sec id="s4-10"><title>Phylogenetic reconstruction of ancestral CMGC kinases</title><p>Orthologs of the CMGC gene family were identified by a BLAST search based on the amino acid sequence of <italic>S. cerevisiae</italic> IME2 and <italic>H. sapiens</italic> CDK1, using the NCBI BLAST tool (<xref ref-type="bibr" rid="bib3">Altschul et al., 1990</xref>). To eliminate false positives, hit sequences were reverse BLAST queried, and we eliminated any hits that did not have either IME2 or CDK1 as a result with at least 50% sequence identity. Using the remaining 329 amino acid sequences, a multiple sequence alignment was inferred using MSAProbs with default settings (<xref ref-type="bibr" rid="bib50">Liu et al., 2010</xref>). This alignment was best-fit by the LG model (<xref ref-type="bibr" rid="bib47">Le and Gascuel, 2008</xref>) with a gamma-distributed set of evolutionary rates (<xref ref-type="bibr" rid="bib77">Yang and Kumar, 1996</xref>), according to the Akaike Information Criterion as implemented in PROTTEST (<xref ref-type="bibr" rid="bib1">Abascal et al., 2005</xref>).</p><p>Using LG+G, we used a maximum likelihood (ML) algorithm (<xref ref-type="bibr" rid="bib78">Yang et al., 1995</xref>) to infer the ancestral amino sequences with the highest probability of producing all the extant sequence data. Specifically, we used RAxML version 7.2.8 to infer the ML topology, branch lengths, and evolutionary rates (<xref ref-type="bibr" rid="bib69">Stamatakis, 2006</xref>). We exported this ML phylogeny to another software package, PhyML (<xref ref-type="bibr" rid="bib33">Guindon et al., 2010</xref>), in order to calculate statistical support for branches as approximate likelihood ratios. We next reconstructed ML ancestral states at each site for all ancestral nodes using the software package Lazarus (<xref ref-type="bibr" rid="bib34">Hanson-Smith et al., 2010</xref>). We used sequences from the CK family as the outgroup to root the tree. We placed ancestral insertion/deletion characters according to Fitch's parsimony (<xref ref-type="bibr" rid="bib24">Fitch, 1971</xref>), in which each indel character was treated independently.</p><p>We extracted the ancestral sequences from the phylogenetic nodes corresponding to AncCMGI, AncLF4, AncNgru, AncICK, AncCneo, AncNcra, and AncYlip. We characterized the support for these ancestors by binning their posterior probabilities of states into 5%-sized bins and counting the proportion of ancestral sites within each bin (<xref ref-type="fig" rid="fig2s1">Figure 2—figure supplement 1</xref>). We also generated alternate versions of these ancestral sequences by randomly sampling from their posterior distributions to generate between 2 and 3 alternate ancestors for every node, as described in <xref ref-type="bibr" rid="bib76">Williams et al. (2006)</xref>.</p></sec><sec id="s4-11"><title>Robustness to uncertainties in sequence reconstruction</title><p>Ancestral sequence reconstruction is a probabilistic method and involves uncertainties in the amino acid identities. Even for relatively well-conserved protein families like kinases, these uncertainties become more pronounced when attempting to reconstruct deep ancestors. A summary of the statistical supports for each of the resurrected kinases is presented in <xref ref-type="fig" rid="fig2s1">Figure 2—figure supplement 1</xref>. For each of the seven resurrected kinases between 60 and 90 alternative amino acids were sampled by a Bayesian method to address the impact of uncertainties on our results. In all cases, both the overall primary specificities and the +1 specificities as determined by individual peptides were robust to uncertainties in ancestral sequence reconstruction. That is, kinases that we resurrected with amino acid substitutions that explored different possible amino acids had very similar activity levels and specificities as the maximum likelihood ancestors presented in the main figures. The degree of preference for proline varies slightly in the deepest alternative reconstruction, but the general trend of evolution from +1P preference, to expanded specificity, and finally to +1R preference, is maintained. The data from alternative reconstructions is presented in <xref ref-type="fig" rid="fig2s2">Figure 2—figure supplement 2</xref> and <xref ref-type="fig" rid="fig3s2">Figure 3—figure supplement 2</xref>. Together, these results give us confidence in the validity of our approach.</p></sec></sec></body><back><ack id="ack"><title>Acknowledgements</title><p>The authors thank Elçin Ünal for yeast strains and advice on meiosis protocols; Lillian Fritz-Laylin, Louise Glass, Alexander Bullock, Zheng Fu and Tom Sturgill for DNA; Rune Linding and Pau Creixell for sharing results prior to publication; Doug Koshland, members of the Koshland lab for discussion of the project; and Woj Wojtowicz, Carlos Pantoja, Vincent Guacci, William Ludington, and David Morgan for critical reading of the manuscript. Work in the Johnson laboratory was supported by NIH grant GM037049. Work in the Turk laboratory was supported by NIH grants R01 GM105947 and R01 GM104047, and CJM was supported by training grant T32 GM007324. Work in the Holt laboratory was supported by the William Bowes Research Fellowship.</p></ack><sec sec-type="additional-information"><title>Additional information</title><fn-group content-type="competing-interest"><title>Competing interests</title><fn fn-type="conflict" id="conf1"><p>The authors declare that no competing interests exist.</p></fn></fn-group><fn-group content-type="author-contribution"><title>Author contributions</title><fn fn-type="con" id="con1"><p>CJH, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con2"><p>VH-S, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con3"><p>KJK, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con4"><p>BET, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con5"><p>LJH, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con6"><p>CJM, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con7"><p>HJL, Acquisition of data, Analysis and interpretation of data</p></fn><fn fn-type="con" id="con8"><p>ADJ, Analysis and interpretation of data, Drafting or revising the article</p></fn></fn-group></sec><sec sec-type="supplementary-material"><title>Additional files</title><supplementary-material id="SD1-data"><object-id pub-id-type="doi">10.7554/eLife.04126.023</object-id><label>Supplementary file 1.</label><caption><p>List of plasmids generated in this study.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.023">http://dx.doi.org/10.7554/eLife.04126.023</ext-link></p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife04126s001.xlsx"/></supplementary-material><supplementary-material id="SD2-data"><object-id pub-id-type="doi">10.7554/eLife.04126.024</object-id><label>Supplementary file 2.</label><caption><p>List of yeast strains generated in this study.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.04126.024">http://dx.doi.org/10.7554/eLife.04126.024</ext-link></p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife04126s002.xlsx"/></supplementary-material></sec><ref-list><title>References</title><ref id="bib1"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Abascal</surname><given-names>F</given-names></name><name><surname>Zardoya</surname><given-names>R</given-names></name><name><surname>Posada</surname><given-names>D</given-names></name></person-group><year>2005</year><article-title>ProtTest: selection of best-fit models of protein evolution</article-title><volume>21</volume><fpage>2104</fpage><lpage>2105</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/bti263</pub-id></element-citation></ref><ref id="bib2"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Alexander</surname><given-names>J</given-names></name><name><surname>Lim</surname><given-names>D</given-names></name><name><surname>Joughin</surname><given-names>BA</given-names></name><name><surname>Hegemann</surname><given-names>B</given-names></name><name><surname>Hutchins</surname><given-names>JR</given-names></name><name><surname>Ehrenberger</surname><given-names>T</given-names></name><name><surname>Ivins</surname><given-names>F</given-names></name><name><surname>Sessa</surname><given-names>F</given-names></name><name><surname>Hudecz</surname><given-names>O</given-names></name><name><surname>Nigg</surname><given-names>EA</given-names></name><name><surname>Fry</surname><given-names>AM</given-names></name><name><surname>Musacchio</surname><given-names>A</given-names></name><name><surname>Stukenberg</surname><given-names>PT</given-names></name><name><surname>Mechtler</surname><given-names>K</given-names></name><name><surname>Peters</surname><given-names>JM</given-names></name><name><surname>Smerdon</surname><given-names>SJ</given-names></name><name><surname>Yaffe</surname><given-names>MB</given-names></name></person-group><year>2011</year><article-title>Spatial exclusivity combined with positive and negative selection of phosphorylation motifs is the basis for context-dependent mitotic signaling</article-title><source>Science Signaling</source><volume>4</volume><fpage>ra42</fpage><pub-id pub-id-type="doi">10.1126/scisignal.2001796</pub-id></element-citation></ref><ref id="bib3"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname><given-names>SF</given-names></name><name><surname>Gish</surname><given-names>W</given-names></name><name><surname>Miller</surname><given-names>W</given-names></name><name><surname>Myers</surname><given-names>EW</given-names></name><name><surname>Lipman</surname><given-names>DJ</given-names></name></person-group><year>1990</year><article-title>Basic local alignment search tool</article-title><source>Journal of Molecular Biology</source><volume>215</volume><fpage>403</fpage><lpage>410</lpage><pub-id pub-id-type="doi">10.1016/S0022-2836(05)80360-2</pub-id></element-citation></ref><ref id="bib4"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Amberg</surname><given-names>DC</given-names></name><name><surname>Burke</surname><given-names>D</given-names></name><name><surname>Strathern</surname><given-names>JN</given-names></name></person-group><year>2005</year><source>Methods in Yeast Genetics</source><publisher-name>CSHL Press</publisher-name></element-citation></ref><ref id="bib5"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname><given-names>CR</given-names></name><name><surname>Hanson-Smith</surname><given-names>V</given-names></name><name><surname>Johnson</surname><given-names>AD</given-names></name></person-group><year>2013</year><article-title>Following gene duplication, paralog interference constrains transcriptional circuit evolution</article-title><source>Science</source><volume>342</volume><fpage>104</fpage><lpage>108</lpage><pub-id pub-id-type="doi">10.1126/science.1240810</pub-id></element-citation></ref><ref id="bib6"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bayram</surname><given-names>O</given-names></name><name><surname>Sari</surname><given-names>F</given-names></name><name><surname>Braus</surname><given-names>GH</given-names></name><name><surname>Irniger</surname><given-names>S</given-names></name></person-group><year>2009</year><article-title>The protein kinase ImeB is required for light-mediated inhibition of sexual development and for mycotoxin production in <italic>Aspergillus Nidulans</italic></article-title><volume>71</volume><fpage>1278</fpage><lpage>1295</lpage><pub-id pub-id-type="doi">10.1111/j.1365-2958.2009.06606.x</pub-id></element-citation></ref><ref id="bib7"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Beltrao</surname><given-names>P</given-names></name><name><surname>Bork</surname><given-names>P</given-names></name><name><surname>Krogan</surname><given-names>NJ</given-names></name><name><surname>van Noort</surname><given-names>V</given-names></name></person-group><year>2013</year><article-title>Evolution and functional cross-talk of protein post-translational modifications</article-title><source>Molecular Systems Biology</source><volume>9</volume><fpage>714</fpage><pub-id pub-id-type="doi">10.1002/msb.201304521</pub-id></element-citation></ref><ref id="bib8"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Beltrao</surname><given-names>P</given-names></name><name><surname>Trinidad</surname><given-names>JC</given-names></name><name><surname>Fiedler</surname><given-names>D</given-names></name><name><surname>Roguev</surname><given-names>A</given-names></name><name><surname>Lim</surname><given-names>WA</given-names></name><name><surname>Shokat</surname><given-names>KM</given-names></name><name><surname>Burlingame</surname><given-names>AL</given-names></name><name><surname>Krogan</surname><given-names>NJ</given-names></name></person-group><year>2009</year><article-title>Evolution of phosphoregulation: comparison of phosphorylation patterns across yeast species</article-title><source>PLOS Biology</source><volume>7</volume><fpage>e1000134</fpage><pub-id pub-id-type="doi">10.1371/journal.pbio.1000134</pub-id></element-citation></ref><ref id="bib9"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Benjamin</surname><given-names>KR</given-names></name></person-group><year>2003</year><article-title>Control of landmark events in meiosis by the CDK Cdc28 and the meiosis-specific kinase Ime2</article-title><source>Genes &amp; Development</source><volume>17</volume><fpage>1524</fpage><lpage>1539</lpage><pub-id pub-id-type="doi">10.1101/gad.1101503</pub-id></element-citation></ref><ref id="bib10"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bloom</surname><given-names>JD</given-names></name><name><surname>I Gong</surname><given-names>L</given-names></name><name><surname>Baltimore</surname><given-names>D</given-names></name></person-group><year>2010</year><article-title>Permissive Secondary Mutations Enable the Evolution of Influenza Oseltamivir Resistance</article-title><source>Science</source><volume>328</volume><fpage>1272</fpage><lpage>1275</lpage><pub-id pub-id-type="doi">10.1126/science.1187816</pub-id></element-citation></ref><ref id="bib11"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bose</surname><given-names>R</given-names></name><name><surname>Holbert</surname><given-names>MA</given-names></name><name><surname>Pickin</surname><given-names>KA</given-names></name><name><surname>Cole</surname><given-names>PA</given-names></name></person-group><year>2006</year><article-title>Protein tyrosine kinase-substrate interactions</article-title><source>Current Opinion in Structural Biology</source><volume>16</volume><fpage>668</fpage><lpage>675</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2006.10.012</pub-id></element-citation></ref><ref id="bib12"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bridgham</surname><given-names>JT</given-names></name></person-group><year>2006</year><article-title>Evolution of hormone-receptor complexity by molecular exploitation</article-title><source>Science</source><volume>312</volume><fpage>97</fpage><lpage>101</lpage><pub-id pub-id-type="doi">10.1126/science.1123348</pub-id></element-citation></ref><ref id="bib13"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bullock</surname><given-names>AN</given-names></name><name><surname>Das</surname><given-names>S</given-names></name><name><surname>Debreczeni</surname><given-names>JE</given-names></name><name><surname>Rellos</surname><given-names>P</given-names></name><name><surname>Fedorov</surname><given-names>O</given-names></name><name><surname>Niesen</surname><given-names>FH</given-names></name><name><surname>Guo</surname><given-names>K</given-names></name><name><surname>Papagrigoriou</surname><given-names>E</given-names></name><name><surname>Amos</surname><given-names>AL</given-names></name><name><surname>Cho</surname><given-names>S</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name><name><surname>Ghosh</surname><given-names>G</given-names></name><name><surname>Knapp</surname><given-names>S</given-names></name></person-group><year>2009</year><article-title>Kinase domain insertions define distinct roles of CLK kinases in SR protein phosphorylation</article-title><source>Structure</source><volume>17</volume><fpage>352</fpage><lpage>362</lpage><pub-id pub-id-type="doi">10.1016/j.str.2008.12.023</pub-id></element-citation></ref><ref id="bib14"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Capra</surname><given-names>EJ</given-names></name><name><surname>Perchuk</surname><given-names>BS</given-names></name><name><surname>Skerker</surname><given-names>JM</given-names></name><name><surname>Laub</surname><given-names>MT</given-names></name></person-group><year>2012</year><article-title>Adaptive mutations that prevent crosstalk enable the expansion of paralogous signaling protein families</article-title><source>Cell</source><volume>150</volume><fpage>222</fpage><lpage>232</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2012.05.033</pub-id></element-citation></ref><ref id="bib15"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Carroll</surname><given-names>SB</given-names></name></person-group><year>2008</year><article-title>Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution</article-title><source>Cell</source><volume>134</volume><fpage>25</fpage><lpage>36</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2008.06.030</pub-id></element-citation></ref><ref id="bib16"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>C</given-names></name><name><surname>Ha</surname><given-names>BH</given-names></name><name><surname>Thévenin</surname><given-names>AF</given-names></name><name><surname>Lou</surname><given-names>HJ</given-names></name><name><surname>Zhang</surname><given-names>R</given-names></name><name><surname>Yip</surname><given-names>KY</given-names></name><name><surname>Peterson</surname><given-names>JR</given-names></name><name><surname>Gerstein</surname><given-names>M</given-names></name><name><surname>Kim</surname><given-names>PM</given-names></name><name><surname>Filippakopoulos</surname><given-names>P</given-names></name><name><surname>Knapp</surname><given-names>S</given-names></name><name><surname>Boggon</surname><given-names>TJ</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name></person-group><year>2014</year><article-title>Identification of a major determinant for serine-threonine kinase phosphoacceptor specificity</article-title><source>Molecular Cell</source><volume>53</volume><fpage>140</fpage><lpage>147</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2013.11.013</pub-id></element-citation></ref><ref id="bib17"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>RE</given-names></name><name><surname>Thorner</surname><given-names>J</given-names></name></person-group><year>2007</year><article-title>Function and regulation in MAPK signaling pathways: lessons learned from the yeast <italic>Saccharomyces Cerevisiae</italic></article-title><source>Biochimica Et Biophysica Acta</source><volume>1773</volume><fpage>1311</fpage><lpage>1340</lpage><pub-id pub-id-type="doi">10.1016/j.bbamcr.2007.05.003</pub-id></element-citation></ref><ref id="bib18"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cole</surname><given-names>A</given-names></name><name><surname>Frame</surname><given-names>S</given-names></name><name><surname>Cohen</surname><given-names>P</given-names></name></person-group><year>2004</year><article-title>Further evidence that the tyrosine phosphorylation of glycogen synthase kinase-3 (GSK3) in mammalian cells is an autophosphorylation event</article-title><source>The Biochemical Journal</source><volume>377</volume><fpage>249</fpage><lpage>255</lpage><pub-id pub-id-type="doi">10.1042/BJ20031259</pub-id></element-citation></ref><ref id="bib19"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Conant</surname><given-names>GC</given-names></name><name><surname>Wolfe</surname><given-names>KH</given-names></name></person-group><year>2008</year><article-title>Turning a hobby into a job: how duplicated genes find new functions</article-title><volume>9</volume><fpage>938</fpage><lpage>950</lpage><pub-id pub-id-type="doi">10.1038/nrg2482</pub-id></element-citation></ref><ref id="bib20"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Coyle</surname><given-names>SM</given-names></name><name><surname>Flores</surname><given-names>J</given-names></name><name><surname>Lim</surname><given-names>WA</given-names></name></person-group><year>2013</year><article-title>Exploitation of latent allostery enables the evolution of new modes of map kinase regulation</article-title><source>Cell</source><volume>154</volume><fpage>875</fpage><lpage>887</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2013.07.019</pub-id></element-citation></ref><ref id="bib21"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cross</surname><given-names>FR</given-names></name><name><surname>Buchler</surname><given-names>NE</given-names></name><name><surname>Skotheim</surname><given-names>JM</given-names></name></person-group><year>2011</year><article-title>Evolution of networks and sequences in Eukaryotic cell cycle control</article-title><source>Philosophical Transactions of the Royal Society B: Biological Sciences</source><volume>366</volume><fpage>3532</fpage><lpage>3544</lpage><pub-id pub-id-type="doi">10.1098/rstb.2011.0078</pub-id></element-citation></ref><ref id="bib22"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dirick</surname><given-names>L</given-names></name><name><surname>Goetsch</surname><given-names>L</given-names></name><name><surname>Ammerer</surname><given-names>G</given-names></name><name><surname>Byers</surname><given-names>B</given-names></name></person-group><year>1998</year><article-title>Regulation of meiotic S phase by Ime2 and a Clb5,6-associated kinase in <italic>Saccharomyces Cerevisiae</italic></article-title><source>Science</source><volume>281</volume><fpage>1854</fpage><lpage>1857</lpage><pub-id pub-id-type="doi">10.1126/science.281.5384.1854</pub-id></element-citation></ref><ref id="bib23"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Finnigan</surname><given-names>GC</given-names></name><name><surname>Hanson-Smith</surname><given-names>V</given-names></name><name><surname>Stevens</surname><given-names>TH</given-names></name><name><surname>Thornton</surname><given-names>JW</given-names></name></person-group><year>2012</year><article-title>Evolution of increased complexity in a molecular machine</article-title><source>Nature</source><volume>481</volume><fpage>360</fpage><lpage>364</lpage><pub-id pub-id-type="doi">10.1038/nature10724</pub-id></element-citation></ref><ref id="bib24"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fitch</surname><given-names>WM</given-names></name></person-group><year>1971</year><article-title>Toward defining the course of evolution: minimum change for a specific tree topology</article-title><source>Systematic Zoology</source><volume>20</volume><issue>4</issue><fpage>406</fpage><lpage>416</lpage><pub-id pub-id-type="doi">10.1093/sysbio/20.4.406</pub-id></element-citation></ref><ref id="bib25"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Force</surname><given-names>A</given-names></name><name><surname>Lynch</surname><given-names>M</given-names></name><name><surname>Pickett</surname><given-names>FB</given-names></name><name><surname>Amores</surname><given-names>A</given-names></name><name><surname>Yan</surname><given-names>YL</given-names></name><name><surname>Postlethwait</surname><given-names>J</given-names></name></person-group><year>1999</year><article-title>Preservation of duplicate genes by complementary, degenerative mutations</article-title><source>Genetics</source><volume>151</volume><fpage>1531</fpage><lpage>1545</lpage></element-citation></ref><ref id="bib26"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Freschi</surname><given-names>L</given-names></name><name><surname>Courcelles</surname><given-names>M</given-names></name><name><surname>Thibault</surname><given-names>P</given-names></name><name><surname>Michnick</surname><given-names>SW</given-names></name><name><surname>Landry</surname><given-names>CR</given-names></name></person-group><year>2011</year><article-title>Phosphorylation network rewiring by gene duplication</article-title><source>Molecular Systems Biology</source><volume>7</volume><fpage>504</fpage><pub-id pub-id-type="doi">10.1038/msb.2011.43</pub-id></element-citation></ref><ref id="bib27"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fritz-Laylin</surname><given-names>LK</given-names></name><name><surname>Prochnik</surname><given-names>SE</given-names></name><name><surname>Ginger</surname><given-names>ML</given-names></name><name><surname>Dacks</surname><given-names>JB</given-names></name><name><surname>Carpenter</surname><given-names>ML</given-names></name><name><surname>Field</surname><given-names>MC</given-names></name><name><surname>Kuo</surname><given-names>A</given-names></name><name><surname>Paredez</surname><given-names>A</given-names></name><name><surname>Chapman</surname><given-names>J</given-names></name><name><surname>Pham</surname><given-names>J</given-names></name><name><surname>Shu</surname><given-names>S</given-names></name><name><surname>Neupane</surname><given-names>R</given-names></name><name><surname>Cipriano</surname><given-names>M</given-names></name><name><surname>Mancuso</surname><given-names>J</given-names></name><name><surname>Tu</surname><given-names>H</given-names></name><name><surname>Salamov</surname><given-names>A</given-names></name><name><surname>Lindquist</surname><given-names>E</given-names></name><name><surname>Shapiro</surname><given-names>H</given-names></name><name><surname>Lucas</surname><given-names>S</given-names></name><name><surname>Grigoriev</surname><given-names>IV</given-names></name><name><surname>Cande</surname><given-names>WZ</given-names></name><name><surname>Fulton</surname><given-names>C</given-names></name><name><surname>Rokhsar</surname><given-names>DS</given-names></name><name><surname>Dawson</surname><given-names>SC</given-names></name></person-group><year>2010</year><article-title>The genome of naegleria gruberi illuminates early eukaryotic versatility</article-title><source>Cell</source><volume>140</volume><fpage>631</fpage><lpage>642</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2010.01.032</pub-id></element-citation></ref><ref id="bib28"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Fu</surname><given-names>ZJ</given-names></name></person-group><year>2012</year><article-title>Regulations and functions of ICK/MAK/MOK - a novel MAPK-related kinase family linked to human diseases</article-title><person-group person-group-type="editor"><name><surname>Da Silva Xavier</surname><given-names>G</given-names></name></person-group><source>Protein kinases</source><pub-id pub-id-type="doi">10.5772/37964</pub-id></element-citation></ref><ref id="bib29"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname><given-names>Z</given-names></name><name><surname>Larson</surname><given-names>KA</given-names></name><name><surname>Chitta</surname><given-names>RK</given-names></name><name><surname>Parker</surname><given-names>SA</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name><name><surname>Lawrence</surname><given-names>MW</given-names></name><name><surname>Kaldis</surname><given-names>P</given-names></name><name><surname>Galaktionov</surname><given-names>K</given-names></name><name><surname>Cohn</surname><given-names>SM</given-names></name><name><surname>Shabanowitz</surname><given-names>J</given-names></name><name><surname>Hunt</surname><given-names>DF</given-names></name><name><surname>Sturgill</surname><given-names>TW</given-names></name></person-group><year>2006</year><article-title>Identification of Yin-Yang regulators and a phosphorylation consensus for male germ cell-associated kinase (MAK)-related kinase</article-title><source>Molecular and Cellular Biology</source><volume>26</volume><fpage>8639</fpage><lpage>8654</lpage><pub-id pub-id-type="doi">10.1128/MCB.00816-06</pub-id></element-citation></ref><ref id="bib30"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gibson</surname><given-names>DG</given-names></name><name><surname>Young</surname><given-names>L</given-names></name><name><surname>Chuang</surname><given-names>RY</given-names></name><name><surname>Venter</surname><given-names>JC</given-names></name><name><surname>Hutchison</surname><given-names>CA</given-names></name><name><surname>Smith</surname><given-names>HO</given-names></name></person-group><year>2009</year><article-title>Enzymatic assembly of DNA molecules up to several hundred Kilobases</article-title><source>Nature Methods</source><volume>6</volume><fpage>343</fpage><lpage>345</lpage><pub-id pub-id-type="doi">10.1038/nmeth.1318</pub-id></element-citation></ref><ref id="bib31"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Goldman</surname><given-names>A</given-names></name><name><surname>Roy</surname><given-names>J</given-names></name><name><surname>Bodenmiller</surname><given-names>B</given-names></name><name><surname>Wanka</surname><given-names>S</given-names></name><name><surname>Landry</surname><given-names>CR</given-names></name><name><surname>Aebersold</surname><given-names>R</given-names></name><name><surname>Cyert</surname><given-names>MS</given-names></name></person-group><year>2014</year><article-title>The calcineurin signaling network evolves via conserved kinase-phosphatase modules that transcend substrate identity</article-title><source>Molecular Cell</source><volume>55</volume><fpage>422</fpage><lpage>435</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2014.05.012</pub-id></element-citation></ref><ref id="bib32"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Goldsmith</surname><given-names>EJ</given-names></name><name><surname>Akella</surname><given-names>R</given-names></name><name><surname>Min</surname><given-names>X</given-names></name><name><surname>Zhou</surname><given-names>T</given-names></name><name><surname>Humphreys</surname><given-names>JM</given-names></name></person-group><year>2007</year><article-title>Substrate and docking interactions in Serine/Threonine protein kinases</article-title><source>Chemical Reviews</source><volume>107</volume><fpage>5065</fpage><lpage>5081</lpage><pub-id pub-id-type="doi">10.1021/cr068221w</pub-id></element-citation></ref><ref id="bib33"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Guindon</surname><given-names>S</given-names></name><name><surname>Dufayard</surname><given-names>JF</given-names></name><name><surname>Lefort</surname><given-names>V</given-names></name><name><surname>Anisimova</surname><given-names>M</given-names></name><name><surname>Hordijk</surname><given-names>W</given-names></name><name><surname>Gascuel</surname><given-names>O</given-names></name></person-group><year>2010</year><article-title>New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0</article-title><source>Systematic Biology</source><volume>59</volume><fpage>307</fpage><lpage>321</lpage><pub-id pub-id-type="doi">10.1093/sysbio/syq010</pub-id></element-citation></ref><ref id="bib34"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hanson-Smith</surname><given-names>V</given-names></name><name><surname>Kolaczkowski</surname><given-names>B</given-names></name><name><surname>Thornton</surname><given-names>JW</given-names></name></person-group><year>2010</year><article-title>Robustness of ancestral sequence reconstruction to phylogenetic uncertainty</article-title><source>Molecular Biology and Evolution</source><volume>27</volume><fpage>1988</fpage><lpage>1999</lpage><pub-id pub-id-type="doi">10.1093/molbev/msq081</pub-id></element-citation></ref><ref id="bib35"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Himpel</surname><given-names>S</given-names></name><name><surname>Tegge</surname><given-names>W</given-names></name><name><surname>Frank</surname><given-names>R</given-names></name><name><surname>Leder</surname><given-names>S</given-names></name><name><surname>Joost</surname><given-names>HG</given-names></name><name><surname>Becker</surname><given-names>W</given-names></name></person-group><year>2000</year><article-title>Specificity determinants of substrate recognition by the protein kinase DYRK1A</article-title><source>The Journal of Biological Chemistry</source><volume>275</volume><fpage>2431</fpage><lpage>2438</lpage><pub-id pub-id-type="doi">10.1074/jbc.275.4.2431</pub-id></element-citation></ref><ref id="bib36"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Holt</surname><given-names>LJ</given-names></name><name><surname>Hutti</surname><given-names>JE</given-names></name><name><surname>Cantley</surname><given-names>LC</given-names></name><name><surname>Morgan</surname><given-names>DO</given-names></name></person-group><year>2007</year><article-title>Evolution of Ime2 phosphorylation sites on Cdk1 substrates provides a mechanism to limit the effects of the phosphatase Cdc14 in meiosis</article-title><source>Molecular Cell</source><volume>25</volume><fpage>689</fpage><lpage>702</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2007.02.012</pub-id></element-citation></ref><ref id="bib37"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Holt</surname><given-names>LJ</given-names></name><name><surname>Tuch</surname><given-names>BB</given-names></name><name><surname>Villén</surname><given-names>J</given-names></name><name><surname>Johnson</surname><given-names>AD</given-names></name><name><surname>Gygi</surname><given-names>SP</given-names></name><name><surname>Morgan</surname><given-names>DO</given-names></name></person-group><year>2009</year><article-title>Global analysis of Cdk1 substrate phosphorylation sites provides insights into evolution</article-title><source>Science</source><volume>325</volume><fpage>1682</fpage><lpage>1686</lpage><pub-id pub-id-type="doi">10.1126/science.1172867</pub-id></element-citation></ref><ref id="bib38"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hornbeck</surname><given-names>PV</given-names></name><name><surname>Kornhauser</surname><given-names>JM</given-names></name><name><surname>Tkachev</surname><given-names>S</given-names></name><name><surname>Zhang</surname><given-names>B</given-names></name><name><surname>Skrzypek</surname><given-names>E</given-names></name><name><surname>Murray</surname><given-names>B</given-names></name><name><surname>Latham</surname><given-names>V</given-names></name><name><surname>Sullivan</surname><given-names>M</given-names></name></person-group><year>2012</year><article-title>PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse</article-title><source>Nucleic Acids Research</source><volume>40</volume><fpage>D261</fpage><lpage>D270</lpage><pub-id pub-id-type="doi">10.1093/nar/gkr1122</pub-id></element-citation></ref><ref id="bib39"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huse</surname><given-names>M</given-names></name><name><surname>Kuriyan</surname><given-names>J</given-names></name></person-group><year>2002</year><article-title>The conformational plasticity of protein kinases</article-title><source>Cell</source><volume>109</volume><fpage>275</fpage><lpage>282</lpage><pub-id pub-id-type="doi">10.1016/S0092-8674(02)00741-9</pub-id></element-citation></ref><ref id="bib40"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hutchison</surname><given-names>EA</given-names></name><name><surname>Glass</surname><given-names>NL</given-names></name></person-group><year>2010</year><article-title>Meiotic regulators Ndt80 and Ime2 have different roles in Saccharomyces and Neurospora</article-title><source>Genetics</source><volume>185</volume><fpage>1271</fpage><lpage>1282</lpage><pub-id pub-id-type="doi">10.1534/genetics.110.117184</pub-id></element-citation></ref><ref id="bib41"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hutti</surname><given-names>JE</given-names></name><name><surname>Jarrell</surname><given-names>ET</given-names></name><name><surname>Chang</surname><given-names>JD</given-names></name><name><surname>Abbott</surname><given-names>DW</given-names></name><name><surname>Storz</surname><given-names>P</given-names></name><name><surname>Toker</surname><given-names>A</given-names></name><name><surname>Cantley</surname><given-names>LC</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name></person-group><year>2004</year><article-title>A rapid method for determining protein kinase phosphorylation specificity</article-title><source>Nature Methods</source><volume>1</volume><fpage>27</fpage><lpage>29</lpage><pub-id pub-id-type="doi">10.1038/nmeth708</pub-id></element-citation></ref><ref id="bib42"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Irniger</surname><given-names>S</given-names></name></person-group><year>2011</year><article-title>The Ime2 protein kinase family in fungi: more duties than just meiosis</article-title><source>Molecular Microbiology</source><volume>80</volume><fpage>1</fpage><lpage>13</lpage><pub-id pub-id-type="doi">10.1111/j.1365-2958.2011.07575.x</pub-id></element-citation></ref><ref id="bib43"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Joughin</surname><given-names>BA</given-names></name><name><surname>Liu</surname><given-names>C</given-names></name><name><surname>Lauffenburger</surname><given-names>DA</given-names></name><name><surname>Hogue</surname><given-names>CW</given-names></name><name><surname>Yaffe</surname><given-names>MB</given-names></name></person-group><year>2012</year><article-title>Protein kinases display minimal interpositional dependence on substrate sequence: potential implications for the evolution of signalling networks</article-title><source>Philosophical Transactions of the Royal Society B: Biological Sciences</source><volume>367</volume><fpage>2574</fpage><lpage>2583</lpage><pub-id pub-id-type="doi">10.1098/rstb.2012.0010</pub-id></element-citation></ref><ref id="bib44"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kettenbach</surname><given-names>AN</given-names></name><name><surname>Wang</surname><given-names>T</given-names></name><name><surname>Faherty</surname><given-names>BK</given-names></name><name><surname>Madden</surname><given-names>DR</given-names></name><name><surname>Knapp</surname><given-names>S</given-names></name><name><surname>Bailey-Kellogg</surname><given-names>C</given-names></name><name><surname>Gerber</surname><given-names>SA</given-names></name></person-group><year>2012</year><article-title>Rapid determination of multiple linear kinase substrate motifs by mass spectrometry</article-title><source>Chemistry &amp; Biology</source><volume>19</volume><fpage>608</fpage><lpage>618</lpage><pub-id pub-id-type="doi">10.1016/j.chembiol.2012.04.011</pub-id></element-citation></ref><ref id="bib45"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Krylov</surname><given-names>DM</given-names></name><name><surname>Nasmyth</surname><given-names>K</given-names></name><name><surname>Koonin</surname><given-names>EV</given-names></name></person-group><year>2003</year><article-title>Evolution of eukaryotic cell cycle regulation: stepwise addition of regulatory kinases and late advent of the CDKs</article-title><source>Current Biology</source><volume>13</volume><fpage>173</fpage><lpage>177</lpage><pub-id pub-id-type="doi">10.1016/S0960-9822(03)00008-3</pub-id></element-citation></ref><ref id="bib46"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Landry</surname><given-names>CR</given-names></name><name><surname>Levy</surname><given-names>ED</given-names></name><name><surname>Michnick</surname><given-names>SW</given-names></name></person-group><year>2009</year><article-title>Weak functional constraints on phosphoproteomes</article-title><source>Trends in Genetics</source><volume>25</volume><fpage>193</fpage><lpage>197</lpage><pub-id pub-id-type="doi">10.1016/j.tig.2009.03.003</pub-id></element-citation></ref><ref id="bib47"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Le</surname><given-names>SQ</given-names></name><name><surname>Gascuel</surname><given-names>O</given-names></name></person-group><year>2008</year><article-title>An improved general amino acid replacement matrix</article-title><source>Molecular Biology and Evolution</source><volume>25</volume><fpage>1307</fpage><lpage>1320</lpage><pub-id pub-id-type="doi">10.1093/molbev/msn067</pub-id></element-citation></ref><ref id="bib48"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>YJ</given-names></name><name><surname>Jeschke</surname><given-names>GR</given-names></name><name><surname>Roelants</surname><given-names>FM</given-names></name><name><surname>Thorner</surname><given-names>J</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name></person-group><year>2012</year><article-title>Reciprocal phosphorylation of yeast glycerol-3-phosphate dehydrogenases in adaptation to distinct types of stress</article-title><source>Molecular and Cellular Biology</source><volume>32</volume><fpage>4705</fpage><lpage>4717</lpage><pub-id pub-id-type="doi">10.1128/MCB.00897-12</pub-id></element-citation></ref><ref id="bib49"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lim</surname><given-names>WA</given-names></name><name><surname>Pawson</surname><given-names>T</given-names></name></person-group><year>2010</year><article-title>Phosphotyrosine signaling: evolving a new cellular communication system</article-title><source>Cell</source><volume>42</volume><fpage>661</fpage><lpage>667</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2010.08.023</pub-id></element-citation></ref><ref id="bib50"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname><given-names>Y</given-names></name><name><surname>Schmidt</surname><given-names>B</given-names></name><name><surname>Maskell</surname><given-names>DL</given-names></name></person-group><year>2010</year><article-title>MSAProbs: multiple sequence alignment based on pair hidden markov models and partition function posterior probabilities</article-title><source>Bioinformatics</source><volume>26</volume><fpage>1958</fpage><lpage>1964</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/btq338</pub-id></element-citation></ref><ref id="bib51"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lochhead</surname><given-names>PA</given-names></name><name><surname>Sibbet</surname><given-names>G</given-names></name><name><surname>Morrice</surname><given-names>N</given-names></name><name><surname>Cleghon</surname><given-names>V</given-names></name></person-group><year>2005</year><article-title>Activation-loop autophosphorylation is mediated by a novel transitional intermediate form of DYRKs</article-title><source>Cell</source><volume>121</volume><fpage>925</fpage><lpage>936</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2005.03.034</pub-id></element-citation></ref><ref id="bib52"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname><given-names>G</given-names></name><name><surname>Plowman</surname><given-names>GD</given-names></name><name><surname>Hunter</surname><given-names>T</given-names></name><name><surname>Sudarsanam</surname><given-names>S</given-names></name></person-group><year>2002a</year><article-title>Evolution of protein kinase signaling from yeast to man</article-title><source>Trends in Biochemical Sciences</source><volume>27</volume><fpage>514</fpage><lpage>520</lpage><pub-id pub-id-type="doi">10.1016/S0968-0004(02)02179-5</pub-id></element-citation></ref><ref id="bib52a"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Manning</surname><given-names>G</given-names></name><name><surname>Whyte</surname><given-names>DB</given-names></name><name><surname>Martinez</surname><given-names>R</given-names></name><name><surname>Hunter</surname><given-names>T</given-names></name><name><surname>Sudarsanam</surname><given-names>S</given-names></name></person-group><year>2002b</year><article-title>The protein kinase complement of the human genome</article-title><source>Science</source><volume>298</volume><fpage>1912</fpage><lpage>1934</lpage><pub-id pub-id-type="doi">10.1126/science.1075762</pub-id></element-citation></ref><ref id="bib53"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Matsuoka</surname><given-names>S</given-names></name><name><surname>Ballif</surname><given-names>BA</given-names></name><name><surname>Smogorzewska</surname><given-names>A</given-names></name><name><surname>McDonald</surname><given-names>ER</given-names><suffix>III</suffix></name><name><surname>Hurov</surname><given-names>KE</given-names></name><name><surname>Luo</surname><given-names>J</given-names></name><name><surname>Bakalarski</surname><given-names>CE</given-names></name><name><surname>Zhao</surname><given-names>Z</given-names></name><name><surname>Solimini</surname><given-names>N</given-names></name><name><surname>Lerenthal</surname><given-names>Y</given-names></name><name><surname>Shiloh</surname><given-names>Y</given-names></name><name><surname>Gygi</surname><given-names>SP</given-names></name><name><surname>Elledge</surname><given-names>SJ</given-names></name></person-group><year>2007</year><article-title>ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage</article-title><source>Science</source><volume>316</volume><fpage>1160</fpage><lpage>1166</lpage><pub-id pub-id-type="doi">10.1126/science.1140321</pub-id></element-citation></ref><ref id="bib54"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>McDonald</surname><given-names>CM</given-names></name><name><surname>Wagner</surname><given-names>M</given-names></name><name><surname>Dunham</surname><given-names>MJ</given-names></name><name><surname>Shin</surname><given-names>ME</given-names></name><name><surname>Ahmed</surname><given-names>NT</given-names></name><name><surname>Winter</surname><given-names>E</given-names></name></person-group><year>2009</year><article-title>The Ras/cAMP pathway and the CDK-like kinase Ime2 regulate the MAPK Smk1 and spore morphogenesis in <italic>Saccharomyces Cerevisiae</italic></article-title><source>Genetics</source><volume>181</volume><fpage>511</fpage><lpage>523</lpage><pub-id pub-id-type="doi">10.1534/genetics.108.098434</pub-id></element-citation></ref><ref id="bib55"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mok</surname><given-names>J</given-names></name><name><surname>Kim</surname><given-names>PM</given-names></name><name><surname>Lam</surname><given-names>HY</given-names></name><name><surname>Piccirillo</surname><given-names>S</given-names></name><name><surname>Zhou</surname><given-names>X</given-names></name><name><surname>Jeschke</surname><given-names>GR</given-names></name><name><surname>Sheridan</surname><given-names>DL</given-names></name><name><surname>Parker</surname><given-names>SA</given-names></name><name><surname>Desai</surname><given-names>V</given-names></name><name><surname>Jwa</surname><given-names>M</given-names></name><name><surname>Cameroni</surname><given-names>E</given-names></name><name><surname>Niu</surname><given-names>H</given-names></name><name><surname>Good</surname><given-names>M</given-names></name><name><surname>Remenyi</surname><given-names>A</given-names></name><name><surname>Ma</surname><given-names>JL</given-names></name><name><surname>Sheu</surname><given-names>YJ</given-names></name><name><surname>Sassi</surname><given-names>HE</given-names></name><name><surname>Sopko</surname><given-names>R</given-names></name><name><surname>Chan</surname><given-names>CS</given-names></name><name><surname>De Virgilio</surname><given-names>C</given-names></name><name><surname>Hollingsworth</surname><given-names>NM</given-names></name><name><surname>Lim</surname><given-names>WA</given-names></name><name><surname>Stern</surname><given-names>DF</given-names></name><name><surname>Stillman</surname><given-names>B</given-names></name><name><surname>Andrews</surname><given-names>BJ</given-names></name><name><surname>Gerstein</surname><given-names>MB</given-names></name><name><surname>Snyder</surname><given-names>M</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name></person-group><year>2010</year><article-title>Deciphering protein kinase specificity through large-scale analysis of yeast phosphorylation site motifs</article-title><source>Science Signaling</source><volume>3</volume><fpage>ra12</fpage><pub-id pub-id-type="doi">10.1126/scisignal.2000482</pub-id></element-citation></ref><ref id="bib56"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Morgan</surname><given-names>DO</given-names></name></person-group><year>2007</year><source>The Cell Cycle</source><publisher-name>New Science Press</publisher-name></element-citation></ref><ref id="bib57"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Morrison</surname><given-names>DK</given-names></name></person-group><year>2012</year><article-title>MAP kinase pathways</article-title><source>Cold Spring harbor Perspectives in Biology</source><volume>4</volume><pub-id pub-id-type="doi">10.1101/cshperspect.a011254</pub-id></element-citation></ref><ref id="bib58"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moses</surname><given-names>AM</given-names></name><name><surname>Landry</surname><given-names>CR</given-names></name></person-group><year>2010</year><article-title>Moving from transcriptional to phospho-evolution: generalizing regulatory evolution?</article-title><source>Trends in Genetics</source><volume>26</volume><fpage>462</fpage><lpage>467</lpage><pub-id pub-id-type="doi">10.1016/j.tig.2010.08.002</pub-id></element-citation></ref><ref id="bib59"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nguyen Ba</surname><given-names>AN</given-names></name><name><surname>Moses</surname><given-names>AM</given-names></name></person-group><year>2010</year><article-title>Evolution of characterized phosphorylation sites in budding yeast</article-title><source>Molecular Biology and Evolution</source><volume>27</volume><fpage>2027</fpage><lpage>2037</lpage><pub-id pub-id-type="doi">10.1093/molbev/msq090</pub-id></element-citation></ref><ref id="bib60"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Olsen</surname><given-names>JV</given-names></name><name><surname>Blagoev</surname><given-names>B</given-names></name><name><surname>Gnad</surname><given-names>F</given-names></name><name><surname>Macek</surname><given-names>B</given-names></name><name><surname>Kumar</surname><given-names>C</given-names></name><name><surname>Mortensen</surname><given-names>P</given-names></name><name><surname>Mann</surname><given-names>M</given-names></name></person-group><year>2006</year><article-title>Global, in vivo, and site-specific phosphorylation dynamics in signaling networks</article-title><source>Cell</source><volume>127</volume><fpage>635</fpage><lpage>648</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2006.09.026</pub-id></element-citation></ref><ref id="bib61"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ortlund</surname><given-names>EA</given-names></name><name><surname>Bridgham</surname><given-names>JT</given-names></name><name><surname>Redinbo</surname><given-names>MR</given-names></name><name><surname>Thornton</surname><given-names>JW</given-names></name></person-group><year>2007</year><article-title>Crystal structure of an ancient protein: evolution by conformational epistasis</article-title><source>Science</source><volume>317</volume><fpage>1544</fpage><lpage>1548</lpage><pub-id pub-id-type="doi">10.1126/science.1142819</pub-id></element-citation></ref><ref id="bib62"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pearlman</surname><given-names>SM</given-names></name><name><surname>Serber</surname><given-names>Z</given-names></name><name><surname>Ferrell</surname><given-names>JE</given-names></name></person-group><year>2011</year><article-title>A mechanism for the evolution of phosphorylation sites</article-title><source>Cell</source><volume>147</volume><fpage>934</fpage><lpage>946</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2011.08.052</pub-id></element-citation></ref><ref id="bib63"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Reményi</surname><given-names>A</given-names></name><name><surname>Good</surname><given-names>MC</given-names></name><name><surname>Bhattacharyya</surname><given-names>RP</given-names></name><name><surname>Lim</surname><given-names>WA</given-names></name></person-group><year>2005</year><article-title>The role of docking interactions in mediating signaling input, output, and discrimination in the yeast MAPK network</article-title><volume>20</volume><fpage>951</fpage><lpage>962</lpage><pub-id pub-id-type="doi">10.1016/j.molcel.2005.10.030</pub-id></element-citation></ref><ref id="bib64"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sheridan</surname><given-names>DL</given-names></name><name><surname>Kong</surname><given-names>Y</given-names></name><name><surname>Parker</surname><given-names>SA</given-names></name><name><surname>Dalby</surname><given-names>KN</given-names></name><name><surname>Turk</surname><given-names>BE</given-names></name></person-group><year>2008</year><article-title>Substrate discrimination among mitogen-activated protein kinases through distinct docking sequence motifs</article-title><source>The Journal of Biological Chemistry</source><volume>283</volume><fpage>19511</fpage><lpage>19520</lpage><pub-id pub-id-type="doi">10.1074/jbc.M801074200</pub-id></element-citation></ref><ref id="bib65"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sherwood</surname><given-names>RK</given-names></name><name><surname>Scaduto</surname><given-names>CM</given-names></name><name><surname>Torres</surname><given-names>SE</given-names></name><name><surname>Bennett</surname><given-names>RJ</given-names></name></person-group><year>2014</year><article-title>Convergent evolution of a fused sexual cycle promotes the haploid lifestyle</article-title><source>Molecular Systems Biology</source><volume>506</volume><fpage>387</fpage><lpage>390</lpage><pub-id pub-id-type="doi">10.1038/nature12891</pub-id></element-citation></ref><ref id="bib66"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Songyang</surname><given-names>Z</given-names></name><name><surname>Blechner</surname><given-names>S</given-names></name><name><surname>Hoagland</surname><given-names>N</given-names></name><name><surname>Hoekstra</surname><given-names>MF</given-names></name><name><surname>Piwnica-Worms</surname><given-names>H</given-names></name><name><surname>Cantley</surname><given-names>LC</given-names></name></person-group><year>1994</year><article-title>Use of an oriented peptide library to determine the Optimal substrates of protein kinases</article-title><source>Current Biology</source><volume>4</volume><fpage>973</fpage><lpage>982</lpage><pub-id pub-id-type="doi">10.1016/S0960-9822(00)00221-9</pub-id></element-citation></ref><ref id="bib67"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Songyang</surname><given-names>Z</given-names></name><name><surname>Lu</surname><given-names>KP</given-names></name><name><surname>Kwon</surname><given-names>YT</given-names></name><name><surname>Tsai</surname><given-names>LH</given-names></name><name><surname>Filhol</surname><given-names>O</given-names></name><name><surname>Cochet</surname><given-names>C</given-names></name><name><surname>Brickey</surname><given-names>DA</given-names></name><name><surname>Soderling</surname><given-names>TR</given-names></name><name><surname>Bartleson</surname><given-names>C</given-names></name><name><surname>Graves</surname><given-names>DJ</given-names></name><name><surname>DeMaggio</surname><given-names>AJ</given-names></name><name><surname>Hoekstra</surname><given-names>MF</given-names></name><name><surname>Blenis</surname><given-names>J</given-names></name><name><surname>Hunter</surname><given-names>T</given-names></name><name><surname>Cantley</surname><given-names>LC</given-names></name></person-group><year>1996</year><article-title>A structural basis for substrate specificities of protein Ser/Thr kinases: primary sequence preference of casein kinases I and II, NIMA, phosphorylase kinase, calmodulin-dependent kinase II, CDK5, and Erk1</article-title><source>Molecular and Cellular Biology</source><volume>16</volume><fpage>6486</fpage><lpage>6493</lpage></element-citation></ref><ref id="bib68"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Soundararajan</surname><given-names>M</given-names></name><name><surname>Roos</surname><given-names>AK</given-names></name><name><surname>Savitsky</surname><given-names>P</given-names></name><name><surname>Filippakopoulos</surname><given-names>P</given-names></name><name><surname>Kettenbach</surname><given-names>AN</given-names></name><name><surname>Olsen</surname><given-names>JV</given-names></name><name><surname>Gerber</surname><given-names>SA</given-names></name><name><surname>Eswaran</surname><given-names>J</given-names></name><name><surname>Knapp</surname><given-names>S</given-names></name><name><surname>Elkins</surname><given-names>JM</given-names></name></person-group><year>2013</year><article-title>Structures of down syndrome kinases, DYRKs, reveal mechanisms of kinase activation and substrate recognition</article-title><source>Structure</source><volume>21</volume><fpage>986</fpage><lpage>996</lpage><pub-id pub-id-type="doi">10.1016/j.str.2013.03.012</pub-id></element-citation></ref><ref id="bib69"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname><given-names>A</given-names></name></person-group><year>2006</year><article-title>RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of Taxa and mixed models</article-title><source>Bioinformatics</source><volume>22</volume><fpage>2688</fpage><lpage>2690</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/btl446</pub-id></element-citation></ref><ref id="bib70"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname><given-names>C</given-names></name><name><surname>Pasculescu</surname><given-names>A</given-names></name><name><surname>Lim</surname><given-names>W</given-names></name><name><surname>Pawson</surname><given-names>T</given-names></name><name><surname>Bader</surname><given-names>G</given-names></name><name><surname>Linding</surname><given-names>R</given-names></name></person-group><year>2009</year><article-title>Positive selection of tyrosine loss in metazoan evolution</article-title><source>Science</source><volume>325</volume><fpage>1686</fpage><pub-id pub-id-type="doi">10.1126/science.1174301</pub-id></element-citation></ref><ref id="bib71"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Thornton</surname><given-names>JW</given-names></name></person-group><year>2003</year><article-title>Resurrecting the ancestral steroid receptor: ancient origin of estrogen signaling</article-title><source>Science</source><volume>301</volume><fpage>1714</fpage><lpage>1717</lpage><pub-id pub-id-type="doi">10.1126/science.1086185</pub-id></element-citation></ref><ref id="bib72"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Thornton</surname><given-names>JW</given-names></name></person-group><year>2004</year><article-title>Resurrecting ancient genes: experimental analysis of extinct molecules</article-title><source>Nature Reviews Genetics</source><volume>5</volume><fpage>366</fpage><lpage>375</lpage><pub-id pub-id-type="doi">10.1038/nrg1324</pub-id></element-citation></ref><ref id="bib73"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tuch</surname><given-names>BB</given-names></name><name><surname>Li</surname><given-names>H</given-names></name><name><surname>Johnson</surname><given-names>AD</given-names></name></person-group><year>2008</year><article-title>Evolution of eukaryotic transcription circuits</article-title><source>Science</source><volume>319</volume><fpage>1797</fpage><lpage>1799</lpage><pub-id pub-id-type="doi">10.1126/science.1152398</pub-id></element-citation></ref><ref id="bib74"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ubersax</surname><given-names>JA</given-names></name><name><surname>Ferrell</surname><given-names>JE</given-names><suffix>Jnr</suffix></name></person-group><year>2007</year><article-title>Mechanisms of specificity in protein phosphorylation</article-title><source>Nature Reviews Molecular Cell Biology</source><volume>8</volume><fpage>530</fpage><lpage>541</lpage><pub-id pub-id-type="doi">10.1038/nrm2203</pub-id></element-citation></ref><ref id="bib75"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Varadarajan</surname><given-names>N</given-names></name><name><surname>Rodriguez</surname><given-names>S</given-names></name><name><surname>Hwang</surname><given-names>BY</given-names></name><name><surname>Georgiou</surname><given-names>G</given-names></name><name><surname>Iverson</surname><given-names>BL</given-names></name></person-group><year>2008</year><article-title>Highly active and selective endopeptidases with programmed substrate specificities</article-title><source>Nature Chemical Biology</source><volume>4</volume><fpage>290</fpage><lpage>294</lpage><pub-id pub-id-type="doi">10.1038/nchembio.80</pub-id></element-citation></ref><ref id="bib76"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Williams</surname><given-names>PD</given-names></name><name><surname>Pollock</surname><given-names>DD</given-names></name><name><surname>Blackburne</surname><given-names>BP</given-names></name><name><surname>Goldstein</surname><given-names>RA</given-names></name></person-group><year>2006</year><article-title>Assessing the accuracy of ancestral protein reconstruction methods</article-title><source>PLOS Computational Biology</source><volume>2</volume><fpage>e69</fpage><pub-id pub-id-type="doi">10.1371/journal.pcbi.0020069</pub-id></element-citation></ref><ref id="bib77"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Z</given-names></name><name><surname>Kumar</surname><given-names>S</given-names></name></person-group><year>1996</year><article-title>Approximate methods for estimating the pattern of nucleotide substitution and the variation of substitution rates among sites</article-title><source>Molecular Biology and Evolution</source><volume>13</volume><fpage>650</fpage><lpage>659</lpage><pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a025625</pub-id></element-citation></ref><ref id="bib78"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Z</given-names></name><name><surname>Kumar</surname><given-names>S</given-names></name><name><surname>Nei</surname><given-names>M</given-names></name></person-group><year>1995</year><article-title>A new method of inference of ancestral nucleotide and amino acid sequences</article-title><source>Genetics</source><volume>141</volume><fpage>1641</fpage><lpage>1650</lpage></element-citation></ref><ref id="bib79"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yoshida</surname><given-names>M</given-names></name><name><surname>Kawaguchi</surname><given-names>H</given-names></name><name><surname>Sakata</surname><given-names>Y</given-names></name><name><surname>Kominami</surname><given-names>K</given-names></name><name><surname>Hirano</surname><given-names>M</given-names></name><name><surname>Shima</surname><given-names>H</given-names></name><name><surname>Akada</surname><given-names>R</given-names></name><name><surname>Yamashita</surname><given-names>I</given-names></name></person-group><year>1990</year><article-title>Initiation of meiosis and sporulation in <italic>Saccharomyces Cerevisiae</italic> requires a novel protein kinase homologue</article-title><source>Molecular &amp; General Genetics</source><volume>221</volume><fpage>176</fpage><lpage>186</lpage><pub-id pub-id-type="doi">10.1007/BF00261718</pub-id></element-citation></ref><ref id="bib80"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname><given-names>G</given-names></name><name><surname>Fujii</surname><given-names>K</given-names></name><name><surname>Belkina</surname><given-names>N</given-names></name><name><surname>Liu</surname><given-names>Y</given-names></name><name><surname>James</surname><given-names>M</given-names></name><name><surname>Herrero</surname><given-names>J</given-names></name><name><surname>Shaw</surname><given-names>S</given-names></name></person-group><year>2005</year><article-title>Exceptional disfavor for proline at the P + 1 position among AGC and CAMK kinases establishes reciprocal specificity between them and the proline-directed kinases</article-title><source>The Journal of Biological Chemistry</source><volume>280</volume><fpage>10743</fpage><lpage>10748</lpage><pub-id pub-id-type="doi">10.1074/jbc.M413159200</pub-id></element-citation></ref></ref-list></back><sub-article article-type="article-commentary" id="SA1"><front-stub><article-id pub-id-type="doi">10.7554/eLife.04126.025</article-id><title-group><article-title>Decision letter</article-title></title-group><contrib-group content-type="section"><contrib contrib-type="editor"><name><surname>Ferrell</surname><given-names>James</given-names></name><role>Reviewing editor</role><aff><institution>Stanford University</institution>, <country>United States</country></aff></contrib></contrib-group></front-stub><body><boxed-text><p>eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see <ext-link ext-link-type="uri" xlink:href="http://elifesciences.org/review-process">review process</ext-link>). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.</p></boxed-text><p>Thank you for sending your work entitled “Ancestral resurrection reveals evolutionary mechanisms of kinase plasticity” for consideration at eLife. Your article has now been evaluated by Michael Marletta (Senior editor), a Reviewing editor, and 2 reviewers, one of whom is also a member of the Board of Reviewing Editors. As you will see (below) the reviewers thought the work was original and interesting. It is really impressive that one can generate functional protein kinases from maximum-likelihood reconstructions of ancestral sequences, and we suspect that this approach will continue to yield interesting insights into protein evolution. However, they also had one major, substantive issue with the paper's interpretations, which would need to be accommodated in a revised version.</p><p>The Reviewing editors and the other reviewer discussed their comments before we reached this decision, and the Reviewing editor has assembled the following comments to help you prepare a revised submission.</p><p>1) The main problem is the assertion that the ancestral GMGI kinase (AncCMGI) kinase had a strong +1 proline preference, and then this specificity broadened to Pro or Arg, and then finally to Arg, as the fungal IME2s evolved. The Pro preference for AncCMGI is asserted rather unequivocally (“The ancestor of all CMGC kinases had strong specificity for proline at its +1 site”).</p><p>However, data in <xref ref-type="fig" rid="fig3">Figure 3B</xref> actually show pretty modest preference for serine peptides and a bit better for threonine peptides, but neither is as strong a preference as extant proline-directed kinases are for proline. The very modest preference for Pro in the Ser-peptides is particularly problematic given that most phosphorylations occur at Ser residues. Moreover, one of the “alternative” versions of AncCMGI shown in <xref ref-type="fig" rid="fig3s2">Figure 3—figure supplement 2</xref> had even more modest preference for a +1 proline than the version characterized in the main figures. Thus, by your own data, the ancestral kinase was rather “purple” compared to the “blue” S. cerevisiae IME2, the “red” ERKs, JNKs, SAPKs, and CDKs.</p><p>This problem is compounded by the fact that assignments of nodes in the tree are less certain near the base of the tree than near the leaves. For example, it is possible that your (purple) AncNgru protein is really closer to the root of the tree than the (reddish-purple) AncCMGI protein is.</p><p>This is not a problem that needs to be addressed with further experiments or phylogenetic reconstructions, but rather with substantial re-writing of all parts of the paper, acknowledging that your findings are consistent with two plausible evolutionary schemes: one where the ancestral kinase is relatively broad in its +1 specificity, and one where it first becomes broader and then becomes narrower. So in your <xref ref-type="fig" rid="fig6">Figure 6</xref>, either the left-hand species, as you suggest, or the middle species could be the ancestral state.</p></body></sub-article><sub-article article-type="reply" id="SA2"><front-stub><article-id pub-id-type="doi">10.7554/eLife.04126.026</article-id><title-group><article-title>Author response</article-title></title-group></front-stub><body><p>1) The main problem is the assertion that the ancestral GMGI kinase (AncCMGI) kinase had a strong +1 proline preference, and then this specificity broadened to Pro or Arg, and then finally to Arg, as the fungal IME2s evolved. The Pro preference for AncCMGI is asserted rather unequivocally (“The ancestor of all CMGC kinases had strong specificity for proline at its +1 site”).</p><p><italic>However, data in <xref ref-type="fig" rid="fig3">Figure 3B</xref> actually show pretty modest preference for serine peptides and a bit better for threonine peptides, but neither is as strong a preference as extant proline-directed kinases are for proline. The very modest preference for Pro in the Ser-peptides is particularly problematic given that most phosphorylations occur at Ser residues. Moreover, one of the “alternative” versions of AncCMGI shown in <xref ref-type="fig" rid="fig3s2">Figure 3—figure supplement 2</xref> had even more modest preference for a +1 proline than the version characterized in the main figures. Thus, by your own data, the ancestral kinase was rather “purple” compared to the “blue” S. cerevisiae IME2, the “red” ERKs, JNKs, SAPKs, and CDKs.</italic></p><p>This is true. We have substantially revised the tone of the article in response to this comment and acknowledged the two alternative possibilities in the manuscript. Our original contention that AncCMGI was specific for proline came from the PSPL screens for the ML reconstruction, which revealed a strong +1 proline signal (<xref ref-type="fig" rid="fig2">Figure 2B</xref>). However, unlike true “proline-directed” kinases (i.e. cyclin-dependent kinases or MAP kinases), the PSPL data do show a clear arginine signal at the +1 position, indicating that the AncCMGI did have the capacity to phosphorylate +1 Arg substrates. Furthermore, as pointed out by the reviewers, assays with consensus peptide substrates (<xref ref-type="fig" rid="fig3">Figure 3B</xref>) show an even more modest preference for proline. We would point out that the consensus peptide assays could underestimate the influence of the +1 position due to the sequence context, perhaps by fixing the most preferred residues at the -3 and -2 positions. While it is also correct that the different reconstructions of AncCMGI vary in their proline preference, the +1P/R ratio for all of the AncCMGI reconstructions is greater than the +1P/R ratio for all of the AncNgru reconstructions. These data suggest strongly that +1 specificity did indeed broaden with evolution along the CMGI lineage.</p><p>We have addressed this major concern through revising the manuscript in several ways to reduce the emphasis on +1 proline preference in our ancestor. Specifically, in the Results narrative our description of AncCMGI is more nuanced, indicating quantitatively the proline preference from PSPL data (5.6-fold), and emphasizing that this preference is more modest than for extant CDKs and MAPKs. We also refer to differences between the three AncCMGI reconstructions in proline selectivity, and include a new panel in <xref ref-type="fig" rid="fig3s2">Figure 3—figure supplement 2</xref> that shows +1 position PSPL data for multiple reconstructions of each ancestor.</p><p>In addition, we have updated our colors in <xref ref-type="fig" rid="fig3">figure 3A</xref> to reflect the modest proline specificity of AncCMGI.</p><p><italic>This problem is compounded by the fact that assignments of nodes in the tree are less certain near the base of the tree than near the leaves. For example, it is possible that your (purple) AncNgru protein is really closer to the root of the tree than the (reddish-purple) AncCMGI protein is.</italic></p><p>This is extremely unlikely. When using the best-fitting maximum likelihood phylogenetic model (PROTCATLG), the placement of AncNgru as a descendant of AncCMGI is 6.41x10^5 times more likely than the alternative hypothesis in which the Ngru clade (containing LF4 and MOK sequences) is basal to the CMGI clade. Although the true evolutionary history of these proteins cannot be directly observed, the sequence evidence in this case provides overwhelming support for a model in which AncNgru is a descendant of AncCMGI. To put this more intuitively: AncNgru gives rise to only the LF4/RCK/IME2 kinase family while AncCMGI gives rise to the LF4/RCK/IME2, CDK, MAPK, CDKL, CKL, and GSK families. From this standpoint, it is most appropriate to place AncCMGI basal to AncNgru.</p><p><italic>This is not a problem that needs to be addressed with further experiments or phylogenetic reconstructions, but rather with substantial re-writing of all parts of the paper, acknowledging that your findings are consistent with two plausible evolutionary schemes: one where the ancestral kinase is relatively broad in its +1 specificity, and one where it first becomes broader and then becomes narrower. So in your <xref ref-type="fig" rid="fig6">Figure 6</xref>, either the left-hand species, as you suggest, or the middle species could be the ancestral state.</italic></p><p>We agree, and have made this point throughout the text and the Discussion. As described above, our data are not unequivocal regarding the specificity of the deep ancestor, which could phosphorylate +1 arginine to some extent. For example, the Discussion now reads:</p><p>“Taken together with the measured specificities of reconstructed CMGC kinases, these results suggest that the evolutionary trajectory from +1P to +1R passed through a historic ancestor with hybrid +1 PR specificity. However, due to the modest degree of +1 specificity in AncCMGI (5.6-fold by PSPL and 2-fold in the context of an otherwise perfect consensus peptide) together with slight fluctuations in degree of specificity in alternate reconstructions, an alternative possibility is that AncCMGI was a broad specificity kinase that subfunctionalized to modern proline or arginine specific kinases.”</p><p>With respect to <xref ref-type="fig" rid="fig6">figure 6</xref>, we have updated the <xref ref-type="fig" rid="fig6">Figure 6</xref> title to “Simplified schematic of one possible path of kinase network evolution” to make it clear that we are simply exemplifying our discussion with a visual example that helps the reader think about the experiment in <xref ref-type="fig" rid="fig5">Figure 5</xref> where we know for certain that we have expanded the specificity of the extant Ime2 kinase.</p></body></sub-article></article>