Permalink
Switch branches/tags
Nothing to show
Find file
Fetching contributors…
Cannot retrieve contributors at this time
1 lines (1 sloc) 177 KB
<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.1d1 20130915//EN" "JATS-archivearticle1.dtd"><article article-type="research-article" dtd-version="1.1d1" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="nlm-ta">elife</journal-id><journal-id journal-id-type="hwp">eLife</journal-id><journal-id journal-id-type="publisher-id">eLife</journal-id><journal-title-group><journal-title>eLife</journal-title></journal-title-group><issn publication-format="electronic">2050-084X</issn><publisher><publisher-name>eLife Sciences Publications, Ltd</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">01426</article-id><article-id pub-id-type="doi">10.7554/eLife.01426</article-id><article-categories><subj-group subj-group-type="display-channel"><subject>Research article</subject></subj-group><subj-group subj-group-type="heading"><subject>Genomics and evolutionary biology</subject></subj-group><subj-group subj-group-type="heading"><subject>Plant biology</subject></subj-group></article-categories><title-group><article-title>The genomic landscape of meiotic crossovers and gene conversions in <italic>Arabidopsis thaliana</italic></article-title></title-group><contrib-group><contrib contrib-type="author" equal-contrib="yes" id="author-7446"><name><surname>Wijnker</surname><given-names>Erik</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="aff" rid="aff2"/><xref ref-type="fn" rid="equal-contrib">†</xref><xref ref-type="other" rid="par-4"/><xref ref-type="fn" rid="con1"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" equal-contrib="yes" id="author-7447"><name><surname>Velikkakam James</surname><given-names>Geo</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="fn" rid="equal-contrib">†</xref><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con3"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7448"><name><surname>Ding</surname><given-names>Jia</given-names></name><xref ref-type="aff" rid="aff4"/><xref ref-type="fn" rid="con4"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7449"><name><surname>Becker</surname><given-names>Frank</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="fn" rid="con7"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7450"><name><surname>Klasen</surname><given-names>Jonas R</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con5"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7451"><name><surname>Rawat</surname><given-names>Vimal</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con6"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7452"><name><surname>Rowan</surname><given-names>Beth A</given-names></name><xref ref-type="aff" rid="aff5"/><xref ref-type="other" rid="par-2"/><xref ref-type="fn" rid="con11"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7475"><name><surname>de Jong</surname><given-names>Daniël F</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="aff" rid="aff6"/><xref ref-type="fn" rid="con8"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7476"><name><surname>de Snoo</surname><given-names>C Bastiaan</given-names></name><xref ref-type="aff" rid="aff6"/><xref ref-type="fn" rid="con9"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7455"><name><surname>Zapata</surname><given-names>Luis</given-names></name><xref ref-type="aff" rid="aff7"/><xref ref-type="fn" rid="con12"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-8873"><name><surname>Huettel</surname><given-names>Bruno</given-names></name><xref ref-type="aff" rid="aff8"/><xref ref-type="fn" rid="con10"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7457"><name><surname>de Jong</surname><given-names>Hans</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="fn" rid="con14"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-6394"><name><surname>Ossowski</surname><given-names>Stephan</given-names></name><xref ref-type="aff" rid="aff7"/><xref ref-type="fn" rid="con13"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-1030"><name><surname>Weigel</surname><given-names>Detlef</given-names></name><xref ref-type="aff" rid="aff5"/><xref ref-type="other" rid="par-1"/><xref ref-type="fn" rid="con15"/><xref ref-type="fn" rid="conf1"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" id="author-7458"><name><surname>Koornneef</surname><given-names>Maarten</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="aff" rid="aff4"/><xref ref-type="fn" rid="con16"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" corresp="yes" id="author-7459"><name><surname>Keurentjes</surname><given-names>Joost JB</given-names></name><xref ref-type="aff" rid="aff1"/><xref ref-type="corresp" rid="cor1">*</xref><xref ref-type="fn" rid="con2"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><contrib contrib-type="author" corresp="yes" id="author-6395"><name><surname>Schneeberger</surname><given-names>Korbinian</given-names></name><xref ref-type="aff" rid="aff3"/><xref ref-type="corresp" rid="cor2">*</xref><xref ref-type="other" rid="par-3"/><xref ref-type="fn" rid="con17"/><xref ref-type="fn" rid="conf2"/><xref ref-type="other" rid="dataro1"/></contrib><aff id="aff1"><institution content-type="dept">Laboratory of Genetics</institution>, <institution>Wageningen University</institution>, <addr-line><named-content content-type="city">Wageningen</named-content></addr-line>, <country>Netherlands</country></aff><aff id="aff2"><institution content-type="dept">Department of Molecular Mechanisms of Phenotypic Plasticity</institution>, <institution>IBMP–CNRS, Université de Strasbourg</institution>, <addr-line><named-content content-type="city">Strasbourg</named-content></addr-line>, <country>France</country></aff><aff id="aff3"><institution content-type="dept">Department of Plant Developmental Biology</institution>, <institution>Max Planck Institute for Plant Breeding Research</institution>, <addr-line><named-content content-type="city">Cologne</named-content></addr-line>, <country>Germany</country></aff><aff id="aff4"><institution content-type="dept">Department of Plant Breeding and Genetics</institution>, <institution>Max Planck Institute for Plant Breeding Research</institution>, <addr-line><named-content content-type="city">Cologne</named-content></addr-line>, <country>Germany</country></aff><aff id="aff5"><institution content-type="dept">Department of Molecular Biology</institution>, <institution>Max Planck Institute for Developmental Biology</institution>, <addr-line><named-content content-type="city">Tübingen</named-content></addr-line>, <country>Germany</country></aff><aff id="aff6"><institution content-type="dept">Rijk Zwaan R&amp;D Fijnaart</institution>, <institution>Rijk Zwaan</institution>, <addr-line><named-content content-type="city">Fijnaart</named-content></addr-line>, <country>Netherlands</country></aff><aff id="aff7"><institution content-type="dept">Bioinformatics and Genomics Programme</institution>, <institution>Center for Genomic Regulation (CRG) and Universitat Pompeu Fabra (UPF)</institution>, <addr-line><named-content content-type="city">Barcelona</named-content></addr-line>, <country>Spain</country></aff><aff id="aff8"><institution content-type="dept">Max Planck Genome Center Cologne</institution>, <institution>Max Planck Institute for Plant Breeding Research</institution>, <addr-line><named-content content-type="city">Cologne</named-content></addr-line>, <country>Germany</country></aff></contrib-group><contrib-group content-type="section"><contrib contrib-type="editor"><name><surname>McVean</surname><given-names>Gil</given-names></name><role>Reviewing editor</role><aff><institution>Oxford University</institution>, <country>United Kingdom</country></aff></contrib></contrib-group><author-notes><corresp id="cor1"><label>*</label>For correspondence: <email>Joost.Keurentjes@wur.nl</email> (JJBK);</corresp><corresp id="cor2"><label>*</label>For correspondence: <email>schneeberger@mpipz.mpg.de</email> (KS)</corresp><fn fn-type="con" id="equal-contrib"><label>†</label><p>These authors contributed equally to this work</p></fn></author-notes><pub-date date-type="pub" publication-format="electronic"><day>17</day><month>12</month><year>2013</year></pub-date><pub-date pub-type="collection"><year>2013</year></pub-date><volume>2</volume><elocation-id>e01426</elocation-id><history><date date-type="received"><day>26</day><month>08</month><year>2013</year></date><date date-type="accepted"><day>07</day><month>11</month><year>2013</year></date></history><permissions><copyright-statement>© 2013, Wijnker et al</copyright-statement><copyright-year>2013</copyright-year><copyright-holder>Wijnker et al</copyright-holder><license xlink:href="http://creativecommons.org/licenses/by/3.0/"><license-p>This article is distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</ext-link>, which permits unrestricted use and redistribution provided that the original author and source are credited.</license-p></license></permissions><self-uri content-type="pdf" xlink:href="elife01426.pdf"/><abstract><object-id pub-id-type="doi">10.7554/eLife.01426.001</object-id><p>Knowledge of the exact distribution of meiotic crossovers (COs) and gene conversions (GCs) is essential for understanding many aspects of population genetics and evolution, from haplotype structure and long-distance genetic linkage to the generation of new allelic variants of genes. To this end, we resequenced the four products of 13 meiotic tetrads along with 10 doubled haploids derived from <italic>Arabidopsis thaliana</italic> hybrids. GC detection through short reads has previously been confounded by genomic rearrangements. Rigid filtering for misaligned reads allowed GC identification at high accuracy and revealed an ∼80-kb transposition, which undergoes copy-number changes mediated by meiotic recombination. Non-crossover associated GCs were extremely rare most likely due to their short average length of ∼25–50 bp, which is significantly shorter than the length of CO-associated GCs. Overall, recombination preferentially targeted non-methylated nucleosome-free regions at gene promoters, which showed significant enrichment of two sequence motifs.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.001">http://dx.doi.org/10.7554/eLife.01426.001</ext-link></p></abstract><abstract abstract-type="executive-summary"><object-id pub-id-type="doi">10.7554/eLife.01426.002</object-id><title>eLife digest</title><p>Most living organisms package their DNA into bundles called chromosomes. These chromosomes generally form pairs, with each chromosome in the pair containing the same number of genes. The genes also come in the same order, but the exact sequence of DNA bases within the genes can be different.</p><p>When sex cells—such as egg, sperm or pollen cells—are made, each pair of chromosomes is separated so that the each sex cell contains only half the normal number of chromosomes. However, before they are separated, the pairs swap lengths of DNA via recombination events. These can involve exchanging large chunks of the chromosomes: this is called a ‘crossover’. Alternatively, short stretches of one chromosome can be replaced by the corresponding region from the other in the pair. When these ‘non-crossovers’ cause a change in the DNA sequence they are known as gene conversions.</p><p>Long-standing questions in the field of plant biology include: how common are gene conversions? How much DNA is typically exchanged? And where in the chromosomes do these events happen most? Now, Wijnker et al. have addressed these questions by focusing on the accurate detection of recombination events, with a special emphasis on gene conversions, in the plant biologist’s favourite species: Arabidopsis.</p><p>Searching for recombination events is a challenge because, when piecing together an entire genome from lots of shorter stretches of DNA—typically called ‘reads’, it is easy to misplace some of the pieces. However, meticulous examination of these short DNA reads allowed Wijnker et al. to reliably identify gene conversions on a genome-wide scale. In Arabidopsis, gene conversion appears to be unexpectedly rare—with approximately one gene conversion detected per 140–240 non-crossovers. Recombination tends to occur in regions of the chromosomes where the DNA is only loosely packaged, is not heavily modified by the process of ‘DNA methylation’, and also near the start of genes. Furthermore, two specific sequences of DNA bases were identified that marked ‘hot spots’ in the chromosomes, where recombination happens more frequently.</p><p>Wijnker et al. suggest that the low number of gene conversions detected indicates that non-crossovers tend to exchange very short stretches of DNA. However, future research may point to additional mechanisms that explain the low incidence of gene conversion in Arabidopsis.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.002">http://dx.doi.org/10.7554/eLife.01426.002</ext-link></p></abstract><kwd-group kwd-group-type="author-keywords"><title>Author keywords</title><kwd>meiotic recombination</kwd><kwd>gene conversion</kwd><kwd>next generation sequencing</kwd><kwd>double strand break repair</kwd><kwd>meiotic tetrads</kwd></kwd-group><kwd-group kwd-group-type="research-organism"><title>Research organism</title><kwd>Arabidopsis</kwd></kwd-group><funding-group><award-group id="par-1"><funding-source><institution-wrap><institution>Gottfried Wilhelm Leibniz Award from the Deutsche Forschungsgemeinschaft</institution></institution-wrap></funding-source><principal-award-recipient><name><surname>Weigel</surname><given-names>Detlef</given-names></name></principal-award-recipient></award-group><award-group id="par-2"><funding-source><institution-wrap><institution>Alexander von Humboldt Foundation</institution></institution-wrap></funding-source><principal-award-recipient><name><surname>Rowan</surname><given-names>Beth A</given-names></name></principal-award-recipient></award-group><award-group id="par-3"><funding-source><institution-wrap><institution>Max Planck Society</institution></institution-wrap></funding-source><principal-award-recipient><name><surname>Velikkakam James</surname><given-names>Geo</given-names></name><name><surname>Klasen</surname><given-names>Jonas R</given-names></name><name><surname>Rawat</surname><given-names>Vimal</given-names></name><name><surname>Schneeberger</surname><given-names>Korbinian</given-names></name></principal-award-recipient></award-group><award-group id="par-4"><funding-source><institution-wrap><institution>EMBO long-term fellowship</institution></institution-wrap></funding-source><principal-award-recipient><name><surname>Wijnker</surname><given-names>Erik</given-names></name></principal-award-recipient></award-group><funding-statement>The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.</funding-statement></funding-group><custom-meta-group><custom-meta><meta-name>elife-xml-version</meta-name><meta-value>2</meta-value></custom-meta><custom-meta specific-use="meta-only"><meta-name>Author impact statement</meta-name><meta-value>A comprehensive and genome-wide description of the genomic make-up and frequency of meiotic recombination in <italic>Arabidopsis thaliana</italic> reveals regional preferences, including nucleosome-free regions, and two associated recombination motifs.</meta-value></custom-meta></custom-meta-group></article-meta></front><body><sec id="s1" sec-type="intro"><title>Introduction</title><p>Sexually reproducing organisms are thought to have an advantage over asexual species, as novel allele combinations can emerge after meiosis in each generation (<xref ref-type="bibr" rid="bib41">Otto and Lenormand, 2002</xref>; <xref ref-type="bibr" rid="bib16">de Visser and Elena, 2007</xref>). Besides randomly passing on one of the homologous chromosomes to the next generation, meiosis introduces intra-chromosomal recombination. In addition to potentially generating superior allele combinations, this shuffling of alleles avoids Muller’s ratchet, in which asexual individuals accumulate deleterious mutations over time (<xref ref-type="bibr" rid="bib37">Muller, 1932</xref>). The impact of meiotic recombination on the allele distribution among offspring, and consequently on the haplotype structure of whole populations, depends on the rate and positioning of recombination events, which are initiated by the induction of meiotic double strand breaks (DSBs).</p><p>Meiotic DSBs are repaired through homologous recombination (HR), in which homologous sequences are used as repair templates (<xref ref-type="bibr" rid="bib49">San Filippo et al., 2008</xref>). The broken strand invades a homologous chromosome and is repaired through HR repair intermediates like the D-loop and double Holliday junction. These intermediates are resolved either as crossovers (COs), leading to the reciprocal exchange of complete chromosome arms, or non-crossovers (NCOs). In both cases, heteroduplexes may arise from sequence divergence at the site of the strand invasion. When these are resolved, they can give rise to gene conversions (GCs), the non-reciprocal exchange of alleles between homologous non-sister chromatids. Such events typically lead to a 3:1 segregation of alleles among the four gametes of a single meiosis. GCs can be associated with COs (CO–GCs) or with NCOs (NCO–GCs) where they can introduce new alleles into an otherwise unchanged genetic background, depending on how the HR intermediate was resolved.</p><p>The number of GCs depends on the number of DSBs, polymorphism density, and on the tract lengths of HR repair intermediates (<xref ref-type="bibr" rid="bib25">Kauppi et al., 2009</xref>; <xref ref-type="bibr" rid="bib17">Dooner and Martínez-Férez, 1997</xref>). Though some general characteristics of meiotic recombination may be shared among sexually reproducing organisms, such as the 150–250 DSBs that are formed at the onset of meiosis in yeast (<italic>Saccaromyces cerevisiae</italic>) (<xref ref-type="bibr" rid="bib64">Weiner and Kleckner, 1994</xref>; <xref ref-type="bibr" rid="bib7">Buhler et al., 2007</xref>), mouse (<xref ref-type="bibr" rid="bib35">Moens et al., 1997</xref>) and <italic>A. thaliana</italic> (<xref ref-type="bibr" rid="bib61">Vignard et al., 2007</xref>; <xref ref-type="bibr" rid="bib9">Chelysheva et al., 2007</xref>; <xref ref-type="bibr" rid="bib50">Sanchez-Moran et al., 2007</xref>), the outcomes of meiosis with around 90, 27 and 10 COs per meiosis are profoundly different (<xref ref-type="bibr" rid="bib31">Mancera et al., 2008</xref>; <xref ref-type="bibr" rid="bib36">Moens et al., 2002</xref>; <xref ref-type="bibr" rid="bib21">Giraut et al., 2011</xref>; <xref ref-type="bibr" rid="bib48">Salome et al., 2012</xref>).</p><p>There are three recent reports assessing GC rates in <italic>A. thaliana</italic>. <xref ref-type="bibr" rid="bib30">Lu et al. (2012)</xref> sequenced two meiotic tetrads at ∼13x coverage (<xref ref-type="bibr" rid="bib30">Lu et al., 2012</xref>) by making use of the quartet mutant (<italic>qrt</italic>), which allows for unordered tetrad analysis. They identified three CO–GCs and two NCO–GCs per meiosis. <xref ref-type="bibr" rid="bib67">Yang et al. (2012)</xref> investigated NCO–GCs through whole-genome sequencing of 40 F<sub>2</sub> offspring, reporting a 130-fold higher GC rate than <xref ref-type="bibr" rid="bib30">Lu et al. (2012)</xref>. Finally, <xref ref-type="bibr" rid="bib59">Sun et al. (2012)</xref> made use of fluorescent reporters (<xref ref-type="bibr" rid="bib19">Francis et al., 2007</xref>) in a <italic>qrt</italic> mutant background to assess GC rates at transgenic loci, which when extrapolated to the whole-genome suggest a similar GC rate as <xref ref-type="bibr" rid="bib30">Lu et al. (2012)</xref>.</p><p>We have sequenced the four products of each of 13 meiotic tetrads and 10 homozygous doubled haploid (DH) offspring obtained from heterozygous F<sub>1</sub> hybrids. We show that short read alignment-based artifacts, which result from structural differences between parental genomes, can greatly inflate the apparent incidence of GCs. In total, we identified over 200 recombination events to the nucleotide level. These data suggest that recombination does not seem to be affected by sequence divergence, but revealed their preference for open chromatin and a significant association to two sequence motifs. Aided by simulations we provide estimates on gene conversion rates and associated tract lengths. CO-associated GCs are at ∼300–400 bp between 6–16 times longer than GCs resulting from NCOs (25–50 bp). The recovery of a ∼80 kb transposition that through crossover recombination leads to duplications and deletions in the offspring underlines the potential of meiosis to not only create new allele combinations but also to cause copy-number alterations of transposed sequences.</p></sec><sec id="s2" sec-type="results"><title>Results</title><sec id="s2-1"><title>Sequencing and genotyping of meiotic tetrads</title><p>We constructed F<sub>1</sub> hybrids of the <italic>A. thaliana</italic> accessions Columbia (Col) and Landsberg <italic>erecta</italic> (L<italic>er</italic>) in the <italic>quartet1 (qrt1</italic>) −/− background (<xref ref-type="bibr" rid="bib45">Preuss et al., 1994</xref>) and crossed individual pollen tetrads to a male sterile EMS mutant of the Cape Verde Island (Cvi) accession as a female receptor. This generated four heterozygous offspring, each composed of one haploid genome of Cvi and one haploid recombinant Col-L<italic>er</italic> genome, referred hereafter to as a complete tetrad (<xref ref-type="fig" rid="fig1">Figure 1A</xref>, ‘Materials and methods’). The 20 genomes of five complete tetrads were sequenced at an average coverage of 54x, whereas eight other complete tetrads were sequenced at lower average coverage of 14x, amounting to a total of 52 sequenced tetrad offspring (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1A</xref>). In addition we sequenced the genomes of all three parental accessions and performed standard resequencing analyses, which yielded a list of 269,842 high quality bi-allelic single nucleotide polymorphism (SNP) markers that distinguish the Col and L<italic>er</italic> alleles (average coverage 56x, <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2A</xref>, ‘Materials and methods’) (<xref ref-type="bibr" rid="bib39">Ossowski et al., 2008</xref>).<fig-group><fig id="fig1" position="float"><object-id pub-id-type="doi">10.7554/eLife.01426.003</object-id><label>Figure 1.</label><caption><title>Experimental design and summary of recombination events within 62 recombinants.</title><p>(<bold>A</bold>) 13 complete tetrads were generated by crossing <italic>qrt1</italic> in a Col background to <italic>qrt1</italic> in a L<italic>er</italic> background, and then using single pollen tetrads from the F<sub>1</sub> hybrids to fertilize a Cvi male sterile pollen receptor. Tetrad offspring are heterozygous, with one recombinant Col-L<italic>er</italic> genome and one Cvi genome. 10 DH lines were generated by crossing wildtype Col-L<italic>er</italic> F<sub>1</sub> hybrids to the <italic>GFP-tailswap</italic> haploid inducer. One round of selfing generated doubled haploids. (<bold>B</bold>) Example of graphical genotypes of all five Col-L<italic>er</italic> recombinant chromosomes of all four offspring of a complete tetrad. Col regions are shown in red, L<italic>er</italic> regions in blue. The homologous chromosome, which is inherited from Cvi, is not shown. The three enlarged regions show a CO–GC, a CO without GC and a NCO–GC (clockwise, starting at the upper left corner). (<bold>C</bold>) All recombination events identified in this study. Different recombination types are labeled with different colors. Centromere positions are indicted with black lines. (<bold>D</bold>) Example of the graphical genotypes of the five chromosomes of one DH line (Col regions are shown in red, L<italic>er</italic> regions in blue). The two enlarged regions show a NCO–GC (left) and a CO without GC (right).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.003">http://dx.doi.org/10.7554/eLife.01426.003</ext-link></p></caption><graphic xlink:href="elife01426f001"/></fig><fig id="fig1s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.004</object-id><label>Figure 1—figure supplement 1.</label><caption><title>Graphical genotypes of all 13 complete tetrads.</title><p>For each tetrad, all five chromosomes of all four offspring are shown (chromosome one to five, top to bottom). The four offspring genomes of one tetrad are plotted one above the other. Black lines depict peri-centromeric regions. The upper five tetrads (29, 38, 40, 58, and 62) are the deeply sequenced tetrads and were used for the detection of NCO–GCs.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.004">http://dx.doi.org/10.7554/eLife.01426.004</ext-link></p></caption><graphic xlink:href="elife01426fs001"/></fig><fig id="fig1s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.005</object-id><label>Figure 1—figure supplement 2.</label><caption><title>Graphical genotypes of 10 DH lines.</title><p>For each DH line, all five chromosomes are shown, chromosome one to five (top to bottom). Black lines depict peri-centromeric regions.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.005">http://dx.doi.org/10.7554/eLife.01426.005</ext-link></p></caption><graphic xlink:href="elife01426fs002"/></fig></fig-group></p><p>To reconstruct the recombinant Col-L<italic>er</italic> chromosomes of individual tetrad offspring, we first assigned either a Col or L<italic>er</italic> genotype to every marker in the genome and subsequently merged blocks of consecutive markers with the same genotypes for the identification of crossover break points. This block-merging step occasionally merged interspersed markers with different genotypes. While this efficiently removes all genotyping errors, it will not allow for the identification of any GCs. The visualizations of these stretches were used to create graphical genotypes (e.g., in <xref ref-type="fig" rid="fig1">Figure 1B</xref>).</p><p>Since COs lead to reciprocal exchange, complete tetrads allow the reconstruction of all CO recombination events of a single meiosis. At least one CO was identified on each chromosome, with an average of 10.1 COs per meiosis (<xref ref-type="fig" rid="fig1">Figure 1B</xref>, <xref ref-type="fig" rid="fig1s1">Figure 1—figure supplement 1</xref>). The larger chromosomes one, three and five showed more COs than chromosomes two and four, which is in agreement with earlier reports on CO frequency in male meiosis in <italic>A. thaliana</italic> (<xref ref-type="bibr" rid="bib21">Giraut et al., 2011</xref>; <xref ref-type="bibr" rid="bib65">Wijnker et al., 2012</xref>) (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1B</xref>). In total, we recovered 131 COs throughout all 13 tetrads (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1C</xref>).</p></sec><sec id="s2-2"><title>NCO identification in tetrad offspring</title><p>Unlike COs, NCOs can only be identified when they result in GCs. NCO–GCs can be detected by searching for L<italic>er</italic> alleles in regions inherited from Col, and <italic>vice versa</italic>. We distinguished between two different types of NCO–GCs. The first type can be detected at markers where the expected allele is isogenic to the recipient parent (Cvi). Here, NCO–GCs alter a homozygous into a heterozygous genotype (type-1 NCO–GCs). The second type is detected when the expected allele is different from the Cvi allele, and NCO–GCs change a heterozygous genotype into a homozygous genotype (type-2 NCO–GCs, <xref ref-type="fig" rid="fig2s1">Figure 2—figure supplement 1</xref>). As the footprint for type-2 NCO–GCs is the absence of one of the parental alleles, high sequencing depth is required to be confident in assigning a null-allele. In a previous study, this was addressed by removing all type-2 NCO–GCs from the analysis, effectively removing around 50% of the markers (<xref ref-type="bibr" rid="bib30">Lu et al., 2012</xref>).</p><p>Precise identification of NCO–GCs relies on accurate detection of homozygous and heterozygous states at polymorphic markers, which in turn depends on two criteria: a reliable short read alignment-based allele frequency threshold for distinguishing between homozygous and heterozygous genotypes, and a sufficient number of aligned sequence reads. In order to define the first, we summarized short read alignment-derived frequencies of the parental allele at all type-1 and type-2 marker positions. Fitting beta distributions to the observed allele frequency distributions allowed for the identification of a global, coverage-dependent cutoff that distinguishes between heterozygous and homozygous genotypes (<xref ref-type="fig" rid="fig2s2">Figure 2—figure supplement 2</xref>, ‘Materials and methods’).</p><p>To establish a minimum coverage threshold, we reasoned that insufficient coverage will lead to increased detection of (false-positive) NCO–GCs, but once a critical coverage is met, higher coverage requirements will not further impact the frequency of NCO–GCs candidate predictions. We calculated the frequency of NCO–GCs (defined as the ratio of converted markers divided by all markers) for a wide range of minimal coverage thresholds (<xref ref-type="fig" rid="fig2s3">Figure 2—figure supplement 3</xref>). At coverage levels of at least 50x, the frequency of NCO–GCs remained stable, and this sequence depth was subsequently used as the minimal threshold.</p><p>Combining both the allele frequency and the coverage thresholds, we were able to confidently assign genotypes to 2,477,241 markers within the five deeply sequenced complete tetrads. Among these, 1,359 revealed putative NCO–GCs.</p></sec><sec id="s2-3"><title>Genome rearrangements can lead to false-positive NCO–GC calls</title><p>Type-1 and type-2 NCO–GC classification is a consequence of allele sharing with the recipient Cvi genotype. Because Col and L<italic>er</italic> alleles segregate in an equal (2:2) ratio among the four tetrad offspring, both types of NCO–GCs are expected to occur at similar numbers. However, we identified a striking overrepresentation of type-1 NCO–GCs (<xref ref-type="fig" rid="fig2s4">Figure 2—figure supplement 4</xref>). As mentioned, type-1 NCO–GCs introduce heterozygous genotype calls at markers where homozygous genotype calls would be expected. Such patterns of putative heterozygous alleles can also be introduced by erroneously aligned reads. By manually screening short read alignments, we found a large fraction of putative NCO–GCs residing in regions in which the parental lines had been assigned heterozygous genotypes. Since all three parental lines are highly inbred, and hence homozygous throughout their genomes, the presence of heterozygous marker calls can only be explained by misaligned reads that result from unknown rearrangements or repeats. This is especially critical for L<italic>er</italic> and Cvi, where genome information is more limited than for the reference accession Col. <xref ref-type="fig" rid="fig2">Figure 2A</xref> shows an example where three haplotypes appear to be present at a single locus when the short read data of L<italic>er</italic> are aligned against the Col reference genome. This specific marker was used to assign NCO–GCs in two earlier studies (<xref ref-type="bibr" rid="bib67">Yang et al., 2012</xref>; <xref ref-type="bibr" rid="bib30">Lu et al., 2012</xref>). To prevent such erroneous calls, we redefined the initial set of markers by excluding all markers near regions with evidence of putative duplications (‘Materials and methods’).<fig-group><fig id="fig2" position="float"><object-id pub-id-type="doi">10.7554/eLife.01426.006</object-id><label>Figure 2.</label><caption><title>Identification of gene conversions is complex because of unknown duplications and transpositions in the <italic>A. thaliana</italic> genome.</title><p>(<bold>A</bold>) Short read alignments of L<italic>er</italic> against the reference sequence at position 13,957,178 on chromosome 1. Individual reads are shown as blue lines, while mismatches between reference sequence and short reads are colored according to mismatch types. Three distinct L<italic>er</italic> haplotypes align to this region, indicating that this sequence is present in triplicate in the L<italic>er</italic> genome. As L<italic>er</italic> is homozygous, at least two haplotypes were not aligned to their respective origin. (<bold>B</bold>) The genomic landscape of the two insertion sites of an ∼80 kb transposition between L<italic>er</italic> and Col. Blue and red boxes mark sequences that are unique to L<italic>er</italic> and Col respectively. Green boxes highlight the transposed (and inverted) DNA. Genes annotated in Col are shown in grey. (<bold>C</bold>) Graphical genotypes of chromosome 3 of the four genomes of tetrad 58 (Col-derived genomic regions are shown in red, L<italic>er</italic> regions in blue). Cvi sequences are not shown. Grey arrows indicate the insertion sites of the transposition illustrated in <bold>B</bold>. Tetrad 58_1 and tetrad 58_2 formed a crossover between these insertion sites. As a result, tetrad 58_1 lost all transposed sequences, whereas in tetrad 58_2 the transposed DNA is duplicated. (<bold>D</bold>) Short read alignments of all four genomes of tetrad 58 to chromosome 3 at positions 22,565,274 to 22,565,374, which overlap the transposed DNA. This region includes two closely linked SNPs that can distinguish all three parental alleles (black dots indicate mismatches to the reference sequence). The reads that can be assigned to one of the three parents are shown by different colors. Tetrad 58_1, which lost the transposed DNA, shows the absence of Col and L<italic>er</italic> derived reads, whereas tetrad 58_2, which inherited both transposed regions, shows the presence of both Col and L<italic>er</italic> alleles in this region. (<bold>E</bold>) Redrawing of the graphical genotypes of six Col-L<italic>er</italic> F<sub>2</sub> offspring, as presented in the appendix of <xref ref-type="bibr" rid="bib67">Yang et al. (2012)</xref>. These offspring experienced a putative double CO, co-localizing with the Col insertion site of above-mentioned transposition. Note that in all six F<sub>2</sub> offspring, at least one of the recombinant chromosomes features a CO between the transposition sites. This suggests that the annotated double recombinations are not real, but that the observed patterns originate from copy number variation due to transposed DNA.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.006">http://dx.doi.org/10.7554/eLife.01426.006</ext-link></p></caption><graphic xlink:href="elife01426f002"/></fig><fig id="fig2s1" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.007</object-id><label>Figure 2—figure supplement 1.</label><caption><title>Graphical illustration of short read alignments at type-1 and type-2 markers revealing no NCO–GCs (two loci at the left) and the same loci revealing type-1 and type-2 NCO-GCs (right).</title><p>Short read alignments are shown in colors according to the respective genome, if they cannot be uniquely assigned to one unique parent they are shown in grey. At type-1 markers, the expected allele of the recombinant Col or L<italic>er</italic> chromosome is similar to the Cvi allele. This leads to a homozygous genotype (left). At type-2 markers, Cvi is different from the expected allele and a heterozygous marker is observed (second from left). In contrast, in the case of GCs, type-1 markers will reveal an additional allele, whereas type-2 markers will feature the absence of an expected allele (two loci on the right).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.007">http://dx.doi.org/10.7554/eLife.01426.007</ext-link></p></caption><graphic xlink:href="elife01426fs003"/></fig><fig id="fig2s2" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.008</object-id><label>Figure 2—figure supplement 2.</label><caption><title>Observed allele distribution throughout all deeply sequenced tetrad genomes.</title><p>Observed allele frequencies (AFs) were derived from allele counts (based on short read alignments) at the respective marker loci. AFs at type-1 markers (homozygous alleles) are shown in red, AFs at type-2 markers in blue (heterozygous alleles). Blue and red lines are beta distributions fitted to the observed AF distributions. These were used as probability functions for short read-based AFs. At an allele frequency of 0.889 the percentiles of the two probability functions were almost similar. Hence, assigning heterozygous and homozygous alleles based on this frequency cutoff has an almost similar accuracy for type-1 and type-2 markers, and the error rate of GC assignment at type-1 and type-2 markers is expected to be equal.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.008">http://dx.doi.org/10.7554/eLife.01426.008</ext-link></p></caption><graphic xlink:href="elife01426fs004"/></fig><fig id="fig2s3" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.009</object-id><label>Figure 2—figure supplement 3.</label><caption><title>NCO–GC frequency per marker per meiosis measured in the five deeply sequenced tetrads, using a range of minimal coverage thresholds and three different marker sets.</title><p>NCO–GC frequency was assessed using different marker sets. Marker sets with more stringent filtering showed a lower frequency of putative NCO–GCs, which indicates that filtering reduces the incidence of false positives. For all sets we observed that too low minimal coverage thresholds (for assigning either a NCO–GC or no NCO–GC) led to increased putative NCO–GC frequencies. NCO–GC frequency leveled off beyond a coverage requirement of 50x for all marker sets. The blue cross indicates the estimated NCO–GC frequency, based on all PCR-validated NCO–GCs.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.009">http://dx.doi.org/10.7554/eLife.01426.009</ext-link></p></caption><graphic xlink:href="elife01426fs005"/></fig><fig id="fig2s4" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.010</object-id><label>Figure 2—figure supplement 4.</label><caption><title>The number of putative type-1 and type-2 NCO–GCs in all 20 deeply sequenced tetrad offspring using different marker sets.</title><p>Bar charts show the number of NCO–GC detected per offspring plant using different marker set generated with different filter stringencies in each subsequent step (top to bottom). Quality score filtered marker sets revealed an overrepresentation of type-1 NCO–GC (top panel). Even after filtering for markers in close vicinity to putatively duplicated regions, some samples still showed a relative high incidence of type-1 markers (2<sup>nd</sup> panel). After removing markers with falsely aligned reads (using regions where all three parental alleles could be distinguished) and regions of high sequence divergence, the overall number of NCO–GCs was drastically reduced, however the ratio of type-1 and type-2 markers is close to equal, as theoretically expected (3<sup>rd</sup> panel).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.010">http://dx.doi.org/10.7554/eLife.01426.010</ext-link></p></caption><graphic xlink:href="elife01426fs006"/></fig><fig id="fig2s5" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.011</object-id><label>Figure 2—figure supplement 5.</label><caption><title>Transposed sequences on <italic>A. thaliana</italic> chromosome 3.</title><p>(<bold>A</bold>) Upper panel shows, analogous to <xref ref-type="fig" rid="fig2">Figure 2B</xref>, the location of transposed DNA between the two parental lines L<italic>er</italic> and Col. Thin blue and red lines indicate co-linear sequences. Green thick lines show shared, albeit transposed and inverted sequences. Thick blue and red lines show sequences unique to L<italic>er</italic> or Col respectively. Numbered arrows indicate primer positions used for verification of transposed sequences. Primer positions with black numbers refer to primers present in both Col and L<italic>er</italic>, whereas red numbers indicate Col-specific primers. Tick marks in Col describe 10 kb distances and numbers below the Col sequence refer to the approximate transposition breakpoints in the Col reference genome. (<bold>B</bold>) Primer pairs that give a product in L<italic>er</italic> but not in Col. (<bold>C</bold>) Primer pairs that give a product in Col but not in L<italic>er</italic>. The fragment generated by primers 1 and 2 measures ∼9 kb on gel, while based on the Col reference we expected 7,9 Kb. We therefore designed a second (independent) set of primers for positions 1, 2, 3, and 4 and repeated the PCRs for these primer combinations (<bold>D</bold>). All fragments were of similar size as the fragments shown in B and C, corroborating the slightly longer than expected length for the product by primers 1 and 2.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.011">http://dx.doi.org/10.7554/eLife.01426.011</ext-link></p></caption><graphic xlink:href="elife01426fs007"/></fig><fig id="fig2s6" position="float" specific-use="child-fig"><object-id pub-id-type="doi">10.7554/eLife.01426.012</object-id><label>Figure 2—figure supplement 6.</label><caption><title>NCO–GC frequency per marker per meiosis measured in 10 recombinant DH lines at increasing minimal coverage thresholds.</title><p>From a minimal coverage threshold of 10 onwards, the observed frequency of putative NCO–GC does not majorly change. This is in contrast to the minimal coverage requirement for the tetrad sample analysis. The reasons for this lie in the homozygous nature of the DH samples, that makes identification of NCO–GCs much easier in comparison to the tetrad samples. The blue cross indicates the estimated NCO–GC frequency after PCR validating all predicted NCO–GCs.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.012">http://dx.doi.org/10.7554/eLife.01426.012</ext-link></p></caption><graphic xlink:href="elife01426fs008"/></fig></fig-group></p><p>Repeating the analysis with the filtered set of markers still revealed apparently false positive NCO–GC calls. Most remarkable was an ∼80 kb region on chromosome three, in three out of five independent tetrads, which harbored multiple closely linked putative type-2 NCO–GCs, but not a single type-1 NCO–GC. Thus, all these putative NCO–GCs were homozygous genotypes, which suggested another source of error. Intriguingly, for each of these tetrads, one other offspring was found with a similar pattern in the same region, but in contrast these offspring featured only putative type-1 NCO–GCs (heterozygous genotypes). Manual inspection of short read alignments and de novo assemblies of L<italic>er</italic> revealed a so-far undescribed large-scale rearrangement between Col and L<italic>er</italic> (<xref ref-type="fig" rid="fig2">Figure 2B</xref>, <xref ref-type="fig" rid="fig2s5">Figure 2—figure supplement 5</xref>). This rearrangement encompasses two closely linked transpositions, which together relocated ∼40 kb (including six genes) from ∼22.53–22.57 Mb of the reference genome to 17.36 Mb on chromosome 3 in the genome of L<italic>er</italic>. Additionally, the region adjacent to the transposed sequences (between 22.51 Mb and 22.59 Mb on chromosome 3) appeared to be mostly absent in L<italic>er</italic>. The segregation of this complex region in combination with a CO in between the two insertion sites of the transpositions resulted in either a duplication or a deletion of the transposed sequence within individual offspring genomes (<xref ref-type="fig" rid="fig2">Figure 2C</xref>). As a consequence, this unexpected copy number variation introduced false NCO–GC calls (<xref ref-type="fig" rid="fig2">Figure 2D</xref>). Among all 13 tetrads (including the eight shallow sequenced tetrads), there are seven that show a recombination event between the transposed regions, suggesting that an ∼80 kb duplication/deletion is present in about half of all Col-L<italic>er</italic> gametes.</p><p>An earlier report highlighted the region at ∼22.5 Mb as hotspot for double recombination (<xref ref-type="bibr" rid="bib67">Yang et al., 2012</xref>). Intriguingly, each of the genomes with a putative double recombination event at this position within the graphical genotypes also featured a CO between the insertion sites, which leads to the above-mentioned copy number variations and misleading genotypes (<xref ref-type="fig" rid="fig2">Figure 2E</xref>).</p><p>To remove these and similar patterns we utilized the presence of the Cvi alleles. We assigned individual read pairs to one of the three parental genotypes whenever closely linked polymorphisms allowed distinguishing all three parental alleles. All marker loci where short reads aligned from all parental lines, were removed from further analysis. In addition, we removed markers where the local sequence divergence complicated the alignment of L<italic>er</italic> short reads to the reference sequence, since such regions can interfere with the allele frequency calculation (‘Materials and methods’).</p><p>With the final set of 137,339 markers, we genotyped all 20 deeply sequenced tetrad offspring for the presence of putative NCO–GCs at markers that were not found to be involved in CO-associated gene conversion (<xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2B</xref>, ‘Materials and methods’). We were able to confidently assign genotypes to 1,092,055 loci within the 20 individuals, revealing 10 putative NCO–GCs that were supported by a total of 12 converted markers. PCR-based sequencing confirmed the 3:1 segregation of seven NGO–GCs (based on seven markers), but rejected three putative NCO–GCs (based on five markers) because there was a 2:2 segregation of the alleles which was not apparent in the short read sequencing data (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1D</xref>, ‘Materials and methods’). The false-positive predictions may result from incomplete filtering against complex sequence differences, for which no closely linked markers were available to distinguish the reads of all three parents.</p></sec><sec id="s2-4"><title>Doubled haploid lines as independent controls for NCO–GC detection</title><p>A reliable calculation of NCO–GCs is complicated by the heterozygous nature of our samples and stringent filtering may have reduced the number of identified GCs. In order to confirm our analysis of NCO–GCs in an independent experiment, we resequenced the homozygous genomes of 10 doubled haploid offspring of Col-L<italic>er</italic> F<sub>1</sub> hybrids (average coverage 49x, <xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1A</xref>) that we randomly selected from an previously generated DH population (<xref ref-type="bibr" rid="bib65">Wijnker et al., 2012</xref>) (‘Materials and methods’). These were generated by crossing the heterozygous F<sub>1</sub> (as male) to the <italic>GFP-tailswap</italic> haploid inducer (<xref ref-type="bibr" rid="bib47">Ravi and Chan, 2010</xref>) to produce haploid plants with recombinant genomes, which were selfed to give rise to diploid, doubled haploid (DH) lines (<xref ref-type="bibr" rid="bib65">Wijnker et al., 2012</xref>) (<xref ref-type="fig" rid="fig1">Figure 1A</xref>, <xref ref-type="fig" rid="fig1">Figure 1C</xref>). The genomes of the DH lines have blocks of homozygous regions derived either from Col or L<italic>er</italic>, which resulted from recombination events during meiosis in the F<sub>1</sub> parent.</p><p>In addition, we resequenced the parental lines of the DH lines and defined a set of 438,915 high quality markers (average coverage 58.9, <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2C</xref>, ‘Materials and methods’). Genotyping and identification of consecutive blocks of the same genotype were performed in analogy to the tetrad sample analysis (‘Materials and methods’). This revealed 60 COs in total (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1E</xref>). Only one of these CO sites revealed the presence of a GC, which is expected, since single gametes cannot reveal the complete picture of CO–GCs (<xref ref-type="bibr" rid="bib46">Qi et al., 2009</xref>) (<xref ref-type="fig" rid="fig1">Figure 1D</xref>, <xref ref-type="fig" rid="fig1s2">Figure 1—figure supplement 2</xref>).</p><p>In order to test if the haploid life cycle during the generation of DH lines interferes with the assignment of NCO–GCs, we searched for spontaneous mutations in all ten DH lines. To this end, we analyzed 743,440,307 positions across all ten genomes, for which we had sufficient, non-ambiguous short read alignments and identified eight spontaneous mutations (‘Materials and methods’). From this, we estimated a spontaneous mutation rate of 1.1 × 10<sup>−8</sup> ± 1.0 ×10<sup>−8</sup> per site per generation, which is slightly higher than the estimated mutation rate of 7.0 × 10<sup>−9</sup> for sexually reproducing plants (<xref ref-type="bibr" rid="bib40">Ossowski et al., 2010</xref>). Like for the spontaneous mutation rates, we observed an enrichment of transitions (n = 6) over transversions (n = 2). An increased mutation rate in haploids could result from the absence of a homologous repair template in haploid (G1) cells that might lead to increased repair through the presumably more error-prone pathway of non-homologous end-joining (<xref ref-type="bibr" rid="bib23">Gorbunova and Levy, 1997</xref>; <xref ref-type="bibr" rid="bib32">Mao et al., 2008</xref>). Alternatively, the process of uni-parental genome elimination (<xref ref-type="bibr" rid="bib51">Sanei et al., 2011</xref>) could potentially cause mutagenic stress. Even if spontaneous mutation rates are increased in the DH lines, there are on average no more than two mutations per DH genome, which is by far not large enough to interfere with the accurate identification of GCs, which act on existing variation only.</p><p>To assess a minimal coverage threshold to identify putative NCO–GCs for the DH lines, we used the same allele frequency cutoffs as for the analysis of the tetrad samples and calculated the conversion frequency for a series of minimal coverage thresholds. From a minimal coverage of 10 onwards, the frequency of NCO–GCs levels off (<xref ref-type="fig" rid="fig2s6">Figure 2—figure supplement 6</xref>). Using this value as threshold, we genotyped 438,915 markers in each of the 10 DH lines (‘Materials and methods’) and could confidently assign genotypes to 3,672,610 loci. Among these we identified 10 putative NCO–GC tracts (converting 13 markers in total). PCR-based sequencing of these loci confirmed nine NCOs (that converted a total of 10 markers), but rejected one putative NCO–GC based on three converted markers (<xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2</xref>).</p></sec><sec id="s2-5"><title>The endogenous rate of gene conversion in <italic>A. thaliana</italic></title><p>We estimated the frequency of NCO–GCs in the five deeply sequenced tetrads and 10 independent DH lines as 5.9 × 10<sup>−6</sup> ± 6.1 × 10<sup>−6</sup> and 2.7 × 10<sup>−6</sup> ± 2.7 ×10<sup>−6</sup> per site per meiosis respectively, numbers that are in very close agreement. Combining both experiments we estimated a frequency of 3.6 × 10<sup>−6</sup> ± 2.7 ×10<sup>−6</sup> for NCO–GCs per site per meiosis, a frequency that is three orders of magnitude higher than the spontaneous mutation rate (<xref ref-type="bibr" rid="bib40">Ossowski et al., 2010</xref>). In addition, we calculated the frequency of CO–GCs based on the tetrad data and estimated this rate at 7.8 × 10<sup>−6</sup> ± 5.4 × 10<sup>−6</sup> per site per meiosis, which is not significantly different from the NCO–GC frequency. See <xref ref-type="fig" rid="fig1">Figure 1D</xref> for a spatial overview.</p></sec><sec id="s2-6"><title>Sequence divergence within CO-associated GCs</title><p>To obtain a more detailed view on recombination sites, we reconstructed the sequences around GC sites from the tetrad samples by manually inspecting and locally assembling short reads to obtain the full sequence within conversion tracts. We focused on a subset of 71 COs for which sufficient sequencing information was available in both gametes. All 71 CO sites have been visualized in <xref ref-type="supplementary-material" rid="SD3-data">Supplementary file 3</xref>. Of these COs 62.0% (n = 44) showed the presence of associated GCs (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1F,G</xref>). All CO-associated conversion tracts (COCTs) except one showed co-conversion of adjacent polymorphisms (<xref ref-type="bibr" rid="bib55">Schultes and Szostak, 1990</xref>), indicating that all alleles in the COCT segregated in a 3:1 ratio. This observation compares well to yeast, where co-conversion of alleles prevails (<xref ref-type="bibr" rid="bib55">Schultes and Szostak, 1990</xref>; <xref ref-type="bibr" rid="bib57">Stahl and Foss, 2010</xref>). We observed 23 and 20 co-conversions to L<italic>er</italic> and Col, respectively, which is not significantly different (p value 0.64 [χ<sup>2</sup>-test]).</p><p>Occasionally, sequence divergence within the COCTs was large. We observed COCTs with multiple deletions and insertions of up to 18 bp in length. In one of them, L<italic>er</italic> is different from Col at almost one fifth of all positions throughout the COCT. Since homologous recombination requires homology for strand invasion, we speculated that overall COCTs should prefer regions with higher similarity. The distribution of sequence identity at COCTs was not found to be significantly different from randomly sampled regions from the chromosome arms with similar lengths. However, around 3.9% of these regions featured less than 5% confidently aligned positions, when aligning short reads of L<italic>er</italic> against the reference sequence. Such regions, which nearly completely miss any homology, were not targeted by any recombination according our data (<xref ref-type="fig" rid="fig3">Figure 3A</xref>). Nevertheless, reliable identification of a true lower border for the tolerance for sequence divergence at CO sites will be difficult to establish. This would require many more CO events, but also more precise information on the positioning of DSBs and strand invasion, since it is possible that recombination events are initiated adjacent to their resulting conversion tracts.<fig id="fig3" position="float"><object-id pub-id-type="doi">10.7554/eLife.01426.013</object-id><label>Figure 3.</label><caption><title>Sequence similarity at recombination sites, length differences between COCTs and NCOCTs and crossover interference.</title><p>(<bold>A</bold>) The percentage of confidently aligned positions within the resequencing of L<italic>er</italic> was used as proxy for local sequence similarity. The percentages at COs were compared to a background distribution based on random sampling in non-peri-centromeric regions. (<bold>B</bold>) Comparison of COCT and NCOCT lengths. COCTs are significantly longer. (<bold>C</bold>) Repeatedly simulating (n = 10,000) sets of 52 COs randomly placed in the non-peri-centromeric regions predicted the average number of converted markers throughout all COCTs (left) as well as the average number of co-converted markers within a single COCT (right) given a fixed, simulated tract length (blue lines show the average values, shaded regions indicate standard deviations). The dashed red lines indicate the observed values for the real number of converted markers and average number of markers that co-converted. The intersections of observed and simulated numbers suggest an average COCT length of ∼300–400 bp. (<bold>D</bold>) Estimation of NCOCT length, as shown for COCTs in <bold>C</bold>. As the absolute number of NCOs is not obvious, we simulated a range of different NCO numbers (green to yellow colors indicate different numbers of DSBs per meiosis, of which half are simulated not to restore the original allele). Assuming a NCOCTs length of ∼400 bp (as we estimated for COCT length), only five NCOs would be formed per meiosis (left), however in this scenario the average number of co-converted markers would deviate drastically from the observed value (right). (<bold>E</bold>) The density distribution of distances between neighboring CO and between neighboring CO and NCO events reveals differences in inter-event spacing. The observed distances between recombination events (colored), with the average inter-event distance shown as a vertical line, are compared to randomized inter-event distances (black lines) measured after randomizing the labels of the existing tetrads. The boxplots show the distribution of the means of each randomization. The CO–CO distances are significantly longer than random distances due to crossover interference. Interference between COs and NCOs could not be detected.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.013">http://dx.doi.org/10.7554/eLife.01426.013</ext-link></p></caption><graphic xlink:href="elife01426f003"/></fig></p><p>To assess sequence divergence at NCO–GCs, we used the NCO–GCs recovered in the deeply sequenced tetrads and added another 11 NCO–GCs identified in the same tetrads. These additional GCs did not pass our final filters, but have all been confirmed by PCR. In total, we identified 18 converted polymorphisms, which could be assigned to 14 distinct NCO conversion tracts (NCOCTs) (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1H</xref>). Ten of these NCOCTs consisted of a single polymorphism and the remaining four of two converted polymorphisms. Such short NCOCTs cannot be used to distinguish between NCOCTs resulting from co-conversion and more complex tracts of alternating 3:1 and 2:2 segregating markers, as the detection of complex conversions requires NCOCTs with at least three consecutive markers.</p></sec><sec id="s2-7"><title>COs have longer gene conversion tracts than NCOs</title><p>The length distributions of observed CO and NCO conversion tracts are significantly different (p value 0.03 [<italic>t</italic> test], <xref ref-type="fig" rid="fig3">Figure 3B</xref>). Estimating the actual GC tract lengths from the observed GC tract lengths is difficult if the observed tracts are short and the marker density is relatively low. In particular for NCOCTs, where a majority of observed tracts had a length of 1 bp, tract length estimations are tenuous. We therefore performed simulations to determine the expected total number of converted markers within the five deeply sequenced tetrads and the average number of markers within a single conversion given different lengths of simulated conversion tracts.</p><p>For COCTs, we simulated 10,000 sets of 71 randomly placed COCTs of increasing conversion tract lengths from 100 bp to 1 kb and calculated the resulting total number of converted markers among all 71 COCTs and the average number of converted markers per tract. We found that a simulated length of ∼400 bp corresponds to an average number of 69.9 converted markers, which is very close to the 72 converted markers observed in the tetrad samples. An average of 2.2 converted markers per tract corresponds to a simulated COCT length of ∼300 bp (<xref ref-type="fig" rid="fig3">Figure 3C</xref>). Differences between these two estimates might result from the fact that we simulated a fixed COCT length, while actual COCTs might feature variability in their lengths. From this we estimated that COCTs are on average ∼300–400 bp in length.</p><p>We repeated this simulation for NCOCTs. In contrast to simulations for COCTs, simulating NCOCT placement is hampered by the lack of knowledge of how many NCOs were present. It is likely that not all NCOs lead to GC, either due to restoration of the original allele as was assumed for half of the NCOs in yeast (<xref ref-type="bibr" rid="bib31">Mancera et al., 2008</xref>), or because they occur in regions without polymorphisms. We therefore simulated different NCO rates within the range of reported DSBs per meiosis (<xref ref-type="bibr" rid="bib61">Vignard et al., 2007</xref>; <xref ref-type="bibr" rid="bib9">Chelysheva et al., 2007</xref>; <xref ref-type="bibr" rid="bib50">Sanchez-Moran et al., 2007</xref>). We further assumed that half of them are restored to the original alleles and thus only 50% of them have the potential to lead to conversions (green to yellow colored lines in <xref ref-type="fig" rid="fig3">Figure 3D</xref>). These simulations suggested that 100 DSBs per meiosis with a tract length of 50 bp would lead to the detection of 6.7 converted markers on average, which is close to the observed number of 7. Likewise, simulating 200 DSBs per meiosis with a tract length of 25 bp would lead to a similar number of converted markers (average number of conversions was 6.5) (left side of <xref ref-type="fig" rid="fig3">Figure 3D</xref>).</p><p>An alternative hypothesis is that a substantial fraction of meiotic DSBs is repaired through the (identical) sister chromatid and thus does not lead to GC. In order to test this, we simulated a range of very small numbers of meiotic DSBs with a tract length of 400 bp (our estimate of COCT length) and found that no more than 10 DSBs per meiosis repaired through the homologous chromatid would be sufficient to convert as many markers as we had observed in our data (<xref ref-type="fig" rid="fig3">Figure 3D</xref>). However, simulated tracts of ∼400 bp co-converted significantly more markers per CT as we observed in the real conversions (right side in <xref ref-type="fig" rid="fig3">Figure 3D</xref>).</p><p>Moreover, short NCOCTs lengths (averaging at 25–50 bp) would be in agreement with the low recovery rate of NCOs in studies on plant meiosis, and would concur with reported estimates of meiotic DSBs (<xref ref-type="bibr" rid="bib61">Vignard et al., 2007</xref>; <xref ref-type="bibr" rid="bib9">Chelysheva et al., 2007</xref>; <xref ref-type="bibr" rid="bib50">Sanchez-Moran et al., 2007</xref>). It is important to note that short NCOCTs do not mutually exclude the possibility that some of the DSB are repaired through the sister chromatid.</p></sec><sec id="s2-8"><title>Interference acts between COs but not between COs and NCOs</title><p>Genetic interference is the phenomenon that the distance between adjacent recombination events along the chromosome axis is longer than expected under random placement. Interference between COs has been extensively studied in many different species, but reports on interference between COs and NCOs are scarce (<xref ref-type="bibr" rid="bib3">Berchowitz and Copenhaver, 2010</xref>; <xref ref-type="bibr" rid="bib31">Mancera et al., 2008</xref>). To study interference, we calculated distances between adjacent COs and between NCOs and neighboring COs in our tetrad data set (<xref ref-type="fig" rid="fig3">Figure 3E</xref>). These were compared to expected distances between recombination events after randomizing the composition of tetrads by random sampling from all tetrad offspring. This preserved the non-uniform distribution of recombination events along the chromosome axis, while removing dependency between them as previously proposed by <xref ref-type="bibr" rid="bib31">Mancera et al. (2008)</xref> (‘Materials and methods’). Observed inter-CO distances (average distance of 11.9 Mb) were significantly longer than expected based on randomized tetrads (average distance of 8.3 Mb; p value &lt;0.01 [permutation test]), confirming CO–CO interference. Observed CO–NCO distances fell within the expected range (average distance of NCOs: 3.2 Mb, average distance in randomization: 4.5 Mb, p value 0.91 [permutation test]), evidencing no significant difference from random placement. This either suggests that interference acts between COs only, or that the level of interference is so low we could not detect it.</p></sec><sec id="s2-9"><title>Recombination targets gene promoters with low levels of DNA methylation</title><p>The placement of recombination events is a complex interplay between various genomic features (e.g., see <xref ref-type="bibr" rid="bib43">Pan et al., 2011</xref>). The precise localization of 67 COCTs and NCOCTs allowed us to investigate what local dependency may influence recombination localization in <italic>A. thaliana</italic>. Since conversion tracts might form only to one side of a DSB, we included 500 bp of flanking sequence on either side of the conversion tracts.</p><p>Recombination sites in animal and fungal genomes have been shown to correlate with high GC content, a presumed effect of biased gene conversion, in which AT/GC mismatches, are preferentially repaired to CG basepairs (<xref ref-type="bibr" rid="bib43">Pan et al., 2011</xref>; <xref ref-type="bibr" rid="bib18">Duret and Galtier, 2009</xref>; <xref ref-type="bibr" rid="bib44">Pessia et al., 2012</xref>). We tested whether recombination events in <italic>Arabidopsis thaliana</italic> may be correlated with an elevated GC content. The GC content in the 67 conversion tracts was significantly lower than the genomic background (<xref ref-type="fig" rid="fig4">Figure 4A</xref>). This suggests that recombination in <italic>A. thaliana</italic> may be biased towards AT rich regions.<fig id="fig4" position="float"><object-id pub-id-type="doi">10.7554/eLife.01426.014</object-id><label>Figure 4.</label><caption><title>Recombination sites are enriched for un-methylated, AT rich promoter regions.</title><p>(<bold>A</bold>) The GC content was calculated for 67 conversion tracts (including 500 bp of flanking sequence, top) and compared to a background distribution of 5,000 equally sized random locations sampled from non-peri-centromeric regions (bottom). Mean and variance are significantly different (mean: p value 2.4 × 10<sup>−5</sup> [<italic>t</italic> test]; variance: p value 0.02 [Levene’s test]). (<bold>B)</bold> The level of DNA methylation at recombination sites (top) was estimated based on bisulfite-treated DNA sequencing of mature rosette leaves (<xref ref-type="bibr" rid="bib58">Stroud et al., 2012</xref>). DNA methylation at recombination sites (top) is significantly lower as compared to 5,000 equally sized random regions selected from chromosome arms (bottom) (p value 2.2 × 10<sup>−16</sup> [<italic>t</italic> test]). (<bold>C</bold>) Associating recombination sites and gene annotations reveals a significant enrichment of recombination sites in promoters and gene ends. Promoters were defined as 500-bp regions upstream of transcription start sites, gene ends as the last 200 bp of a gene. The background distributions were estimated by randomly sampling from non-peri-centromeric regions.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.014">http://dx.doi.org/10.7554/eLife.01426.014</ext-link></p></caption><graphic xlink:href="elife01426f004"/></fig></p><p>In <italic>A. thaliana</italic>, CG, CHG and CHH methylation show enrichment in peri-centromeric regions, while the high levels of gene body methylation comprise almost exclusively CG methylation (<xref ref-type="bibr" rid="bib12">Cokus et al., 2008</xref>; <xref ref-type="bibr" rid="bib28">Lister et al., 2008</xref>). A string of recent papers investigated the role of DNA methylation in <italic>A. thaliana</italic> recombination (<xref ref-type="bibr" rid="bib68">Yelina et al., 2012</xref>; <xref ref-type="bibr" rid="bib33">Melamed-Bessudo and Levy, 2012</xref>; <xref ref-type="bibr" rid="bib13">Colomé-Tatché et al., 2012</xref>; <xref ref-type="bibr" rid="bib34">Mirouze et al., 2012</xref>). They concordantly report a general increase in recombination in the chromosome arms when DNA methylation levels are suppressed. Consistently, we found that the percentage of methylated DNA for all types of DNA methylation, CG, CHG and CHH methylation, was significantly lower at recombination sites as measured within somatic tissue (mature rosette leaves) suggesting that local levels of DNA methylation interfere with recombination (<xref ref-type="fig" rid="fig4">Figure 4B</xref>) (<xref ref-type="bibr" rid="bib58">Stroud et al., 2012</xref>).</p><p>Previous reports suggested that <italic>A. thaliana</italic> recombination hotspots co-localize with transposable elements (<xref ref-type="bibr" rid="bib24">Horton et al., 2012</xref>) and that recombination events are enriched in peri-centromeric regions (<xref ref-type="bibr" rid="bib67">Yang et al., 2012</xref>). This seems to be at odds with the avoidance of methylated DNA as our data suggested. We estimated the placement of recombination with respect to gene annotation (<xref ref-type="fig" rid="fig4">Figure 4C</xref>). Gene promoters and gene ends are significantly overrepresented among our recombination sites (p values 0.03 and 0.02 respectively [permutation tests]). Gene body regions are underrepresented among our recombination sites, but their underrepresentation is not significant. As our DNA methylation analysis suggested, we do not find evidence for an increase in recombination at transposable elements.</p><p>While we observed less recombination in genes than expected for random placement, the largest fraction of recombination events did however occur in genes. This implies that recombination frequently leads to the formation of new allelic variants of genes. Of 71 CO events in the tetrad samples, two generated new alleles of genes encoding putative ‘hybrid proteins’ in both of the respective gametes. Likewise five of the 60 CO events in the DH lines overlapped with genes, resulting in the generation of putative new allelic variants of genes. This is most likely a severe underestimate, as we only used available markers and might have missed some of the remaining polymorphisms, which could give rise to new variants of genes. Even though NCOs are relatively rare events, we found four NCOs that resulted in non-synonymous substitutions.</p></sec><sec id="s2-10"><title>DNA-motifs associated with recombination sites</title><p>It has been shown for a variety of species that putative DNA binding motifs may be associated with recombination hotspot activity (<xref ref-type="bibr" rid="bib2">Baudat et al., 2010</xref>; <xref ref-type="bibr" rid="bib24">Horton et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Choi et al., 2013</xref>; <xref ref-type="bibr" rid="bib14">Comeron et al., 2012</xref>; <xref ref-type="bibr" rid="bib38">Myers et al., 2008</xref>). In <italic>A. thaliana</italic>, it was only possible so far to analyze ancestral recombination hotspots identified via linkage-based methods. Two recent studies reported on a collection of A- and CT-rich motifs in Arabidopsis, which are enriched at ancestral recombination hotspots (<xref ref-type="bibr" rid="bib24">Horton et al., 2012</xref>; <xref ref-type="bibr" rid="bib11">Choi et al., 2013</xref>). We searched for enriched sequence motifs at recombination sites and found two distinct motifs, which were significantly overrepresented in the set of 67 conversion tracts using the motif identification program MEME (‘Materials and methods’). The first was a palindromic GAA/CTT microsatellite, present in ∼51% (34 out of 67) of sequences (<xref ref-type="fig" rid="fig5">Figure 5A</xref>). The second was a poly-A homopolymer, present at ∼76% (51 out of 67) of the recombination sites. To verify our findings we assessed the occurrences of these motifs within random genomic regions selected from the chromosome arms and found that both motifs occur at significantly higher frequencies at recombination sites as compared to genomic background random regions in the chromosome arms (GAA/CTT: p value 0.01; poly-A: p value 5.0 × 10<sup>−7</sup>, [permutation tests], <xref ref-type="fig" rid="fig5">Figure 5A</xref>, ‘Materials and methods’).<fig id="fig5" position="float"><object-id pub-id-type="doi">10.7554/eLife.01426.015</object-id><label>Figure 5.</label><caption><title>Recombination sites are significantly associated with two sequence motifs and nucleosome-free regions.</title><p>(<bold>A</bold>) Two sequence motifs (poly-A and CTT/GAA) were found significantly enriched at recombination sites after searching for over-enriched motifs using MEME. We established background frequency distributions by randomly sampling regions of the same sizes from non-peri-centromeric regions, followed by a targeted search for the respective motif (shown as histograms). The observed number of motifs at recombination sites is shown by red arrows (poly-A: p value 3.8 × 10<sup>−6</sup>; CTT: p value 0.002 [permutation test]). (<bold>B</bold>) The distances between recombination sites and DNA sequences that cannot be bound by nucleosomes are significantly enriched for short distances (p value 2.0e-16 [generalized linear model fitting]). Nucleosome exclusion sites are defined as (A)<sub>10</sub> and ((GC)<sub>3</sub>NN)<sub>3</sub> (Wang et al. (1996)) (<xref ref-type="bibr" rid="bib60">Suter et al., 2000</xref>). (<bold>C</bold>) Comparison of recombination sites and nucleosome occupancy. The nucleosome occupancy was estimated through DNA sequencing performed after digesting with MNase of somatic tissue (<xref ref-type="bibr" rid="bib10">Chodavarapu et al., 2010</xref>). The nucleosome-bound genomic regions are preferentially sequenced and establish a quantitative readout of nucleosome occupancy. The read count distributions at recombination sites (top) and at 5,000 random background regions (bottom) are significantly different (p value 1.9e-4 [<italic>t</italic> test]).</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.015">http://dx.doi.org/10.7554/eLife.01426.015</ext-link></p></caption><graphic xlink:href="elife01426f005"/></fig></p></sec><sec id="s2-11"><title>Recombination in <italic>A. thaliana</italic> targets constitutively open chromatin</title><p>Poly-A motifs are common in eukaryotic genomes and are known to prevent nucleosomes from binding DNA. Nucleosome-free regions are known to be targeted by the recombination machinery in yeast (<xref ref-type="bibr" rid="bib43">Pan et al., 2011</xref>; <xref ref-type="bibr" rid="bib66">Wu and Lichten, 1994</xref>). In mouse, conversely, meiotic DSBs apparently target nucleosome bound regions guided by the methyltransferase PRMD9 (<xref ref-type="bibr" rid="bib56">Smagulova et al., 2011</xref>).</p><p>We examined known nucleosome exclusion sequences, that comprise poly-A (n≥10) and specific CG rich motifs ([C/G]<sub>3</sub>-N<sub>2</sub>-[C/G]<sub>3</sub>-N<sub>2</sub>-[C/G]<sub>3</sub>) (<xref ref-type="bibr" rid="bib63">Wang and Griffith, 1996</xref>; <xref ref-type="bibr" rid="bib60">Suter et al., 2000</xref>) for their overrepresentation near recombination sites. The shortest distance from the recombination midpoints to the nearest nucleosome-exclusion motif was compared with a distance distribution based on randomly sampled sites (<xref ref-type="fig" rid="fig5">Figure 5B</xref>). Recombination events occurred much closer to nucleosome-exclusion sites than expected for random placement, which suggests that the <italic>A. thaliana</italic> recombination machinery targets constitutively open chromatin. To corroborate our hypothesis of recombination targeting open chromatin and being negatively correlated with nucleosome occupancy, we tested whether our recombination sites are known to be free of nucleosomes in somatic tissues. For this we compared the amount of DNA reads generated after digestion with MNase (<xref ref-type="bibr" rid="bib10">Chodavarapu et al., 2010</xref>) for regions of conversion tracts to random non-peri-centromeric regions. Significant underrepresentation of DNA reads in conversion tracts corroborate that recombination is less likely in regions that are bound by nucleosomes (<xref ref-type="fig" rid="fig5">Figure 5C</xref>).</p></sec></sec><sec id="s3" sec-type="discussion"><title>Discussion</title><p>In order to describe the full extent of allelic exchange between <italic>A. thaliana</italic> homologous chromosomes, we have resequenced the complete genomes of 13 complete meiotic tetrads and 10 homozygous doubled haploid offspring of heterozygous F<sub>1</sub> hybrids. Genome rearrangements were shown to severely confound identification of genuine NCO–GCs, but stringent filters led to highly similar GC rate estimates for both offspring types. In particular, we showed how a complex 80 kb rearrangement led to the erroneous identification of recurrent double recombination events. Our results underscore the need for extreme caution and rigor when studying (biased) GCs using short reads.</p><sec id="s3-1"><title>The rate of gene conversion in <italic>A. thaliana</italic></title><p><xref ref-type="bibr" rid="bib59">Sun et al. (2012)</xref> estimated a GC rate of 3.5 × 10<sup>−4</sup> per marker per meiosis based on fluorescent reporter genes in pollen tetrads (<xref ref-type="bibr" rid="bib19">Francis et al., 2007</xref>). This would translate to 8.8 × 10<sup>−5</sup> per site per meiosis. Since for most of their data these authors could not distinguish between NCO–GCs and CO–GCs, this frequency should be compared to the combined frequency of NCO–GC and CO–GCs in our study, which is 1.1 × 10<sup>−5</sup> per site per meiosis. Potential causes for differences may lie in local GC formation differences (<xref ref-type="bibr" rid="bib59">Sun et al., 2012</xref>), experimental variation (e.g., plant growth conditions) or the higher probability of meiotic DSBs to occur in transgenes.</p><p>In <italic>A. thaliana</italic>, the ∼1–3 NCO–GCs and ∼10 COs per meiosis we observed do not amount to the 120 to 250 DSBs that were observed during meiotic prophase (<xref ref-type="bibr" rid="bib61">Vignard et al., 2007</xref>; <xref ref-type="bibr" rid="bib9">Chelysheva et al., 2007</xref>; <xref ref-type="bibr" rid="bib50">Sanchez-Moran et al., 2007</xref>). Our simulation-based estimates of conversion tract lengths suggested short NCO tract lengths as a possible factor for this discrepancy. Small NCO tracts most often will not lead to conversions, as they do not overlap with polymorphisms. Minimal observed COCT lengths range from 1 to 1,229 bp (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1F</xref>), whereas the lengths of NCOCTs range from 1 to 282 bp (<xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1H</xref>). The recovery of two fairly long NCO events of 275 and 282 bp could be attributed to variation in NCO size, and is likely a detection bias, as NCOCT detection is strongly biased towards recovery of longer tracts (<xref ref-type="bibr" rid="bib15">Curtis et al., 1989</xref>). A previous estimate of 558 bp for COCT length and less than 150 bp for NCOCT length (<xref ref-type="bibr" rid="bib30">Lu et al., 2012</xref>) as based on six and four observations respectively, are close to our estimations. As suggested by <xref ref-type="bibr" rid="bib30">Lu et al. (2012)</xref>, we cannot exclude other factors, like the preferential repair of heteroduplexes to parental genotypes (<xref ref-type="bibr" rid="bib5">Borts et al., 2000</xref>) or inter-sister rather than inter-homologue repair to explain the low GC rate in <italic>A. thaliana</italic> (<xref ref-type="bibr" rid="bib22">Goldfarb and Lichten, 2010</xref>). However, our data suggesting short NCO–GCs tracts do not require such additional assumptions.</p><p>Sequence identity at CO sites was as low as 82%, due to indels of up to 18 bp, which might not even comprise the lower limit of sequence divergence at CO sites. This may explain why previous work did not detect a relationship between CO frequencies and SNP density, albeit at a much coarser level (<xref ref-type="bibr" rid="bib48">Salome et al., 2012</xref>).</p><p>Recombination events in <italic>A. thaliana</italic> are closely associated with nucleosome-free regions. The enrichment for poly-A motifs may explain this observation. Sequence annotation showed that promoter regions are enriched for recombination sites. These observations add to the emergent pattern that the recombination machinery targets accessible DNA at gene promoters in yeast and <italic>Drosophila melanogaster</italic> (<xref ref-type="bibr" rid="bib43">Pan et al., 2011</xref>; <xref ref-type="bibr" rid="bib14">Comeron et al., 2012</xref>). Even in mouse, where recombination hotspots are commonly guided by the DNA-binding methyltransferase PRDM9, gene promoters are targeted in the absence of PRDM9 (<xref ref-type="bibr" rid="bib6">Brick et al., 2012</xref>). A GAA/CTT microsatellite motif was present at half of the investigated recombination sites in <italic>A. thaliana</italic>, and are (nearly) identical to two sequences found associated to ancestral hotspots (<xref ref-type="bibr" rid="bib11">Choi et al., 2013</xref>). This motif is similar to the TRANSLOCON 1 (TL1) binding motif, to which the heat-shock factor-like transcription factor HSFB1 binds (<xref ref-type="bibr" rid="bib42">Pajerowska-Mukhtar et al., 2012</xref>; <xref ref-type="bibr" rid="bib62">Wang et al., 2005</xref>). There are however no indications that this transcription factor is involved in <italic>A. thaliana</italic> meiosis.</p></sec><sec id="s3-2"><title>The impact of recombination on population-wide allele frequencies</title><p>To understand the coherent flow of genes and alleles within natural populations of <italic>A. thaliana</italic> (<xref ref-type="bibr" rid="bib4">Bomblies et al., 2010</xref>; <xref ref-type="bibr" rid="bib8">Cao et al., 2011</xref>; <xref ref-type="bibr" rid="bib29">Long et al., 2013</xref>) it is essential to understand not only the selective forces that affect allele frequencies, but also the mechanisms that introduce and distribute genetic variation. Together with the random inheritance of homologous chromosomes, COs are the major factors generating new allele combinations by redistributing allelic variation through exchanging complete chromosome arms. The prevalent co-conversion of polymorphisms at CO sites prohibits the formation of complex crossover patterns of alternating polymorphisms, which implies that it is the COs themselves that diffuse haplotype borders, rather than their associated gene conversions.</p><p>NCO–GCs exchange variation between existing haplotypes and as such can effectively generate new alleles of genes. We recovered four NCOs that indeed lead to the generation of new allelic variants of genes through NCO. However, GCs can only generate new allele combinations in heterozygous plants. In the case of <italic>A. thaliana</italic>, which has a selfing rate of at least 85% (and much higher in most populations [<xref ref-type="bibr" rid="bib4">Bomblies et al., 2010</xref>]), this impact will be relatively small in comparison to other organisms.</p><p>As shown in our analyses, meiotic recombination is an effective mechanism to generate new copy number variants, as was previously suggested by <xref ref-type="bibr" rid="bib30">Lu et al. (2012)</xref>. We described how CO events altered the copy number of the transposed sequence within the offspring genomes. The identification of an 80 kb transposition that is present on the same chromosome, but distant enough to allow for intermittent CO events, generated haploid genotypes that either lost or duplicated the transposed sequence. While we can currently only speculate on the resulting fitness (dis-)advantages for their offspring, the genomic effect of combining crossover recombination with genomic structural variation (<xref ref-type="bibr" rid="bib54">Schneeberger et al., 2011</xref>; <xref ref-type="bibr" rid="bib20">Gan et al., 2011</xref>; <xref ref-type="bibr" rid="bib52">Schmitz et al., 2013</xref>; <xref ref-type="bibr" rid="bib29">Long et al., 2013</xref>; <xref ref-type="bibr" rid="bib8">Cao et al., 2011</xref>) provides significant potential for further selection and shaping of the <italic>A. thaliana</italic> pan-genome.</p></sec></sec><sec id="s4" sec-type="materials|methods"><title>Materials and methods</title><sec id="s4-1"><title>Generation of meiotic tetrads and doubled haploids lines</title><p>For the generation of meiotic tetrads, a Col–L<italic>er</italic> hybrid in a <italic>quartet</italic>1 background was made by crossing <italic>qrt1 −/−</italic> Col (N660403) to <italic>qrt1 −/−</italic> L<italic>er</italic> (N8050), which were obtained from the Nottingham <italic>Arabidopsis</italic> Stock Center. Single meiotic pollen quartets from this F<sub>1</sub> plant were picked up with a hair under a microscope and transferred onto a virgin flower of a male sterile Cape Verde Islands (Cvi) female receptor. The male sterility mutant was selected from an EMS treated Cvi population and backcrossed twice to Cvi. These plants were grown under standard long day conditions in a growth chamber. Over 700 unique pollinations were made. All resulting siliques were individually harvested, and when four seeds were recovered from one silique, the resulting plants were grown under short day conditions to maximize rosette size before harvesting. Plants were genotyped using a previously described SNP marker set (<xref ref-type="bibr" rid="bib65">Wijnker et al., 2012</xref>) to verify that all markers segregate in the expected tetrad 2:2 ratio. Doubled haploid <italic>Arabidopsis</italic> lines were selected from crosses made for an existing Columbia (Col)—L<italic>er</italic> DH population. Five of these DH1, DH2, DH3, DH5 and DH10 were featured in a previous publication as plants 53, 34, 58, 49 and 72 respectively (<xref ref-type="bibr" rid="bib65">Wijnker et al., 2012</xref>). The other five DH lines were selected from among (doubled) haploid lines that selected from the same crosses after initial publication of this material.</p></sec><sec id="s4-2"><title>Library preparation and sequencing</title><p>DNA of all five parental lines, DHs and tetrad offspring was extracted from adult rosettes using the CTAB method, with a nuclei extraction step to remove mitochondrial and chloroplast DNA (<xref ref-type="bibr" rid="bib8">Cao et al., 2011</xref>). One-half to one gram of <italic>A. thaliana</italic> leaves were ground to a fine powder in liquid nitrogen using mortar and pestle and transferred to a 15-ml polyethylene centrifuge tube containing 10 ml of ice-cold nuclei extraction buffer, consisting of 10 mM TRIS-HCl pH 9.5, 10 mM EDTA pH 8.0, 100 mM KCl, 500 mM sucrose, 4 mM spermidine, 1 mM spermine and 0.1% beta-mercaptoethanol. The suspended tissue was mixed thoroughly with a wide-bore pipette and filtered through two layers of Miracloth (CalBiochem, San Diego, CA) into an ice-cold 50-ml polyethylene centrifuge tube by a brief spin at less than 100 g for 5 s. Two ml lysis buffer, consisting of 100 mM Tris ph7.5, 0.7M NaCl, 10 mM EDTA, 1% BME (2-mercaptoethanol) and 1% CTAB in H<sub>2</sub>O, were added to the filtered suspension and mixed gently for 2 min on ice. The nuclei were pelleted by centrifugation at 2000 g for 10 min at 4°C. 500 ul CTAB extraction buffer was added to the nuclei pellet, mixed well by inverting the tube and incubated for 30 min at 60°C. The sample was then cooled for 5 min at RT before adding 350 μl Chloroform/isoamyl alcohol (24:1), inverting and mixing gently for about 5 min, and spinning in a microcentrifuge at 6000 rpm for 10 min. The upper layer (450 μl) was transferred to a new 2-ml tube containing 450 μl isopropanol and mixed by inverting several times before pelleting the DNA by centrifugation at 13000 rpm for 3 min. After washing the DNA pellet in 75% EtOH, the DNA was resuspended in sterile DNase free water (containing RNaseA 10 μg/ml). The sample was incubated at 65°C for 20 min to destroy any DNases, and stored at 4°C until use. The DNA concentration and quality was determined with a Nanodrop 1000 (Peqlab, Erlangen, Germany), a Qubit 2.0 Fluorometer (Life Technologies, Carlsbad, CA, USA) and on a 1% agarose gel. DNA samples were concentrated to more than 50 ng/µl with a speed-vac when necessary.</p><p>The DNA samples were sequenced by the Max Planck Genome Center Cologne, Cologne, Germany, and by the Max Planck Institute for Developmental Biology, Tübingen, Germany. At both sequencing facilities, quality checks were performed with a Bioanalyzer (Agilent 2100, Agilent, Böblingen, Germany). Libraries for the doubled haploids, parents and the five deeply sequenced tetrads were generated using the Illumina Genomic DNA TruSeq sample kit (Illumina, San Diego, CA) according to the manufacturer’s instructions. Libraries for the eight shallowly sequenced tetrads were prepared by fragmenting the DNA using dsDNA Shearase (Zymo Research, Irvine, CA, USA) to obtain 100- to 1000-bp fragments, A-tailing using Klenow exonuclease (New England Biolabs, Ipswitch, MA, USA), ligating to indexed adapters using the QuickLigation kit (New England Biolabs, Ipswitch, MA, USA), size selection to 300 to 500 bp using AMPure XP SPRI beads (Beckman-Coulter, Pasadena, CA), and PCR enrichment using the Phusion DNA polymerase (New England Biolabs, Ipswitch, MA, USA). The complete methodological details for the latter library preparation protocol will appear in a separate manuscript that is currently in preparation for publication. The samples were sequenced on Illumina Genome Analyzer GAIIx and Illumina HiSeq2000 in 100 bp paired-end runs. Sequencing yield per sample is listed in <xref ref-type="supplementary-material" rid="SD1-data">Supplementary file 1A</xref>.</p></sec><sec id="s4-3"><title>Resequencing analysis</title><p>We applied SHORE to the whole-genome sequencing data of the five parental accessions (parents of tetrads: <italic>qrt</italic> Col, <italic>qrt</italic> L<italic>er</italic> and EMS male sterile Cvi, and the Col and L<italic>er</italic> parents of the DH lines), the 52 individual tetrad genomes and the 10 DH offspring independently (<xref ref-type="bibr" rid="bib39">Ossowski et al., 2008</xref>). For each sample, short reads were quality filtered and trimmed using the default values. High quality reads were then aligned against the Arabidopsis reference sequence using GenomeMapper by allowing up to 10% mismatches and gaps (<xref ref-type="bibr" rid="bib53">Schneeberger et al., 2009</xref>; <xref ref-type="bibr" rid="bib27">Lamesch et al., 2012</xref>). After using read pair information to remove repetitive alignments, we performed consensus calling again using the default parameters.</p></sec><sec id="s4-4"><title>Initial marker definition and CO identification for tetrad analysis</title><p>Initial markers included all loci with a quality score greater than 24 in the analysis of Col-0 and Cvi, and a quality score of 40 (which is the maximum SHORE assigns) within the analysis of L<italic>er</italic>. In addition we removed all markers that resided in putative duplications, transposable elements, in regions that showed enriched coverage or that were closer than 150 bp to a position that showed evidence of two different alleles in the resequencing of one of the parental genomes (<xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2A</xref>).</p><p>For the reconstruction of the individual tetrad genomes, we assigned either a Col or L<italic>er</italic> allele to each marker if there was a resequencing quality score greater than 15. This initial genotyping was used to identify COs by merging consecutive markers of the same genotype into blocks of markers. Blocks with at least 25 consecutive markers were used as seeds. COs positions were identified by extending the seeds until the nearest block of the other genotype with more markers than the preceding block.</p></sec><sec id="s4-5"><title>Identification and rate estimation of NCO–GCs in tetrad genomes</title><p>NCO–GC identification was attempted in the deeply sequenced tetrad genomes only. An NCO–GC was reported if the genotype at a marker was different from the background that was assigned in the initial genotyping. In order to distinguish between homozygous and heterozygous genotypes reliably, we required a minimum coverage threshold and a coverage-dependent allele frequency cutoff. We established a coverage-dependent frequency cutoff that allows identifying homozygous and heterozygous genotypes at identical error rates. To this end, we fitted beta distributions to the observed distribution of allele frequencies as measured by the short read alignments at type-1 and type-2 markers respectively (an example for a minimal coverage of 50 is shown in <xref ref-type="fig" rid="fig2s2">Figure 2—figure supplement 2</xref>). The unique point at which both distributions share the same quartile has been defined as the allele frequency threshold. This was repeated at multiple minimal coverage thresholds. Assigning homozygous and heterozygous genotypes at the same error rate allows for the calculation of the frequency of NCO–GCs by dividing markers with assigned NCO–GC by all markers with an assigned genotype. For the identification of a minimal coverage threshold, NCO–GC frequencies were calculated at multiple minimal coverage values. Calculations with lower coverage thresholds revealed higher NCO–GC rates compared to calculations with more stringent coverage thresholds (<xref ref-type="fig" rid="fig2s3">Figure 2—figure supplement 3</xref>).</p><p>NCO–GC identification was initially performed on the set of markers used for CO identification. Though this marker set was defined in order to include high quality markers only, further filtering was necessary. Several filtering steps were applied, each reducing the number of markers. First, we extended the regions which encompass putative duplications and which were not allowed to feature markers. Each transposable element and putative duplicated region (as defined by the resequencing analysis performed with SHORE) was extended by 2 kb. Regions around heterozygous positions were defined as 1 kb up- and down-stream. The second filtering excluded all those markers in which we found evidence of read pair alignments from three different parents, which is a reliable indicator of wrong alignments. As a third filtering step, we removed markers where local sequence divergence hampered the alignment of L<italic>er</italic> short reads to the reference sequence.</p><p>Finally we removed all markers that were shown to segregate in a 2:2 manner within the Sanger sequencing data. As the absolute number of false positive NCO–GCs was relatively low compared to the absolute number of markers, we conclude that this is also true for the non-NCO–GCs. Errors result from technical artifacts and thus affect NCO–GCs and non-NCO–GCs equally, and thus the absolute number of false negatives is negligible.</p></sec><sec id="s4-6"><title>Identification and validation of the 80 kb of transposed sequences</title><p>Initial identification of the 80 kb transposition was based on the recombinant tetrad genomes showing copy number variation at the transposition sites, which led to false GC calls. Local <italic>de novo</italic> assembly of reads aligned to this region and prior access to a <italic>de novo</italic> assembly of the L<italic>er</italic> genome, which will appear in a separate manuscript that is currently in preparation for publication, revealed the insertion sites and allowed for primer design. <xref ref-type="fig" rid="fig2s5">Figure 2—figure supplement 5</xref> illustrates the validation of transposed and inverted regions through PCRs. These PCRs were done on the Col and L<italic>er</italic> parental lines. <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2F</xref> lists used primer sequences.</p></sec><sec id="s4-7"><title>Refinement of CO and NCO conversion tracts</title><p>In order to get a complete picture of the polymorphisms in the COCT and NCOCT in both the tetrad and the DH samples, we manually parsed all short read alignments around CO and NCO–GCs combined with local short read assemblies of the short reads aligning to the respective region in addition to the read mate pairs that were not aligned using Velvet (<xref ref-type="bibr" rid="bib69">Zerbino and Birney, 2008</xref>).</p></sec><sec id="s4-8"><title>Validation of NCO–GCs in tetrads</title><p>Putative NCO–GC events were validated by Sanger sequencing of PCR products of all four tetrad offspring. All PCRs were done on extracted genomic DNA, except for NCO–GC events in tetrad 58, where DNA of the sequencing library was used. <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2D</xref> lists primer sequences for all NCOs.</p></sec><sec id="s4-9"><title>Recombination interference</title><p>Distances between neighboring COs and NCOs events were calculated using the midpoint of each CO and NCO as its unique location. To assess whether interference acts on different types of recombination events we implemented a permutation test for adjacent recombination events as suggested by <xref ref-type="bibr" rid="bib31">Mancera et al. (2008)</xref>. For each permutation test we generated 10,000 iterations. Instead of randomizing recombination locations we randomized the labels of the tetrads of each CO, and thereby preserved the inhomogeneity of CO placement across the genome. Chromatid interference, the non-random association of the chromatids involved in two adjacent CO, would violate the validity of this background distribution of our permutation tests. However, we could not identify any evidence for chromatid interference in our data.</p></sec><sec id="s4-10"><title>Validation of NCOs in DH lines</title><p>Putative NCO–GC events were validated by Sanger sequencing of PCR products of both parental lines Col and Ler and each respective DH line. <xref ref-type="supplementary-material" rid="SD2-data">Supplementary file 2E</xref> lists used primer sequences for each NCO.</p></sec><sec id="s4-11"><title>Marker definition and genotyping of DH samples</title><p>High quality markers were defined as positions with a quality score of 25 or greater in the resequencing of the Col-0 sample, a homozygous SNP in the resequencing analysis of L<italic>er</italic> with a quality value of 40 and a short read coverage between 50 and 150 read alignments for the resequencing of L<italic>er</italic>. We genotyped each of the 10 DH samples at each of the markers, when we identified a homozygous consensus call with a quality score of 15 or more. Initial genotyping of the 438,919 markers and CO location identification was performed as outlined for the tetrad genomes.</p><p>NCO–GCs are identified at markers with genotypes that differ from the expected parental genotype. We used the coverage-dependent frequency cutoffs identified in the analysis of the tetrad samples in order to identify to what level of ambiguous reads can be accepted at homozygous positions. This allowed the identification (and removal) of markers that are most likely featuring alignments from different loci in the genome. Calculating the probability of NCO–GCs per marker at a series of minimum coverage thresholds revealed that beyond a minimal coverage of 10, the frequency of NCO–GC remained stable. This threshold was used in the subsequent identification of GCs.</p></sec><sec id="s4-12"><title>Identification of spontaneous mutations in the DH lines</title><p>For the sequencing of the doubled haploid samples, we pooled the DNA of at least four sibling double haploid plants for each sample. Spontaneous mutations, which occurred in the pollen or egg cell of the haploid progenitor plant, as well as recent somatic mutations of individual doubled haploid plants will not be fixed in these pools. By comparing the resequencing data of all 10 samples, we identified mutations, which are specific for one of the DH samples. These encompass mutations that happened in the haploid life cycle before meiotic cell divisions. Only homozygous positions with a SHORE score of greater than or equal to 32 were probed for spontaneous mutations. Like the identification of NCO–GCs, the identification of spontaneous mutations required a minimal coverage of 10, an allele frequency higher than the cutoff to distinguish between homozygous and heterozygous positions (as defined for the tetrad analysis), and equivalent evidence for the non-mutant allele in the other samples.</p></sec><sec id="s4-13"><title>Meiotic DSB-associated motif identification</title><p>We searched for consensus motifs at 67 conversion tracts identified in tetrads and DH lines. Performing motif searches on conversion tracts ensures that the sequences were subjected to DSB repair. However, as conversion tracts are not necessarily centered on the location of the respective DSB, we included flanking sequences of 500 bp to increase the probability to encompass the complete region targeted by DSB repair. Candidate motifs were identified with MEME (<xref ref-type="bibr" rid="bib1">Bailey et al., 2006</xref>), which was run with the ‘zoops’ model, while correcting for the genomic background. Motifs with a minimum of five and maximum of 15 bp were identified. Only two motifs featured an e value of less than 1e-05. Due to the repetitive nature of the recovered motifs, we performed an additional permutation to test for random occurrence in the genome. Rescreening was performed within the observed conversion tracts as well as in 1,000 random genomic region of the same length using a positional weight matrix. The positional weight matrix was mapped against each sequence using MOODS and matches with p value &lt;0.001 were considered (<xref ref-type="bibr" rid="bib26">Korhonen et al., 2009</xref>).</p></sec><sec id="s4-14"><title>Data deposition</title><p>Short read data have been deposited in the EBI short read archive under accession number ERP003793.</p></sec></sec></body><back><ack id="ack"><title>Acknowledgements</title><p>Melany Bartsch and Wim Soppe are thanked for providing the M<sub>2</sub> population derived from EMS-treated Cvi Jose van de Belt and Marijke Hartog are acknowledged for their help with PCRs.</p></ack><sec sec-type="additional-information"><title>Additional information</title><fn-group content-type="competing-interest"><title>Competing interests</title><fn fn-type="conflict" id="conf1"><p>DW: Deputy Editor, <italic>eLife</italic>.</p></fn><fn fn-type="conflict" id="conf2"><p>The other authors declare that no competing interests exist.</p></fn></fn-group><fn-group content-type="author-contribution"><title>Author contributions</title><fn fn-type="con" id="con1"><p>EW, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con2"><p>JJBK, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con3"><p>GVJ, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con4"><p>JD, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con5"><p>JRK, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con6"><p>VR, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con7"><p>FB, Acquisition of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con8"><p>DFJ, Acquisition of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con9"><p>CBS, Acquisition of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con10"><p>BH, Acquisition of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con11"><p>BAR, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article</p></fn><fn fn-type="con" id="con12"><p>LZ, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents</p></fn><fn fn-type="con" id="con13"><p>SO, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents</p></fn><fn fn-type="con" id="con14"><p>HJ, Conception and design, Drafting or revising the article</p></fn><fn fn-type="con" id="con15"><p>DW, Conception and design, Drafting or revising the article</p></fn><fn fn-type="con" id="con16"><p>MK, Conception and design, Drafting or revising the article</p></fn><fn fn-type="con" id="con17"><p>KS, Conception and design, Analysis and interpretation of data, Drafting or revising the article</p></fn></fn-group></sec><sec sec-type="supplementary-material"><title>Additional files</title><supplementary-material id="SD1-data"><object-id pub-id-type="doi">10.7554/eLife.01426.016</object-id><label>Supplementary file 1.</label><caption><p>(<bold>A</bold>) Read number. (<bold>B</bold>) Number of crossovers per tetrad and DH line and chromosome. (<bold>C</bold>) CO location. (<bold>D</bold>) NCO location and extent within the tetrad samples. (<bold>E</bold>) CO location in DH lines. (<bold>F</bold>) Exact loci of CO and extent of associated GC in tetrad samples. (<bold>G</bold>) Converted markers within COCTs in tetrad samples. (<bold>H</bold>) Converted markers within NCOCTs in DH lines.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.016">http://dx.doi.org/10.7554/eLife.01426.016</ext-link></p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife01426s001.xlsx"/></supplementary-material><supplementary-material id="SD2-data"><object-id pub-id-type="doi">10.7554/eLife.01426.017</object-id><label>Supplementary file 2.</label><caption><p>(<bold>A</bold>) Initial marker list used for an initial genotyping of the tetrad samples. (<bold>B</bold>) Filtered marker list used to genotype tetrad samples. (<bold>C</bold>) High quality marker list used to genotype DH samples. (<bold>D</bold>) Primer pairs used for verification of NCOs in tetrad offspring. (<bold>E</bold>) Primer pairs used for verification of NCOs in DH lines. (<bold>F</bold>) Primers used for the verification of transposed DNA on chromosome 3.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.017">http://dx.doi.org/10.7554/eLife.01426.017</ext-link></p></caption><media mime-subtype="xlsx" mimetype="application" xlink:href="elife01426s002.xlsx"/></supplementary-material><supplementary-material id="SD3-data"><object-id pub-id-type="doi">10.7554/eLife.01426.018</object-id><label>Supplementary file 3.</label><caption><p>Includes a visualization of the exact makeup of 71 COs identified in the tetrad samples, for which sufficient sequencing information was available. The colored areas indicate the number of short read alignments for each of the positions as indicated on the x-axis. Red and blue areas refer to regions that descended from Col-0 and L<italic>er</italic>, respectively. Grey areas cannot be assigned to either of them. Vertical lines indicate sequence differences between the parental genotypes and have the respective genotype indicated next to them. Within the CO sites (between the outer borders of both grey areas) the polymorphism data are based on hand curated short read alignments and local assemblies. Outside the flanking markers the polymorphisms encompass the marker used for reconstructing the recombinant chromosomes only. Note each tetrad sample contains one recombinant chromosome and one that is derived from the Cvi parent. The Cvi alleles are not indicated in these plots.</p><p><bold>DOI:</bold> <ext-link ext-link-type="doi" xlink:href="10.7554/eLife.01426.018">http://dx.doi.org/10.7554/eLife.01426.018</ext-link></p></caption><media mime-subtype="pdf" mimetype="application" xlink:href="elife01426s003.pdf"/></supplementary-material><sec sec-type="datasets"><title>Major dataset</title><p>The following dataset was generated:</p><p><related-object content-type="generated-dataset" document-id="Dataset ID and/or url" document-id-type="dataset" document-type="data" id="dataro1"><name><surname>Wijnker</surname><given-names>E</given-names></name>, <name><surname>Velikkakam James</surname><given-names>G</given-names></name>, <name><surname>Ding</surname><given-names>J</given-names></name>, <name><surname>Becker</surname><given-names>F</given-names></name>, <name><surname>Klasen</surname><given-names>JR</given-names></name>, <name><surname>Rawat</surname><given-names>V</given-names></name>, <etal/>, <year>2013</year><x>, </x><source><italic>Arabidopsis thaliana</italic> recombinant tetrads and DH lines</source><x>, </x><object-id pub-id-type="art-access-id">ERP003793</object-id><x>; </x><ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/ena/data/view/ERP003793">http://www.ebi.ac.uk/ena/data/view/ERP003793</ext-link><x>, </x><comment>Publicly available at the European Nucleotide Archive (<ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/ena/">http://www.ebi.ac.uk/ena/</ext-link>).</comment></related-object></p></sec></sec><ref-list><title>References</title><ref id="bib1"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bailey</surname><given-names>Timothy L</given-names></name><name><surname>Williams</surname><given-names>Nadya</given-names></name><name><surname>Misleh</surname><given-names>Chris</given-names></name><name><surname>Li</surname><given-names>Wilfred W</given-names></name></person-group><year>2006</year><article-title>MEME: discovering and analyzing DNA and protein sequence motifs</article-title><source>Nucleic Acids Research</source><volume>34</volume><fpage>W369</fpage><lpage>W373</lpage><pub-id pub-id-type="doi">10.1093/nar/gkl198</pub-id></element-citation></ref><ref id="bib2"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Baudat</surname><given-names>F</given-names></name><name><surname>Buard</surname><given-names>J</given-names></name><name><surname>Grey</surname><given-names>C</given-names></name><name><surname>Fledel-Alon</surname><given-names>A</given-names></name><name><surname>Ober</surname><given-names>C</given-names></name><name><surname>Przeworski</surname><given-names>M</given-names></name><name><surname>Coop</surname><given-names>G</given-names></name><name><surname>de Massy</surname><given-names>B</given-names></name></person-group><year>2010</year><article-title>PRDM9 is a major determinant of meiotic recombination hotspots in humans and mice</article-title><source>Science</source><volume>327</volume><fpage>836</fpage><lpage>840</lpage><pub-id pub-id-type="doi">10.1126/science.1183439</pub-id></element-citation></ref><ref id="bib3"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Berchowitz</surname><given-names>Luke E</given-names></name><name><surname>Copenhaver</surname><given-names>Gregory P</given-names></name></person-group><year>2010</year><article-title>Genetic interference: don’t stand so close to me</article-title><source>Curr Genomics</source><volume>11</volume><fpage>91</fpage><lpage>102</lpage><pub-id pub-id-type="doi">10.2174/138920210790886835</pub-id></element-citation></ref><ref id="bib4"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bomblies</surname><given-names>Kirsten</given-names></name><name><surname>Yant</surname><given-names>Levi</given-names></name><name><surname>Laitinen</surname><given-names>Roosa A</given-names></name><name><surname>Kim</surname><given-names>Sang-Tae</given-names></name><name><surname>Hollister</surname><given-names>Jesse D</given-names></name><name><surname>Warthmann</surname><given-names>Norman</given-names></name><name><surname>Fitz</surname><given-names>J</given-names></name><name><surname>Weigel</surname><given-names>D</given-names></name></person-group><year>2010</year><article-title>Local-scale patterns of genetic variability, outcrossing, and spatial structure in natural sands of <italic>Arabidopsis thaliana</italic></article-title><source>PLOS Genetics</source><volume>6</volume><fpage>e1000890</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1000890</pub-id></element-citation></ref><ref id="bib5"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Borts</surname><given-names>Rhona H</given-names></name><name><surname>Chambers</surname><given-names>SR</given-names></name><name><surname>Abdullah</surname><given-names>MFF</given-names></name></person-group><year>2000</year><article-title>The many faces of mismatch repair in meiosis</article-title><source>Mutation Research</source><volume>451</volume><fpage>129</fpage><lpage>150</lpage><pub-id pub-id-type="doi">10.1016/S0027-5107(00)00044-0</pub-id></element-citation></ref><ref id="bib6"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brick</surname><given-names>Kevin</given-names></name><name><surname>Smagulova</surname><given-names>Fatima</given-names></name><name><surname>Khil</surname><given-names>Pavel</given-names></name><name><surname>Daniel Camerini-Otero</surname><given-names>R</given-names></name><name><surname>Petukhova</surname><given-names>Galina V</given-names></name></person-group><year>2012</year><article-title>Genetic recombination is directed away from functional genomic elements in mice</article-title><source>Nature</source><volume>485</volume><fpage>642</fpage><lpage>645</lpage><pub-id pub-id-type="doi">10.1038/nature11089</pub-id></element-citation></ref><ref id="bib7"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Buhler</surname><given-names>Cyril</given-names></name><name><surname>Borde</surname><given-names>Valérie</given-names></name><name><surname>Lichten</surname><given-names>Michael</given-names></name></person-group><year>2007</year><article-title>Mapping meiotic single-strand DNA reveals a new landscape of DNA double-strand breaks in <italic>Saccharomyces cerevisiae</italic></article-title><source>PLOS Biology</source><volume>5</volume><fpage>e324</fpage><pub-id pub-id-type="doi">10.1371/journal.pbio.0050324</pub-id></element-citation></ref><ref id="bib8"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cao</surname><given-names>Jun</given-names></name><name><surname>Schneeberger</surname><given-names>Korbinian</given-names></name><name><surname>Ossowski</surname><given-names>Stephan</given-names></name><name><surname>Gunther</surname><given-names>Torsten</given-names></name><name><surname>Bender</surname><given-names>Sebastian</given-names></name><name><surname>Fitz</surname><given-names>Joffrey</given-names></name><name><surname>Koenig</surname><given-names>D</given-names></name><name><surname>Lanz</surname><given-names>C</given-names></name><name><surname>Stegle</surname><given-names>O</given-names></name><name><surname>Lippert</surname><given-names>C</given-names></name><name><surname>Wang</surname><given-names>X</given-names></name><name><surname>Ott</surname><given-names>F</given-names></name><name><surname>Müller</surname><given-names>J</given-names></name><name><surname>Alonso-Blanco</surname><given-names>C</given-names></name><name><surname>Borgwardt</surname><given-names>K</given-names></name><name><surname>Schmid</surname><given-names>KJ</given-names></name><name><surname>Weigel</surname><given-names>D</given-names></name></person-group><year>2011</year><article-title>Whole-genome sequencing of multiple <italic>Arabidopsis thaliana</italic> populations</article-title><source>Nature Genetics</source><volume>43</volume><fpage>956</fpage><lpage>963</lpage><pub-id pub-id-type="doi">10.1038/ng.911</pub-id></element-citation></ref><ref id="bib9"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chelysheva</surname><given-names>Liudmila</given-names></name><name><surname>Gendrot</surname><given-names>Ghislaine</given-names></name><name><surname>Vezon</surname><given-names>Daniel</given-names></name><name><surname>Doutriaux</surname><given-names>Marie-Pascale</given-names></name><name><surname>Mercier</surname><given-names>Raphaël</given-names></name><name><surname>Grelon</surname><given-names>Mathilde</given-names></name></person-group><year>2007</year><article-title>Zip4/Spo22 is required for class i CO formation but not for synapsis completion in <italic>Arabidopsis thaliana</italic></article-title><source>PLOS Genetics</source><volume>3</volume><fpage>e83</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.0030083</pub-id></element-citation></ref><ref id="bib10"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chodavarapu</surname><given-names>Ramakrishna K</given-names></name><name><surname>Feng</surname><given-names>Suhua</given-names></name><name><surname>Bernatavichute</surname><given-names>Yana V</given-names></name><name><surname>Chen</surname><given-names>Pao-Yang</given-names></name><name><surname>Stroud</surname><given-names>Hume</given-names></name><name><surname>Yu</surname><given-names>Yanchun</given-names></name><name><surname>Hetzel</surname><given-names>JA</given-names></name><name><surname>Kuo</surname><given-names>F</given-names></name><name><surname>Kim</surname><given-names>J</given-names></name><name><surname>Cokus</surname><given-names>SJ</given-names></name><name><surname>Casero</surname><given-names>D</given-names></name><name><surname>Bernal</surname><given-names>M</given-names></name><name><surname>Huijser</surname><given-names>P</given-names></name><name><surname>Clark</surname><given-names>AT</given-names></name><name><surname>Krämer</surname><given-names>U</given-names></name><name><surname>Merchant</surname><given-names>SS</given-names></name><name><surname>Zhang</surname><given-names>X</given-names></name><name><surname>Jacobsen</surname><given-names>SE</given-names></name><name><surname>Pellegrini</surname><given-names>M</given-names></name></person-group><year>2010</year><article-title>Relationship between nucleosome positioning and DNA methylation</article-title><source>Nature</source><volume>466</volume><fpage>388</fpage><lpage>392</lpage><pub-id pub-id-type="doi">10.1038/nature09147</pub-id></element-citation></ref><ref id="bib11"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Choi</surname><given-names>C Kyuha</given-names></name><name><surname>Zhao</surname><given-names>Xiaohui</given-names></name><name><surname>Kelly</surname><given-names>Krystyna A</given-names></name><name><surname>Venn</surname><given-names>Oliver</given-names></name><name><surname>Higgins</surname><given-names>James D</given-names></name><name><surname>Yelina</surname><given-names>Nataliya E</given-names></name><name><surname>Hardcastle</surname><given-names>TJ</given-names></name><name><surname>Ziolkowski</surname><given-names>PA</given-names></name><name><surname>Copenhaver</surname><given-names>GP</given-names></name><name><surname>Franklin</surname><given-names>FC</given-names></name><name><surname>McVean</surname><given-names>G</given-names></name><name><surname>Henderson</surname><given-names>IR</given-names></name></person-group><year>2013</year><article-title>Arabidopsis meiotic crossover hot spots overlap with H2A.Z nucleosomes at gene promoters</article-title><source>Nature Genetics</source><volume>45</volume><fpage>1327</fpage><lpage>1336</lpage><pub-id pub-id-type="doi">10.1038/ng.2766</pub-id></element-citation></ref><ref id="bib12"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cokus</surname><given-names>Shawn J</given-names></name><name><surname>Feng</surname><given-names>Suhua</given-names></name><name><surname>Zhang</surname><given-names>Xiaoyu</given-names></name><name><surname>Chen</surname><given-names>Zugen</given-names></name><name><surname>Merriman</surname><given-names>Barry</given-names></name><name><surname>Haudenschild</surname><given-names>Christian D</given-names></name><name><surname>Pradhan</surname><given-names>S</given-names></name><name><surname>Nelson</surname><given-names>SF</given-names></name><name><surname>Pellegrini</surname><given-names>M</given-names></name><name><surname>Jacobsen</surname><given-names>SE</given-names></name></person-group><year>2008</year><article-title>Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning</article-title><source>Nature</source><volume>452</volume><fpage>215</fpage><lpage>219</lpage><pub-id pub-id-type="doi">10.1038/nature06745</pub-id></element-citation></ref><ref id="bib13"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Colomé-Tatché</surname><given-names>Maria</given-names></name><name><surname>Cortijo</surname><given-names>Sandra</given-names></name><name><surname>Wardenaar</surname><given-names>René</given-names></name><name><surname>Morgado</surname><given-names>Lionel</given-names></name><name><surname>Lahouze</surname><given-names>Benoit</given-names></name><name><surname>Sarazin</surname><given-names>Alexis</given-names></name><name><surname>Etcheverry</surname><given-names>M</given-names></name><name><surname>Martin</surname><given-names>A</given-names></name><name><surname>Feng</surname><given-names>S</given-names></name><name><surname>Duvernois-Berthet</surname><given-names>E</given-names></name><name><surname>Labadie</surname><given-names>K</given-names></name><name><surname>Wincker</surname><given-names>P</given-names></name><name><surname>Jacobsen</surname><given-names>SE</given-names></name><name><surname>Jansen</surname><given-names>RC</given-names></name><name><surname>Colot</surname><given-names>V</given-names></name><name><surname>Johannes</surname><given-names>F</given-names></name></person-group><year>2012</year><article-title>Features of the Arabidopsis recombination landscape resulting from the combined Loss of sequence variation and DNA methylation</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>109</volume><fpage>16240</fpage><lpage>16245</lpage><pub-id pub-id-type="doi">10.1073/pnas.1212955109</pub-id></element-citation></ref><ref id="bib14"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Comeron</surname><given-names>Josep M</given-names></name><name><surname>Ratnappan</surname><given-names>Ramesh</given-names></name><name><surname>Bailin</surname><given-names>Samuel</given-names></name></person-group><year>2012</year><article-title>The many landscapes of recombination in <italic>Drosophila melanogaster</italic></article-title><source>PLOS Genetics</source><volume>8</volume><fpage>e1002905</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1002905</pub-id></element-citation></ref><ref id="bib15"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Curtis</surname><given-names>D</given-names></name><name><surname>H Clark</surname><given-names>S</given-names></name><name><surname>Chovnick</surname><given-names>A</given-names></name><name><surname>Bender</surname><given-names>W</given-names></name></person-group><year>1989</year><article-title>Molecular analysis of recombination events in Drosophila</article-title><source>Genetics</source><volume>122</volume><fpage>653</fpage><lpage>661</lpage></element-citation></ref><ref id="bib16"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>De Visser</surname><given-names>J Arjan</given-names></name><name><surname>Elena</surname><given-names>Santiago F</given-names></name></person-group><year>2007</year><article-title>The evolution of sex: empirical insights into the roles of epistasis and drift</article-title><source>Nature Reviews Genetics</source><volume>8</volume><fpage>139</fpage><lpage>149</lpage><pub-id pub-id-type="doi">10.1038/nrg1985</pub-id></element-citation></ref><ref id="bib17"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dooner</surname><given-names>HK</given-names></name><name><surname>Martínez-Férez</surname><given-names>IM</given-names></name></person-group><year>1997</year><article-title>Recombination occurs uniformly within the bronze gene, a meiotic recombination hotspot in the maize genome</article-title><source>The Plant Cell Online</source><volume>9</volume><fpage>1633</fpage><lpage>1646</lpage><pub-id pub-id-type="doi">10.1105/tpc.9.9.1633</pub-id></element-citation></ref><ref id="bib18"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Duret</surname><given-names>Laurent</given-names></name><name><surname>Galtier</surname><given-names>Nicolas</given-names></name></person-group><year>2009</year><article-title>Biased gene conversion and the evolution of mammalian genomic landscapes</article-title><source>Annual Review of Genomics and Human Genetics</source><volume>10</volume><fpage>285</fpage><lpage>311</lpage><pub-id pub-id-type="doi">10.1146/annurev-genom-082908-150001</pub-id></element-citation></ref><ref id="bib19"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Francis</surname><given-names>Kirk E</given-names></name><name><surname>Lam</surname><given-names>Sandy Y</given-names></name><name><surname>Harrison</surname><given-names>Benjamin D</given-names></name><name><surname>Bey</surname><given-names>Alexandra L</given-names></name><name><surname>Berchowitz</surname><given-names>Luke E</given-names></name><name><surname>Copenhaver</surname><given-names>Gregory P</given-names></name></person-group><year>2007</year><article-title>Pollen tetrad-based visual assay for meiotic recombination in Arabidopsis</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>104</volume><fpage>3913</fpage><lpage>3918</lpage><pub-id pub-id-type="doi">10.1073/pnas.0608936104</pub-id></element-citation></ref><ref id="bib20"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gan</surname><given-names>Xiangchao</given-names></name><name><surname>Stegle</surname><given-names>Oliver</given-names></name><name><surname>Behr</surname><given-names>Jonas</given-names></name><name><surname>Steffen</surname><given-names>Joshua G</given-names></name><name><surname>Drewe</surname><given-names>Philipp</given-names></name><name><surname>Hildebrand</surname><given-names>Katie L</given-names></name><name><surname>Lyngsoe</surname><given-names>R</given-names></name><name><surname>Schultheiss</surname><given-names>SJ</given-names></name><name><surname>Osborne</surname><given-names>EJ</given-names></name><name><surname>Sreedharan</surname><given-names>VT</given-names></name><name><surname>Kahles</surname><given-names>A</given-names></name><name><surname>Bohnert</surname><given-names>R</given-names></name><name><surname>Jean</surname><given-names>G</given-names></name><name><surname>Derwent</surname><given-names>P</given-names></name><name><surname>Kersey</surname><given-names>P</given-names></name><name><surname>Belfield</surname><given-names>EJ</given-names></name><name><surname>Harberd</surname><given-names>NP</given-names></name><name><surname>Kemen</surname><given-names>E</given-names></name><name><surname>Toomajian</surname><given-names>C</given-names></name><name><surname>Kover</surname><given-names>PX</given-names></name><name><surname>Clark</surname><given-names>RM</given-names></name><name><surname>Rätsch</surname><given-names>G</given-names></name><name><surname>Mott</surname><given-names>R</given-names></name></person-group><year>2011</year><article-title>Multiple reference genomes and transcriptomes for <italic>Arabidopsis thaliana</italic></article-title><source>Nature</source><volume>477</volume><fpage>419</fpage><lpage>423</lpage><pub-id pub-id-type="doi">10.1038/nature10414</pub-id></element-citation></ref><ref id="bib21"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Giraut</surname><given-names>Laurène</given-names></name><name><surname>Falque</surname><given-names>Matthieu</given-names></name><name><surname>Drouaud</surname><given-names>Jan</given-names></name><name><surname>Pereira</surname><given-names>Lucie</given-names></name><name><surname>Martin</surname><given-names>Olivier C</given-names></name><name><surname>Mézard</surname><given-names>Christine</given-names></name></person-group><year>2011</year><article-title>Genome-wide crossover distribution in <italic>Arabidopsis thaliana</italic> meiosis reveals sex-specific patterns along chromosomes</article-title><source>PLOS Genetics</source><volume>7</volume><fpage>e1002354</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1002354</pub-id></element-citation></ref><ref id="bib22"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Goldfarb</surname><given-names>Tamara</given-names></name><name><surname>Lichten</surname><given-names>Michael</given-names></name></person-group><year>2010</year><article-title>Frequent and efficient use of the sister chromatid for DNA double-strand break repair during budding yeast meiosis</article-title><source>PLOS Biology</source><volume>8</volume><fpage>e1000520</fpage><pub-id pub-id-type="doi">10.1371/journal.pbio.1000520</pub-id></element-citation></ref><ref id="bib23"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gorbunova</surname><given-names>Vera</given-names></name><name><surname>Levy</surname><given-names>Avraham A</given-names></name></person-group><year>1997</year><article-title>Non-homologous DNA end joining in plant cells is associated with deletions and filler DNA insertions</article-title><source>Nucleic Acids Research</source><volume>25</volume><fpage>4650</fpage><lpage>4657</lpage><pub-id pub-id-type="doi">10.1093/nar/25.22.4650</pub-id></element-citation></ref><ref id="bib24"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Horton</surname><given-names>Matthew W</given-names></name><name><surname>Hancock</surname><given-names>Angela M</given-names></name><name><surname>Huang</surname><given-names>Yu S</given-names></name><name><surname>Toomajian</surname><given-names>Christopher</given-names></name><name><surname>Atwell</surname><given-names>Susanna</given-names></name><name><surname>Auton</surname><given-names>Adam</given-names></name><name><surname>Muliyati</surname><given-names>NW</given-names></name><name><surname>Platt</surname><given-names>A</given-names></name><name><surname>Sperone</surname><given-names>FG</given-names></name><name><surname>Vilhjálmsson</surname><given-names>BJ</given-names></name><name><surname>Nordborg</surname><given-names>M</given-names></name><name><surname>Borevitz</surname><given-names>JO</given-names></name><name><surname>Bergelson</surname><given-names>J</given-names></name></person-group><year>2012</year><article-title>Genome-wide patterns of genetic variation in worldwide <italic>Arabidopsis thaliana</italic> accessions from the RegMap panel</article-title><source>Nature Genetics</source><volume>44</volume><fpage>212</fpage><lpage>216</lpage><pub-id pub-id-type="doi">10.1038/ng.1042</pub-id></element-citation></ref><ref id="bib25"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kauppi</surname><given-names>Liisa</given-names></name><name><surname>May</surname><given-names>Celia A</given-names></name><name><surname>Jeffreys</surname><given-names>Alec J</given-names></name></person-group><year>2009</year><article-title>Analysis of meiotic recombination products from human sperm</article-title><source>Methods in Molecular Biology</source><volume>557</volume><fpage>323</fpage><lpage>355</lpage><pub-id pub-id-type="doi">10.1007/978-1-59745-527-5_20</pub-id></element-citation></ref><ref id="bib26"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Korhonen</surname><given-names>Janne</given-names></name><name><surname>Martinmäki</surname><given-names>Petri</given-names></name><name><surname>Pizzi</surname><given-names>Cinzia</given-names></name><name><surname>Rastas</surname><given-names>Pasi</given-names></name><name><surname>Ukkonen</surname><given-names>Esko</given-names></name></person-group><year>2009</year><article-title>MOODS: fast search for position weight matrix matches in DNA sequences</article-title><source>Bioinformatics</source><volume>25</volume><fpage>3181</fpage><lpage>3182</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/btp554</pub-id></element-citation></ref><ref id="bib27"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lamesch</surname><given-names>Philippe</given-names></name><name><surname>Berardini</surname><given-names>Tanya Z</given-names></name><name><surname>Li</surname><given-names>Donghui</given-names></name><name><surname>Swarbreck</surname><given-names>David</given-names></name><name><surname>Wilks</surname><given-names>Christopher</given-names></name><name><surname>Sasidharan</surname><given-names>Rajkumar</given-names></name><name><surname>Muller</surname><given-names>R</given-names></name><name><surname>Dreher</surname><given-names>K</given-names></name><name><surname>Alexander</surname><given-names>DL</given-names></name><name><surname>Garcia-Hernandez</surname><given-names>M</given-names></name><name><surname>Karthikeyan</surname><given-names>AS</given-names></name><name><surname>Lee</surname><given-names>CH</given-names></name><name><surname>Nelson</surname><given-names>WD</given-names></name><name><surname>Ploetz</surname><given-names>L</given-names></name><name><surname>Singh</surname><given-names>S</given-names></name><name><surname>Wensel</surname><given-names>A</given-names></name><name><surname>Huala</surname><given-names>E</given-names></name></person-group><year>2012</year><article-title>The Arabidopsis information Resource (TAIR): improved gene annotation and new tools</article-title><source>Nucleic Acids Research</source><volume>40</volume><fpage>D1202</fpage><lpage>D1210</lpage><pub-id pub-id-type="doi">10.1093/nar/gkr1090</pub-id></element-citation></ref><ref id="bib28"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lister</surname><given-names>Ryan</given-names></name><name><surname>O’Malley</surname><given-names>Ronan C</given-names></name><name><surname>Tonti-Filippini</surname><given-names>Julian</given-names></name><name><surname>Gregory</surname><given-names>Brian D</given-names></name><name><surname>Berry</surname><given-names>Charles C</given-names></name><name><surname>Millar</surname><given-names>AHarvey</given-names></name><name><surname>Ecker</surname><given-names>JR</given-names></name></person-group><year>2008</year><article-title>Highly integrated single-base resolution maps of the epigenome in Arabidopsis</article-title><source>Cell</source><volume>133</volume><fpage>523</fpage><lpage>536</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2008.03.029</pub-id></element-citation></ref><ref id="bib29"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Long</surname><given-names>Quan</given-names></name><name><surname>Rabanal</surname><given-names>Fernando A</given-names></name><name><surname>Meng</surname><given-names>Dazhe</given-names></name><name><surname>Huber</surname><given-names>Christian D</given-names></name><name><surname>Farlow</surname><given-names>Ashley</given-names></name><name><surname>Platzer</surname><given-names>Alexander</given-names></name><name><surname>Zhang</surname><given-names>Q</given-names></name><name><surname>Vilhjálmsson</surname><given-names>BJ</given-names></name><name><surname>Korte</surname><given-names>A</given-names></name><name><surname>Nizhynska</surname><given-names>V</given-names></name><name><surname>Voronin</surname><given-names>V</given-names></name><name><surname>Korte</surname><given-names>P</given-names></name><name><surname>Sedman</surname><given-names>L</given-names></name><name><surname>Mandáková</surname><given-names>T</given-names></name><name><surname>Lysak</surname><given-names>MA</given-names></name><name><surname>Seren</surname><given-names>Ü</given-names></name><name><surname>Hellmann</surname><given-names>I</given-names></name><name><surname>Nordborg</surname><given-names>M</given-names></name></person-group><year>2013</year><article-title>Massive genomic variation and strong selection in <italic>Arabidopsis thaliana</italic> lines from Sweden</article-title><source>Nature Genetics</source><volume>45</volume><fpage>884</fpage><lpage>890</lpage><pub-id pub-id-type="doi">10.1038/ng.2678</pub-id></element-citation></ref><ref id="bib30"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname><given-names>Pingli</given-names></name><name><surname>Han</surname><given-names>Xinwei</given-names></name><name><surname>Qi</surname><given-names>Ji</given-names></name><name><surname>Yang</surname><given-names>Jiange</given-names></name><name><surname>Wijeratne</surname><given-names>Asela J</given-names></name><name><surname>Li</surname><given-names>Tao</given-names></name><name><surname>Ma</surname><given-names>H</given-names></name></person-group><year>2012</year><article-title>Analysis of Arabidopsis genome-wide variations before and after meiosis and meiotic recombination by resequencing Landsberg erecta and all four products of a single meiosis</article-title><source>Genome Research</source><volume>22</volume><fpage>508</fpage><lpage>518</lpage><pub-id pub-id-type="doi">10.1101/gr.127522.111</pub-id></element-citation></ref><ref id="bib31"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mancera</surname><given-names>Eugenio</given-names></name><name><surname>Bourgon</surname><given-names>Richard</given-names></name><name><surname>Brozzi</surname><given-names>Alessandro</given-names></name><name><surname>Huber</surname><given-names>Wolfgang</given-names></name><name><surname>Steinmetz</surname><given-names>Lars M</given-names></name></person-group><year>2008</year><article-title>High-resolution mapping of meiotic crossovers and non-crossovers in yeast</article-title><source>Nature</source><volume>454</volume><fpage>479</fpage><lpage>485</lpage><pub-id pub-id-type="doi">10.1038/nature07135</pub-id></element-citation></ref><ref id="bib32"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname><given-names>Zhiyong</given-names></name><name><surname>Bozzella</surname><given-names>Michael</given-names></name><name><surname>Seluanov</surname><given-names>Andrei</given-names></name><name><surname>Gorbunova</surname><given-names>Vera</given-names></name></person-group><year>2008</year><article-title>DNA repair by nonhomologous end joining and homologous recombination during cell cycle in human cells</article-title><source>Cell Cycle</source><volume>7</volume><fpage>2902</fpage><lpage>2906</lpage><pub-id pub-id-type="doi">10.4161/cc.7.18.6679</pub-id></element-citation></ref><ref id="bib33"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Melamed-Bessudo</surname><given-names>Cathy</given-names></name><name><surname>Levy</surname><given-names>Avraham A</given-names></name></person-group><year>2012</year><article-title>Deficiency in DNA methylation increases meiotic crossover rates in euchromatic but not in heterochromatic regions in Arabidopsis</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>109</volume><fpage>E981</fpage><lpage>E988</lpage><pub-id pub-id-type="doi">10.1073/pnas.1120742109</pub-id></element-citation></ref><ref id="bib34"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mirouze</surname><given-names>Marie</given-names></name><name><surname>Lieberman-Lazarovich</surname><given-names>Michal</given-names></name><name><surname>Aversano</surname><given-names>Riccardo</given-names></name><name><surname>Bucher</surname><given-names>Etienne</given-names></name><name><surname>Nicolet</surname><given-names>Joël</given-names></name><name><surname>Reinders</surname><given-names>Jon</given-names></name><name><surname>Paszkowski</surname><given-names>J</given-names></name></person-group><year>2012</year><article-title>Loss of DNA methylation affects the recombination landscape in Arabidopsis</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>109</volume><fpage>5880</fpage><lpage>5885</lpage><pub-id pub-id-type="doi">10.1073/pnas.1120841109</pub-id></element-citation></ref><ref id="bib35"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moens</surname><given-names>Peter B</given-names></name><name><surname>Chen</surname><given-names>David J</given-names></name><name><surname>Shen</surname><given-names>Zhiyuan</given-names></name><name><surname>Kolas</surname><given-names>Nadine</given-names></name><name><surname>Tarsounas</surname><given-names>Madalina</given-names></name><name><surname>Heng</surname><given-names>Henry HQ</given-names></name><name><surname>Spyropoulos</surname><given-names>B</given-names></name></person-group><year>1997</year><article-title>Rad51 immunocytology in rat and mouse spermatocytes and oocytes</article-title><source>Chromosoma</source><volume>106</volume><fpage>207</fpage><lpage>215</lpage><pub-id pub-id-type="doi">10.1007/s004120050241</pub-id></element-citation></ref><ref id="bib36"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moens</surname><given-names>Peter B</given-names></name><name><surname>Kolas</surname><given-names>Nadine K</given-names></name><name><surname>Tarsounas</surname><given-names>Madalena</given-names></name><name><surname>Marcon</surname><given-names>Edyta</given-names></name><name><surname>Cohen</surname><given-names>Paula E</given-names></name><name><surname>Spyropoulos</surname><given-names>Barbara</given-names></name></person-group><year>2002</year><article-title>The time course and chromosomal localization of recombination-related proteins at meiosis in the mouse are compatible with models that can resolve the early DNA-DNA interactions without reciprocal recombination</article-title><source>Journal of Cell Science</source><volume>115</volume><fpage>1611</fpage><lpage>1622</lpage></element-citation></ref><ref id="bib37"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Muller</surname><given-names>HJ</given-names></name></person-group><year>1932</year><article-title>Some genetic aspects of sex</article-title><source>The American Naturalist</source><volume>66</volume><fpage>118</fpage><lpage>138</lpage></element-citation></ref><ref id="bib38"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Myers</surname><given-names>Simon</given-names></name><name><surname>Freeman</surname><given-names>Colin</given-names></name><name><surname>Auton</surname><given-names>Adam</given-names></name><name><surname>Donnelly</surname><given-names>Peter</given-names></name><name><surname>McVean</surname><given-names>Gil</given-names></name></person-group><year>2008</year><article-title>A common sequence motif associated with recombination hot spots and genome instability in humans</article-title><source>Nature Genetics</source><volume>40</volume><fpage>1124</fpage><lpage>1129</lpage><pub-id pub-id-type="doi">10.1038/ng.213</pub-id></element-citation></ref><ref id="bib39"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ossowski</surname><given-names>Stephan</given-names></name><name><surname>Korbinian</surname><given-names>Schneeberger</given-names></name><name><surname>Clark</surname><given-names>Richard M</given-names></name><name><surname>Lanz</surname><given-names>Christa</given-names></name><name><surname>Warthmann</surname><given-names>Norman</given-names></name><name><surname>Weigel</surname><given-names>Detlef</given-names></name></person-group><year>2008</year><article-title>Sequencing of natural strains of <italic>Arabidopsis thaliana</italic> with short reads</article-title><source>Genome Research</source><volume>18</volume><fpage>2024</fpage><lpage>2033</lpage><pub-id pub-id-type="doi">10.1101/gr.080200.108</pub-id></element-citation></ref><ref id="bib40"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ossowski</surname><given-names>Stephan</given-names></name><name><surname>Korbinian</surname><given-names>Schneeberger</given-names></name><name><surname>Lucas-Lledó</surname><given-names>José Ignacio</given-names></name><name><surname>Warthmann</surname><given-names>Norman</given-names></name><name><surname>Clark</surname><given-names>Richard M</given-names></name><name><surname>Shaw</surname><given-names>Ruth G</given-names></name><name><surname>Weigel</surname><given-names>D</given-names></name><name><surname>Lynch</surname><given-names>M</given-names></name></person-group><year>2010</year><article-title>The rate and molecular spectrum of spontaneous mutations in <italic>Arabidopsis thaliana</italic></article-title><source>Science</source><volume>327</volume><fpage>92</fpage><lpage>94</lpage><pub-id pub-id-type="doi">10.1126/science.1180677</pub-id></element-citation></ref><ref id="bib41"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Otto</surname><given-names>Sarah P</given-names></name><name><surname>Lenormand</surname><given-names>Thomas</given-names></name></person-group><year>2002</year><article-title>Resolving the paradox of sex and recombination</article-title><source>Nature Reviews Genetics</source><volume>3</volume><fpage>252</fpage><lpage>261</lpage><pub-id pub-id-type="doi">10.1038/nrg761</pub-id></element-citation></ref><ref id="bib42"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pajerowska-Mukhtar</surname><given-names>Karolina M</given-names></name><name><surname>Wang</surname><given-names>Wei</given-names></name><name><surname>Tada</surname><given-names>Yasuomi</given-names></name><name><surname>Oka</surname><given-names>Nodoka</given-names></name><name><surname>Tucker</surname><given-names>Chandra L</given-names></name><name><surname>Fonseca</surname><given-names>JosePedro</given-names></name><name><surname>Dong</surname><given-names>X</given-names></name></person-group><year>2012</year><article-title>The HSF-like transcription factor TBF1 is a major molecular switch for plant growth-to-defense transition</article-title><source>Current Biology</source><volume>22</volume><fpage>103</fpage><lpage>112</lpage><pub-id pub-id-type="doi">10.1016/j.cub.2011.12.015</pub-id></element-citation></ref><ref id="bib43"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname><given-names>Jing</given-names></name><name><surname>Sasaki</surname><given-names>Mariko</given-names></name><name><surname>Kniewel</surname><given-names>Ryan</given-names></name><name><surname>Murakami</surname><given-names>Hajime</given-names></name><name><surname>Blitzblau</surname><given-names>Hannah G</given-names></name><name><surname>Tischfield</surname><given-names>Sam E</given-names></name><name><surname>Zhu</surname><given-names>X</given-names></name><name><surname>Neale</surname><given-names>MJ</given-names></name><name><surname>Jasin</surname><given-names>M</given-names></name><name><surname>Socci</surname><given-names>ND</given-names></name><name><surname>Hochwagen</surname><given-names>A</given-names></name><name><surname>Keeney</surname><given-names>S</given-names></name></person-group><year>2011</year><article-title>A Hierarchical combination of factors shapes the genome-wide topography of yeast meiotic recombination initiation</article-title><source>Cell</source><volume>144</volume><fpage>719</fpage><lpage>731</lpage><pub-id pub-id-type="doi">10.1016/j.cell.2011.02.009</pub-id></element-citation></ref><ref id="bib44"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pessia</surname><given-names>Eugénie</given-names></name><name><surname>Popa</surname><given-names>Alexandra</given-names></name><name><surname>Mousset</surname><given-names>Sylvain</given-names></name><name><surname>Rezvoy</surname><given-names>Clément</given-names></name><name><surname>Duret</surname><given-names>Laurent</given-names></name><name><surname>Marais</surname><given-names>Gabriel AB</given-names></name></person-group><year>2012</year><article-title>Evidence for widespread GC-biased gene conversion in eukaryotes</article-title><source>Genome Biology and Evolution</source><volume>4</volume><fpage>787</fpage><lpage>794</lpage><pub-id pub-id-type="doi">10.1093/gbe/evs052</pub-id></element-citation></ref><ref id="bib45"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Preuss</surname><given-names>D</given-names></name><name><surname>Rhee</surname><given-names>SY</given-names></name><name><surname>Davis</surname><given-names>RW</given-names></name></person-group><year>1994</year><article-title>Tetrad analysis possible in Arabidopsis with mutation of the quartet (QRT) genes</article-title><source>Science</source><volume>264</volume><fpage>1458</fpage><lpage>1460</lpage><pub-id pub-id-type="doi">10.1126/science.8197459</pub-id></element-citation></ref><ref id="bib46"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Qi</surname><given-names>Ji</given-names></name><name><surname>Wijeratne</surname><given-names>Asela</given-names></name><name><surname>Tomsho</surname><given-names>Lynn</given-names></name><name><surname>Hu</surname><given-names>Yi</given-names></name><name><surname>Schuster</surname><given-names>Stephan</given-names></name><name><surname>Ma</surname><given-names>Hong</given-names></name></person-group><year>2009</year><article-title>Characterization of meiotic crossovers and gene conversion by whole-genome sequencing in <italic>Saccharomyces cerevisiae</italic></article-title><source>BMC Genomics</source><volume>10</volume><fpage>475</fpage><pub-id pub-id-type="doi">10.1186/1471-2164-10-475</pub-id></element-citation></ref><ref id="bib47"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ravi</surname><given-names>Maruthachalam</given-names></name><name><surname>Chan</surname><given-names>Simon W</given-names></name></person-group><year>2010</year><article-title>Haploid plants produced by centromere-mediated genome elimination</article-title><source>Nature</source><volume>464</volume><fpage>615</fpage><lpage>618</lpage><pub-id pub-id-type="doi">10.1038/nature08842</pub-id></element-citation></ref><ref id="bib48"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Salome</surname><given-names>PA</given-names></name><name><surname>Bomblies</surname><given-names>K</given-names></name><name><surname>Fitz</surname><given-names>J</given-names></name><name><surname>Laitinen</surname><given-names>RAE</given-names></name><name><surname>Warthmann</surname><given-names>N</given-names></name><name><surname>Yant</surname><given-names>L</given-names></name><name><surname>Weigel</surname><given-names>D</given-names></name></person-group><year>2012</year><article-title>The recombination landscape in <italic>Arabidopsis thaliana</italic> F2 populations</article-title><source>Heredity</source><volume>108</volume><fpage>447</fpage><lpage>455</lpage><pub-id pub-id-type="doi">10.1038/hdy.2011.95</pub-id></element-citation></ref><ref id="bib49"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>San Filippo</surname><given-names>Joseph</given-names></name><name><surname>Sung</surname><given-names>Patrick</given-names></name><name><surname>Klein</surname><given-names>Hannah</given-names></name></person-group><year>2008</year><article-title>Mechanism of eukaryotic homologous recombination</article-title><source>Annual Review of Biochemistry</source><volume>77</volume><fpage>229</fpage><lpage>257</lpage><pub-id pub-id-type="doi">10.1146/annurev.biochem.77.061306.125255</pub-id></element-citation></ref><ref id="bib50"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sanchez-Moran</surname><given-names>Eugenio</given-names></name><name><surname>Santos</surname><given-names>Juan-Luis</given-names></name><name><surname>Jones</surname><given-names>Gareth H</given-names></name><name><surname>Franklin</surname><given-names>F Christopher H</given-names></name></person-group><year>2007</year><article-title>ASY1 mediates AtDMC1-dependent interhomolog recombination during meiosis in Arabidopsis</article-title><source>Genes and Development</source><volume>21</volume><fpage>2220</fpage><lpage>2233</lpage><pub-id pub-id-type="doi">10.1101/gad.439007</pub-id></element-citation></ref><ref id="bib51"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sanei</surname><given-names>Maryam</given-names></name><name><surname>Pickering</surname><given-names>Richard</given-names></name><name><surname>Kumke</surname><given-names>Katrin</given-names></name><name><surname>Nasuda</surname><given-names>Shuhei</given-names></name><name><surname>Houben</surname><given-names>Andreas</given-names></name></person-group><year>2011</year><article-title>Loss of centromeric histone H3 (CENH3) from centromeres precedes uniparental chromosome elimination in interspecific barley hybrids</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>108</volume><fpage>E498</fpage><lpage>E505</lpage><pub-id pub-id-type="doi">10.1073/pnas.1103190108</pub-id></element-citation></ref><ref id="bib52"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schmitz</surname><given-names>Robert J</given-names></name><name><surname>Schultz</surname><given-names>Matthew D</given-names></name><name><surname>Urich</surname><given-names>Mark A</given-names></name><name><surname>Nery</surname><given-names>Joseph R</given-names></name><name><surname>Pelizzola</surname><given-names>Mattia</given-names></name><name><surname>Libiger</surname><given-names>Ondrej</given-names></name><name><surname>Alix</surname><given-names>Andrew</given-names></name><name><surname>McCosh</surname><given-names>RB</given-names></name><name><surname>Chen</surname><given-names>H</given-names></name><name><surname>Schork</surname><given-names>NJ</given-names></name><name><surname>Ecker</surname><given-names>JR</given-names></name></person-group><year>2013</year><article-title>Patterns of population epigenomic diversity</article-title><source>Nature</source><volume>495</volume><fpage>193</fpage><lpage>198</lpage><pub-id pub-id-type="doi">10.1038/nature11968</pub-id></element-citation></ref><ref id="bib53"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schneeberger</surname><given-names>Korbinian</given-names></name><name><surname>Hagmann</surname><given-names>Jörg</given-names></name><name><surname>Ossowski</surname><given-names>Stephan</given-names></name><name><surname>Warthmann</surname><given-names>Norman</given-names></name><name><surname>Gesing</surname><given-names>Sandra</given-names></name><name><surname>Kohlbacher</surname><given-names>Oliver</given-names></name><name><surname>Weigel</surname><given-names>D</given-names></name></person-group><year>2009</year><article-title>Simultaneous alignment of short reads against multiple genomes</article-title><source>Genome Biology</source><volume>10</volume><fpage>R98</fpage><pub-id pub-id-type="doi">10.1186/gb-2009-10-9-r98</pub-id></element-citation></ref><ref id="bib54"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schneeberger</surname><given-names>Korbinian</given-names></name><name><surname>Ossowski</surname><given-names>Stephan</given-names></name><name><surname>Ott</surname><given-names>Felix</given-names></name><name><surname>Klein</surname><given-names>Juliane D</given-names></name><name><surname>Wang</surname><given-names>Xi</given-names></name><name><surname>Lanz</surname><given-names>Christa</given-names></name><name><surname>Smith</surname><given-names>LM</given-names></name><name><surname>Cao</surname><given-names>J</given-names></name><name><surname>Fitz</surname><given-names>J</given-names></name><name><surname>Warthmann</surname><given-names>N</given-names></name><name><surname>Henz</surname><given-names>SR</given-names></name><name><surname>Huson</surname><given-names>DH</given-names></name><name><surname>Weigel</surname><given-names>D</given-names></name></person-group><year>2011</year><article-title>Reference-guided assembly of four diverse <italic>Arabidopsis thaliana</italic> genomes</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>108</volume><fpage>10249</fpage><lpage>10254</lpage><pub-id pub-id-type="doi">10.1073/pnas.1107739108</pub-id></element-citation></ref><ref id="bib55"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Schultes</surname><given-names>NP</given-names></name><name><surname>Szostak</surname><given-names>JW</given-names></name></person-group><year>1990</year><article-title>Decreasing gradients of gene conversion on both sides of the initiation site for meiotic recombination at the ARG4 locus in yeast</article-title><source>Genetics</source><volume>126</volume><fpage>813</fpage><lpage>822</lpage></element-citation></ref><ref id="bib56"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Smagulova</surname><given-names>Fatima</given-names></name><name><surname>Gregoretti</surname><given-names>Ivan V</given-names></name><name><surname>Brick</surname><given-names>Kevin</given-names></name><name><surname>Khil</surname><given-names>Pavel</given-names></name><name><surname>Camerini-Otero</surname><given-names>RDaniel</given-names></name><name><surname>Petukhova</surname><given-names>Galina V</given-names></name></person-group><year>2011</year><article-title>Genome-wide analysis reveals novel molecular features of mouse recombination hotspots</article-title><source>Nature</source><volume>472</volume><fpage>375</fpage><lpage>378</lpage><pub-id pub-id-type="doi">10.1038/nature09869</pub-id></element-citation></ref><ref id="bib57"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stahl</surname><given-names>Franklin W</given-names></name><name><surname>Foss</surname><given-names>Henriette M</given-names></name></person-group><year>2010</year><article-title>A two-pathway analysis of meiotic crossing over and gene conversion in <italic>Saccharomyces cerevisiae</italic></article-title><source>Genetics</source><volume>186</volume><fpage>515</fpage><lpage>536</lpage><pub-id pub-id-type="doi">10.1534/genetics.110.121194</pub-id></element-citation></ref><ref id="bib58"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Stroud</surname><given-names>Hume</given-names></name><name><surname>Hale</surname><given-names>Christopher J</given-names></name><name><surname>Feng</surname><given-names>Suhua</given-names></name><name><surname>Caro</surname><given-names>Elena</given-names></name><name><surname>Jacob</surname><given-names>Yannick</given-names></name><name><surname>Michaels</surname><given-names>Scott D</given-names></name><name><surname>Jacobsen</surname><given-names>SE</given-names></name></person-group><year>2012</year><article-title>DNA methyltransferases are required to induce heterochromatic re-replication in Arabidopsis</article-title><source>PLOS Genetics</source><volume>8</volume><fpage>e1002808</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1002808</pub-id></element-citation></ref><ref id="bib59"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname><given-names>Yujin</given-names></name><name><surname>Ambrose</surname><given-names>Jonathan H</given-names></name><name><surname>Haughey</surname><given-names>Brena S</given-names></name><name><surname>Webster</surname><given-names>Tyler D</given-names></name><name><surname>Pierrie</surname><given-names>Sarah N</given-names></name><name><surname>Muñoz</surname><given-names>Daniela F</given-names></name><name><surname>Wellman</surname><given-names>EC</given-names></name><name><surname>Cherian</surname><given-names>S</given-names></name><name><surname>Lewis</surname><given-names>SM</given-names></name><name><surname>Berchowitz</surname><given-names>LE</given-names></name><name><surname>Copenhaver</surname><given-names>GP</given-names></name></person-group><year>2012</year><article-title>Deep genome-wide measurement of meiotic gene conversion using tetrad analysis in <italic>Arabidopsis thaliana</italic></article-title><source>PLOS Genetics</source><volume>8</volume><fpage>e1002968</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1002968</pub-id></element-citation></ref><ref id="bib60"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Suter</surname><given-names>Bernhard</given-names></name><name><surname>Schnappauf</surname><given-names>Georg</given-names></name><name><surname>Thoma</surname><given-names>Fritz</given-names></name></person-group><year>2000</year><article-title>Poly(dA·dT) sequences exist as rigid DNA structures in nucleosome-free yeast promoters in vivo</article-title><source>Nucleic Acids Research</source><volume>28</volume><fpage>4083</fpage><lpage>4089</lpage><pub-id pub-id-type="doi">10.1093/nar/28.21.4083</pub-id></element-citation></ref><ref id="bib61"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vignard</surname><given-names>Julien</given-names></name><name><surname>Siwiec</surname><given-names>Tanja</given-names></name><name><surname>Chelysheva</surname><given-names>Liudmila</given-names></name><name><surname>Vrielynck</surname><given-names>Nathalie</given-names></name><name><surname>Gonord</surname><given-names>Florine</given-names></name><name><surname>Armstrong</surname><given-names>Susan J</given-names></name><name><surname>Schlögelhofer</surname><given-names>P</given-names></name><name><surname>Mercier</surname><given-names>R</given-names></name></person-group><year>2007</year><article-title>The interplay of RecA-related proteins and the MND1–HOP2 complex during meiosis in <italic>Arabidopsis thaliana</italic></article-title><source>PLOS Genetics</source><volume>3</volume><fpage>e176</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.0030176</pub-id></element-citation></ref><ref id="bib62"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>Dong</given-names></name><name><surname>Weaver</surname><given-names>Natalie D</given-names></name><name><surname>Kesarwani</surname><given-names>Meenu</given-names></name><name><surname>Dong</surname><given-names>Xinnian</given-names></name></person-group><year>2005</year><article-title>Induction of protein secretory pathway is required for systemic acquired resistance</article-title><source>Science</source><volume>308</volume><fpage>1036</fpage><lpage>1040</lpage><pub-id pub-id-type="doi">10.1126/science.1108791</pub-id></element-citation></ref><ref id="bib63"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>YH</given-names></name><name><surname>Griffith</surname><given-names>JD</given-names></name></person-group><year>1996</year><article-title>The [(G/C)3NN]n motif: a common DNA repeat that excludes nucleosomes</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>93</volume><fpage>8863</fpage><lpage>8867</lpage><pub-id pub-id-type="doi">10.1073/pnas.93.17.8863</pub-id></element-citation></ref><ref id="bib64"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Weiner</surname><given-names>Beth M</given-names></name><name><surname>Kleckner</surname><given-names>Nancy</given-names></name></person-group><year>1994</year><article-title>Chromosome pairing via multiple interstitial interactions before and during meiosis in yeast</article-title><source>Cell</source><volume>77</volume><fpage>977</fpage><lpage>991</lpage><pub-id pub-id-type="doi">10.1016/0092-8674(94)90438-3</pub-id></element-citation></ref><ref id="bib65"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wijnker</surname><given-names>Erik</given-names></name><name><surname>van Dun</surname><given-names>Kees</given-names></name><name><surname>de Snoo</surname><given-names>C Bastiaan</given-names></name><name><surname>Lelivelt</surname><given-names>Cilia C</given-names></name><name><surname>Keurentjes</surname><given-names>Joost J</given-names></name><name><surname>Naharudin</surname><given-names>NazatulShima</given-names></name><name><surname>Ravi</surname><given-names>M</given-names></name><name><surname>Chan</surname><given-names>SW</given-names></name><name><surname>de Jong</surname><given-names>H</given-names></name><name><surname>Dirks</surname><given-names>R</given-names></name></person-group><year>2012</year><article-title>Reverse breeding in <italic>Arabidopsis thaliana</italic> generates homozygous parental lines from a heterozygous plant</article-title><source>Nature Genetics</source><volume>44</volume><fpage>467</fpage><lpage>470</lpage><pub-id pub-id-type="doi">10.1038/ng.2203</pub-id></element-citation></ref><ref id="bib66"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>TC</given-names></name><name><surname>Lichten</surname><given-names>M</given-names></name></person-group><year>1994</year><article-title>Meiosis-induced double-strand break sites determined by yeast chromatin structure</article-title><source>Science</source><volume>263</volume><fpage>515</fpage><lpage>518</lpage><pub-id pub-id-type="doi">10.1126/science.8290959</pub-id></element-citation></ref><ref id="bib67"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Sihai</given-names></name><name><surname>Yuan</surname><given-names>Yang</given-names></name><name><surname>Wang</surname><given-names>Long</given-names></name><name><surname>Li</surname><given-names>Jing</given-names></name><name><surname>Wang</surname><given-names>Wen</given-names></name><name><surname>Liu</surname><given-names>Haoxuan</given-names></name><name><surname>Chen</surname><given-names>JQ</given-names></name><name><surname>Hurst</surname><given-names>LD</given-names></name><name><surname>Tian</surname><given-names>D</given-names></name></person-group><year>2012</year><article-title>Great majority of recombination events in Arabidopsis are gene conversion events</article-title><source>Proceedings of the National Academy of Sciences of the United States of America</source><volume>109</volume><fpage>20992</fpage><lpage>20997</lpage><pub-id pub-id-type="doi">10.1073/pnas.1211827110</pub-id></element-citation></ref><ref id="bib68"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yelina</surname><given-names>Nataliya E</given-names></name><name><surname>Choi</surname><given-names>Kyuha</given-names></name><name><surname>Chelysheva</surname><given-names>Liudmila</given-names></name><name><surname>Macaulay</surname><given-names>Malcolm</given-names></name><name><surname>de Snoo</surname><given-names>Bastiaan</given-names></name><name><surname>Wijnker</surname><given-names>Erik</given-names></name><name><surname>Miller</surname><given-names>N</given-names></name><name><surname>Drouaud</surname><given-names>J</given-names></name><name><surname>Grelon</surname><given-names>M</given-names></name><name><surname>Copenhaver</surname><given-names>GP</given-names></name><name><surname>Mezard</surname><given-names>C</given-names></name><name><surname>Kelly</surname><given-names>KA</given-names></name><name><surname>Henderson</surname><given-names>IR</given-names></name></person-group><year>2012</year><article-title>Epigenetic remodeling of meiotic crossover frequency in <italic>Arabidopsis thaliana</italic> DNA methyltransferase mutants</article-title><source>PLOS Genetics</source><volume>8</volume><fpage>e1002844</fpage><pub-id pub-id-type="doi">10.1371/journal.pgen.1002844</pub-id></element-citation></ref><ref id="bib69"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zerbino</surname><given-names>Daniel R</given-names></name><name><surname>Birney</surname><given-names>Ewan</given-names></name></person-group><year>2008</year><article-title>Velvet: algorithms for de novo short read assembly using de Bruijn graphs</article-title><source>Genome Research</source><volume>18</volume><fpage>821</fpage><lpage>829</lpage><pub-id pub-id-type="doi">10.1101/gr.074492.107</pub-id></element-citation></ref></ref-list></back><sub-article article-type="article-commentary" id="SA1"><front-stub><article-id pub-id-type="doi">10.7554/eLife.01426.019</article-id><title-group><article-title>Decision letter</article-title></title-group><contrib-group content-type="section"><contrib contrib-type="editor"><name><surname>McVean</surname><given-names>Gil</given-names></name><role>Reviewing editor</role><aff><institution>Oxford University</institution>, <country>United Kingdom</country></aff></contrib></contrib-group></front-stub><body><boxed-text><p>eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see <ext-link ext-link-type="uri" xlink:href="http://elife.elifesciences.org/review-process">review process</ext-link>). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.</p></boxed-text><p>Thank you for sending your work entitled “The genomic landscape of meiotic crossovers and gene conversions in <italic>Arabidopsis thaliana</italic>” for consideration at <italic>eLife</italic>. Your article has been favorably evaluated by a Senior editor and 3 reviewers, one of whom is a member of our Board of Reviewing Editors.</p><p>The Reviewing editor and the other reviewers discussed their comments before we reached this decision, and the Reviewing editor has assembled the following comments to help you prepare a revised submission.</p><p>The distribution of non-crossover association gene conversion (NCO-GC) events in large genomes (such as those of plants and mammals) is of interest to those studying both recombination processes and also the evolutionary impacts of recombination. Several previous papers have studied rates and patterns of NCO-GC in Arabidopsis. This paper adds to the literature by sequencing both meiotic tetrads (13 total, 5 in great detail) and doubled haploids (DH; 10 total). The authors take great care to reduce the false positive rate for gene-conversion events, which results in a fairly limited number of events (7 validated NCO-GC events meeting all filters from the tetrads, 9 from the DH lines). They analyse the rate of GC (both CO and NCO), tract lengths, interference, relationship to genomic feature and possible motifs. Although the findings largely replicate those seen in other organisms, the demonstration in Arabidopsis is important and the experimental is carried out and analysed at a very high level.</p><p>In revision, we would like you to address the following points.</p><p>1) It would be interesting to see a blow-up of any CO events associated with NCOs with patterns of Col and Ler markers between the 4 sister genomes shown. In particular, to show the spatial relationships between the observed COs and gene conversion tracts.</p><p>2) When trying to fit models of tract length and GC rate, it is important to include not just the observed number of conversion events, but also the distribution of the number of SNPs co-converted. This will help resolve the relationship between rate and length.</p><p>3) During sequencing of the DH lines the authors observe a two-fold higher rate of mutation compared to published rates, which is attributed to their haploid life cycle. Do the authors mean that this occurs due to the absence of a sister template to repair from? Interestingly, they estimate a higher conversion frequency at COs – do the authors have enough information to know whether specific types of mutations (e.g., transitions/transversions) are occurring?</p></body></sub-article><sub-article article-type="reply" id="SA2"><front-stub><article-id pub-id-type="doi">10.7554/eLife.01426.020</article-id><title-group><article-title>Author response</article-title></title-group></front-stub><body><p><italic>1) It would be interesting to see a blow-up of any CO events associated with NCOs with patterns of Col and Ler markers between the 4 sister genomes shown. In particular, to show the spatial relationships between the observed COs and gene conversion tracts</italic>.</p><p>We have added visualizations of the genomic regions of 71 CO sites, for which sufficient sequencing information was available for the reconstruction of putative conversions tract. The visualizations include the parental genomes and all four genomes of the respective tetrads (see Supplementary file 3).</p><p><italic>2) When trying to fit models of tract length and GC rate, it is important to include not just the observed number of conversion events, but also the distribution of the number of SNPs co-converted. This will help resolve the relationship between rate and length</italic>.</p><p>We agree this indeed advances our simulations. We have now included a comparison of the average number of co-converted markers per conversion tract. In addition, we do not only show average values, but added standard deviations. This additional analysis corroborates that the tract lengths of COs and NCOs are drastically different.</p><p><italic>3) During sequencing of the DH lines the authors observe a two-fold higher rate of mutation compared to published rates, which is attributed to their haploid life cycle. Do the authors mean that this occurs due to the absence of a sister template to repair from</italic>?</p><p>The higher mutation rate in Arabidopsis haploids may have diverse causes. The absence of a homologous repair template in haploid cells in G1 might lead to more (presumably error-prone) repair through non-homologous end-joining (NHEJ). However, it is also conceivable that the process of uni-parental genome elimination causes mutagenic stress (i.e., see <xref ref-type="bibr" rid="bib51">Sanei et al., 2011</xref>, PNAS). We have now added a sentence that clarifies this.
In the original version of the manuscript we included 1-bp deletions in the calculation of the mutation rates. Excluding those (as it was done in the estimation of the wild-type mutation rate (<xref ref-type="bibr" rid="bib40">Ossowski et al, 2010</xref>, Science)) decreased our estimation slightly, but it is still more than 1.5 times higher as compared to published rates. Like for the original calculations this observation does not reveal a statistically significant difference.</p><p>Independent of any mechanistic reason for a putative enrichment in spontaneous mutations we had to check whether the mutation frequency is elevated to such drastic degrees that it would interfere with the identification with NCO-GCs. This was not the case.</p><p><italic>Interestingly, they estimate a higher conversion frequency at COs – do the authors have enough information to know whether specific types of mutations (e.g., transitions/transversions) are occurring</italic>?</p><p>Indeed, our analysis (of the tetrad samples) suggests higher conversion frequencies at COs as compared to NCOs, however this difference is not significant.
The overall limited number of spontaneous point mutations makes it hard to characterize the spectrum of mutations. However among these eight mutations were 6 transitions and 2 transversions. This fits to the enrichment of transitions among spontaneous mutations, which was reported earlier (<xref ref-type="bibr" rid="bib40">Ossowski et al, 2010</xref>, Science). We mention this in the updated version of the manuscript.</p></body></sub-article></article>