Initial commit of read.cross.tidy() #20

aaronwolen · 2014-08-22T15:02:43Z

Hi Karl,

This PR adds support for a new read.cross() format I'm calling "tidy" because genotype, phenotype and map data are stored in separate files using a standard data.frame-like format:

Phenotypes:

	1	2	3	4	5
T264	118.3	264	194.9	264	145.4

Genotypes:

	1	2	3	4	5
D10M44	B	-	-	B	H
D1M3	B	B	H	B	H
D1M75	B	B	H	H	H
D1M215	H	B	H	H	H
D1M309	H	H	H	H	B

Map:

	chr	cm
D10M44	1	0
D1M3	1	0.996
D1M75	1	24.85
D1M215	1	40.41
D1M309	1	49.99

The advantage of this format is that each file can be easily loaded into R with read.table() for analysis/visualization outside of R/qtl. For projects using R/qtl, I usually maintain two sets of genotype/phenotype files: one for general analysis and one formatted for R/qtl. Adding something like read.cross.tidy() would allow me to avoid that redundancy, so that's my selfish motivation behind this PR.
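As a rough illustration of the read.table() point, the sketch below recreates the three example tables as tab-delimited files and loads each one with base R. The file names, the `read_tidy()` helper, and the choice to treat `-` as a missing genotype are assumptions made for this example, not details taken from the PR.

```r
# Recreate the example tables as tab-delimited files in a temp directory.
# File names are assumptions made for this sketch only.
dir <- tempfile("tidy_")
dir.create(dir)
writeLines(c("\t1\t2\t3\t4\t5",
             "T264\t118.3\t264\t194.9\t264\t145.4"),
           file.path(dir, "pheno.tsv"))
writeLines(c("\t1\t2\t3\t4\t5",
             "D10M44\tB\t-\t-\tB\tH",
             "D1M3\tB\tB\tH\tB\tH",
             "D1M75\tB\tB\tH\tH\tH",
             "D1M215\tH\tB\tH\tH\tH",
             "D1M309\tH\tH\tH\tH\tB"),
           file.path(dir, "geno.tsv"))
writeLines(c("\tchr\tcm",
             "D10M44\t1\t0",
             "D1M3\t1\t0.996",
             "D1M75\t1\t24.85",
             "D1M215\t1\t40.41",
             "D1M309\t1\t49.99"),
           file.path(dir, "map.tsv"))

# Hypothetical helper: first column becomes row names, column names are kept
# verbatim (individual IDs such as "1"), and "-" is read as NA (an assumption).
read_tidy <- function(path) {
  read.table(path, header = TRUE, sep = "\t", row.names = 1,
             check.names = FALSE, stringsAsFactors = FALSE,
             na.strings = c("NA", "-"))
}

pheno <- read_tidy(file.path(dir, "pheno.tsv"))
geno  <- read_tidy(file.path(dir, "geno.tsv"))
map   <- read_tidy(file.path(dir, "map.tsv"))

# Markers are listed in the same order in the genotype and map tables.
stopifnot(identical(rownames(geno), rownames(map)))
```

Each object is an ordinary data frame, so it can be passed straight to plotting or modelling code without going through a cross object.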

I'd call it draft code at this point; it works but lacks the extensive data checks found in the other read.cross.*() functions. I can bring it up to parity but wanted to gauge your level of interest before proceeding.

kbroman · 2014-08-23T20:39:00Z

Great idea! I'll merge it into the devel branch. I'm reserving the master branch for the latest release.

aaronwolen · 2014-08-25T13:53:03Z

Cool! I'll integrate it with read.cross() and add some of the missing data checks. Let me know if you have any other suggestions.

aaronwolen · 2014-08-26T14:09:21Z

I made some improvements to the original PR:

- added write.cross.tidy()
- added "tidy" support to read.cross() and write.cross()
- NA columns are inserted for individuals missing in the genotype or phenotype files
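The NA-column behaviour can be sketched in base R roughly as follows. This is a minimal illustration under stated assumptions, not the code from the PR: `pad_individuals()` is a hypothetical helper, and the small tables are made-up subsets of the example data in which individuals are columns.

```r
# Hypothetical helper: add all-NA columns for any individual in `ids` that is
# absent from `x`, then return the columns in the order given by `ids`.
pad_individuals <- function(x, ids) {
  for (id in setdiff(ids, colnames(x))) {
    x[[id]] <- NA
  }
  x[, ids, drop = FALSE]
}

# Genotypes cover individuals 1-3; phenotypes cover individuals 1, 3 and 4.
geno <- data.frame(`1` = c("B", "B"), `2` = c(NA, "B"), `3` = c("H", "H"),
                   row.names = c("D10M44", "D1M3"),
                   check.names = FALSE, stringsAsFactors = FALSE)
pheno <- data.frame(`1` = 118.3, `3` = 194.9, `4` = 264,
                    row.names = "T264", check.names = FALSE)

# Keep every individual seen in either table instead of dropping any.
ids <- union(colnames(geno), colnames(pheno))
geno_padded  <- pad_individuals(geno, ids)
pheno_padded <- pad_individuals(pheno, ids)
```

After padding, both tables have the same individuals in the same order, so they can be combined column by column.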

I wrote a few simple tests to ensure that a cross created with read.cross.tidy() is identical to one created with an existing read.cross.*() function, and to verify that cross data can be round-tripped:

files -> `read.cross.tidy()` -> `write.cross.tidy()` -> files -> `read.cross.tidy()`
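The shape of such a round-trip check can be sketched with base R alone. Here `read_tbl()` and `write_tbl()` are hypothetical stand-ins operating on a single tab-delimited table; they are not the read.cross.tidy() or write.cross.tidy() implementations, and the actual tests from the separate branch are not shown in this thread.

```r
# Hypothetical stand-ins for a reader/writer pair on one tab-delimited table.
read_tbl <- function(path) {
  read.table(path, header = TRUE, sep = "\t", row.names = 1,
             check.names = FALSE, stringsAsFactors = FALSE,
             na.strings = c("NA", "-"))
}
write_tbl <- function(x, path) {
  # col.names = NA writes an empty header cell above the row names,
  # matching the leading tab in the input files.
  write.table(x, path, sep = "\t", quote = FALSE, na = "-", col.names = NA)
}

original <- tempfile(fileext = ".tsv")
writeLines(c("\t1\t2\t3\t4\t5",
             "D10M44\tB\t-\t-\tB\tH",
             "D1M3\tB\tB\tH\tB\tH"),
           original)

first <- read_tbl(original)        # files -> read
copy <- tempfile(fileext = ".tsv")
write_tbl(first, copy)             # -> write -> files
second <- read_tbl(copy)           # -> read

# The round trip should reproduce the same table, missing values included.
stopifnot(identical(first, second))
```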

I didn't want to muck up your existing testing infrastructure so I kept my tests in a separate branch, which isn't part of this PR.

kbroman · 2014-09-03T17:27:35Z

Thanks, @aaronwolen! I've incorporated your code into the devel branch. I also added a test and added a small bit to the documentation for read.cross and write.cross.

Initial commit of read.cross.tidy()

645fa59

aaronwolen added 2 commits August 25, 2014 09:49

read.cross.tidy: Remove unused argument

574b09f

read.cross.tidy: Fix bug in default mapfile name

7353580

aaronwolen added 9 commits August 25, 2014 12:32

read.cross.tidy: set estimate map to FALSE

89b147b

read.cross.tidy: Drop rownames prior to returning results

733ce8e

read.cross: Add support for tidy format

a16dc0a

read.cross.tidy: Insert NA for missing individuals rather than dropping them

6836324

read.cross.tidy: Ensure ordering of individuals

3b55bbd

read.cross.tidy: Add id column to pheno if informative

6df9156

write.cross: Add support for tidy format

4657fb3

write.cross.tidy: Fix allele assignment

2393c72

read.cross.tidy: Clarify code determining whether id column should be included

5302f0f

kbroman closed this Sep 3, 2014
