blanks are special again #175

wasade · 2015-10-28T03:48:29Z

Yet another way to describe body sites for blanks

Fixes the double SampleID Column

wasade · 2015-10-29T16:42:56Z

@squirrelo @EmbrietteH, is this case still necessary to handle?

squirrelo · 2015-10-29T16:59:24Z

Still investigating.

wasade · 2015-11-06T18:16:09Z

The changes just pushed in were necessary to resolve issues encountered with the production data.

wasade · 2015-11-06T18:22:54Z

Assuming tests pass, would it be possible for a rapid review on this?

cc @squirrelo @EmbrietteH @ElDeveloper @josenavas @antgonza

…nto clean_metadata_bug

wasade · 2015-11-06T20:20:42Z

Tests should be passing, possible for review please?

ElDeveloper · 2015-11-06T20:22:37Z

No, stop asking ... jk, I'm on it.

On (Nov-06-15|12:20), Daniel McDonald wrote:

Tests should be passing, possible for review please?

Reply to this email directly or view it on GitHub:
#175 (comment)

ElDeveloper · 2015-11-06T20:27:23Z

ipynb/primary-processing/04-prepare_metaanalyses.md

-The first step in the process is to merge the the individual tables into larger ones, and then to merge the larger tables into a final one. We're not using QIIME's `parallel_merge_otu_tables.py` here as we also need one of the intermediate tables for subsequent processing.
-
-The first merge we'll do is between the Global Gut and the American Gut.
+We also need to make sure the metadata (the information about the samples) are also merged and consistent. Prior to merge, we're going to add in some additional detail about every sample, such as a column in the mapping file that is the combination of the study title and the body site. We're also going to "generalize" body sites to the type of site they're from (e.g., the back of the hand is just "skin"). This process will also clean the metadata to remove blanks and unknown sample types.


Very minor, but the usage of the term mapping file comes undefined in this paragraph, and it seems like you were just introducing to the concept of "metadata".

@mortonjt is going to clean this up

Sounds good!

On (Nov-06-15|12:37), Daniel McDonald wrote:

@@ -40,70 +40,76 @@ We're also going to generate some new files, so let's get them setup.

ag_pgp_hmp_gg_cleaned_md = agu.get_new_path(agenv.paths['ag-pgp-hmp-gg-cleaned-md'])

-The first step in the process is to merge the the individual tables into larger ones, and then to merge the larger tables into a final one. We're not using QIIME's parallel_merge_otu_tables.py here as we also need one of the intermediate tables for subsequent processing.

-The first merge we'll do is between the Global Gut and the American Gut.
+We also need to make sure the metadata (the information about the samples) are also merged and consistent. Prior to merge, we're going to add in some additional detail about every sample, such as a column in the mapping file that is the combination of the study title and the body site. We're also going to "generalize" body sites to the type of site they're from (e.g., the back of the hand is just "skin"). This process will also clean the metadata to remove blanks and unknown sample types.

@mortonjt is going to clean this up

Reply to this email directly or view it on GitHub:
https://github.com/biocore/American-Gut/pull/175/files#r44184726

ElDeveloper · 2015-11-06T20:33:03Z

Looks good to me, just a copule of comments, nothing blocking. Should be ready to go granted that tests pass.

mortonjt · 2015-11-06T20:35:15Z

👍

blanks are special again

Fixes from #175

Daniel McDonald and others added 3 commits October 27, 2015 20:36

blanks are special again

d35299e

to many spaces

193873f

Merge pull request biocore#176 from JWDebelius/duplicate_sample_id_fix

a1f9a85

Fixes the double SampleID Column

The various issues encountered on production data

b25cf7d

Daniel McDonald and others added 4 commits November 6, 2015 11:45

test failures

971ab34

Merge branch 'clean_metadata_bug' of github.com:wasade/American-Gut i…

c50524d

…nto clean_metadata_bug

Lingering bits

9bc9152

should run the tests first locally...

bd9e432

ElDeveloper reviewed Nov 6, 2015
View reviewed changes

mortonjt added a commit that referenced this pull request Nov 6, 2015

Merge pull request #175 from wasade/clean_metadata_bug

99b7e70

blanks are special again

mortonjt merged commit 99b7e70 into biocore:master Nov 6, 2015

mortonjt added a commit to mortonjt/American-Gut that referenced this pull request Nov 6, 2015

Fixes from biocore#175

2451c04

jwdebelius added a commit that referenced this pull request Nov 9, 2015

Merge pull request #178 from mortonjt/blanks

486dfc0

Fixes from #175

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

blanks are special again #175

blanks are special again #175

wasade commented Oct 28, 2015

wasade commented Oct 29, 2015

squirrelo commented Oct 29, 2015

wasade commented Nov 6, 2015

wasade commented Nov 6, 2015

wasade commented Nov 6, 2015

ElDeveloper commented Nov 6, 2015

ElDeveloper Nov 6, 2015

wasade Nov 6, 2015

ElDeveloper Nov 6, 2015

ElDeveloper commented Nov 6, 2015

mortonjt commented Nov 6, 2015

blanks are special again #175

blanks are special again #175

Conversation

wasade commented Oct 28, 2015

wasade commented Oct 29, 2015

squirrelo commented Oct 29, 2015

wasade commented Nov 6, 2015

wasade commented Nov 6, 2015

wasade commented Nov 6, 2015

ElDeveloper commented Nov 6, 2015

ElDeveloper Nov 6, 2015

Choose a reason for hiding this comment

wasade Nov 6, 2015

Choose a reason for hiding this comment

ElDeveloper Nov 6, 2015

Choose a reason for hiding this comment

-The first step in the process is to merge the the individual tables into larger ones, and then to merge the larger tables into a final one. We're not using QIIME's parallel_merge_otu_tables.py here as we also need one of the intermediate tables for subsequent processing.

ElDeveloper commented Nov 6, 2015

mortonjt commented Nov 6, 2015

-The first step in the process is to merge the the individual tables into larger ones, and then to merge the larger tables into a final one. We're not using QIIME's `parallel_merge_otu_tables.py` here as we also need one of the intermediate tables for subsequent processing.