Column merging #7

bschilder · 2019-11-08T20:56:32Z

extract_snpvar.py

For scenarios where someone's summary stats file has both SNP and CHR/POS, I modified the merging procedure so that column names don't get duplicated.
lines 66-76

#merge the dfs
    # BMS edits
    print('Merging metadata files by...')
    if all(x in df_snps.columns for x in ['SNP', 'BP', 'CHR']):
        logging.info('   SNP and position.')
        df = df_meta.merge(df_snps, on=['SNP', 'A1', 'A2', 'SNP', 'BP', 'CHR'], how='inner')
    elif 'SNP' in df_snps.columns:
        print('   SNP.')
        df = df_meta.merge(df_snps, on=['SNP', 'A1', 'A2'], how='inner')
    else:
        print('   position.')
        df = df_meta.merge(df_snps, on=['CHR', 'BP', 'A1', 'A2'], how='inner')

The text was updated successfully, but these errors were encountered:

omerwe · 2019-11-08T21:14:45Z

Great suggestion! Committed (with slight modifications)

omerwe closed this as completed Nov 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Column merging #7

Column merging #7

bschilder commented Nov 8, 2019

omerwe commented Nov 8, 2019

Column merging #7

Column merging #7

Comments

bschilder commented Nov 8, 2019

extract_snpvar.py

omerwe commented Nov 8, 2019