rank_genes_groups refactoring #723

Koncopd · 2019-07-02T13:04:06Z

Hi, @falexwolf
Do you have any specific things in mind for rank_genes_groups refactoring? What should be done?

The text was updated successfully, but these errors were encountered:

ivirshup · 2019-07-04T05:49:07Z

I think there was some discussion of this (among other topics) here: #562

falexwolf · 2019-08-29T08:18:29Z

Some notes from a brief discussion with Sergei.

make helper functions for each method so that level of indentation and length is decreased
replace lists rankings_gene_... by DataFrame
think about simplifying the wilcoxon implementation, compare with scipy stats implementation and potentially update the test
investigate how the logreg implementation behaves for different choices of reference groups

fidelram · 2019-08-30T08:39:49Z

Can rank_genes_groups be linked to use diffxpy on top of the available methods?

I am using the following code to convert the output of rank_genes_groups to a data frame, in case is useful:

def rank_genes_groups_df(adata, key='rank_genes_groups'):
    # create a data frame with columns from .uns['rank_genes_groups'] (eg. names, 
    # logfoldchanges, pvals). 
    # Ideally, the list of columns should be consistent between methods
    # but 'logreg' does not return logfoldchanges for example

    dd = []
    groupby = adata.uns['rank_genes_groups']['params']['groupby']
    for group in adata.obs[groupby].cat.categories:
        cols = []
        # inner loop to make data frame by concatenating the columns per group
        for col in adata.uns[key].keys():
            if col != 'params':
                   cols.append(pd.DataFrame(adata.uns[key][col][group], columns=[col]))
        
        df = pd.concat(cols,axis=1)
        df['group'] = group
        dd.append(df)

    # concatenate the individual group data frames into one long data frame
    rgg = pd.concat(dd)
    rgg['group'] = rgg['group'].astype('category')
    return rgg.set_index('group')

This results on a table like this:

Koncopd self-assigned this Jul 2, 2019

Koncopd mentioned this issue Mar 3, 2020

Split rank_genes_groups #1081

Closed

Koncopd mentioned this issue Apr 8, 2020

rank_genes_groups refactoring 2nd try #1156

Merged

ivirshup added the Area – Differential Expression Differential expression label Jul 26, 2021

ivirshup mentioned this issue Jul 26, 2021

diffxpy integration #1955

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rank_genes_groups refactoring #723

rank_genes_groups refactoring #723

Koncopd commented Jul 2, 2019

ivirshup commented Jul 4, 2019

falexwolf commented Aug 29, 2019

fidelram commented Aug 30, 2019 •

edited

rank_genes_groups refactoring #723

rank_genes_groups refactoring #723

Comments

Koncopd commented Jul 2, 2019

ivirshup commented Jul 4, 2019

falexwolf commented Aug 29, 2019

fidelram commented Aug 30, 2019 • edited

fidelram commented Aug 30, 2019 •

edited