yjzhang/cellmesh

The data consists of two matrices: a gene-by-MeSH-cell-type matrix and a gene-by-PMID matrix.

The gene-MeSH matrix is stored in corpus.mm. cell_info.json is a dict mapping each index to a MeSH cell type. gene.dict.text contains one entry per gene, with indices matching those of corpus.mm. Each entry lists the gene_index (for example, gene_index 0 refers to row 0 of the gene/cell matrix), the NCBI taxid, geneid, and gene_symbol, and dfs (document frequency: the number of cells the gene co-occurs with).
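
As a rough illustration of how the files described above fit together, here is a minimal Python sketch that uses only the standard library. Several things in it are assumptions rather than facts from this repository: that gene.dict.text is tab-separated with the fields in the order listed above, that cell_info.json keys are stringified integers, that corpus.mm uses the Matrix Market coordinate format with 1-based indices, and that matrix columns correspond to cell_info.json indices. The helper names (parse_gene_line, parse_cell_info, read_mm_entries) and the toy data are hypothetical.

```python
import json
from typing import NamedTuple


class GeneRecord(NamedTuple):
    gene_index: int   # row of the gene/cell matrix
    taxid: str        # NCBI taxonomy id
    gene_id: str      # NCBI gene id
    gene_symbol: str
    dfs: int          # document frequency


def parse_gene_line(line, sep="\t"):
    """Parse one gene.dict.text line (delimiter and field order are assumed)."""
    gene_index, taxid, gene_id, gene_symbol, dfs = line.rstrip("\n").split(sep)
    return GeneRecord(int(gene_index), taxid, gene_id, gene_symbol, int(dfs))


def parse_cell_info(json_text):
    """Map index -> MeSH cell type; JSON object keys are strings, so cast to int."""
    return {int(k): v for k, v in json.loads(json_text).items()}


def read_mm_entries(lines):
    """Yield 0-based (row, col, value) from Matrix Market coordinate text."""
    data = (ln for ln in lines if ln.strip() and not ln.startswith("%"))
    next(data)  # skip the size line: rows cols nonzeros
    for ln in data:
        row, col, value = ln.split()
        yield int(row) - 1, int(col) - 1, float(value)


if __name__ == "__main__":
    # Toy stand-ins for the real files (made-up values, for illustration only).
    gene = parse_gene_line("0\t9606\t920\tCD4\t12")
    cells = parse_cell_info('{"0": "T-Lymphocytes", "1": "B-Lymphocytes"}')
    mm_text = [
        "%%MatrixMarket matrix coordinate real general",
        "1 2 2",
        "1 1 3",
        "1 2 1",
    ]
    for row, col, value in read_mm_entries(mm_text):
        if row == gene.gene_index:
            print(gene.gene_symbol, cells[col], value)
```

For the real corpus.mm, a library reader such as scipy.io.mmread is likely more convenient than the hand-rolled parser; the version above is only meant to make the index relationships explicit.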