**Set environment**

In [1]:
suppressMessages(suppressWarnings(source("../run_config_project_sing.R")))
show_env()

You are working on        Singularity: singularity_proj_encode_fcc 
BASE DIRECTORY (FD_BASE): /data/reddylab/Kuei 
REPO DIRECTORY (FD_REPO): /data/reddylab/Kuei/repo 
WORK DIRECTORY (FD_WORK): /data/reddylab/Kuei/work 
DATA DIRECTORY (FD_DATA): /data/reddylab/Kuei/data 

You are working with      ENCODE FCC 
PATH OF PROJECT (FD_PRJ): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC 
PROJECT RESULTS (FD_RES): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/results 
PROJECT SCRIPTS (FD_EXE): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/scripts 
PROJECT DATA    (FD_DAT): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/data 
PROJECT NOTE    (FD_NBK): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/notebooks 
PROJECT DOCS    (FD_DOC): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/docs 
PROJECT LOG     (FD_LOG): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/log 
PROJECT REF     (FD_REF): /data/reddylab/Kuei/repo/Proj_ENCODE_FCC/references 



**Check input files**

In [2]:
txt_fdiry = file.path(FD_REF, "genome_gene")
vec = dir(txt_fdiry)
for (txt in vec){cat(txt, "\n")}

biomart.ensembl_gene_111.GRCh38.p14.gene_stable_id2name_tss.tsv.gz 
biomart.ensembl_gene_111.GRCh38.p14.gene_stable_id2name.tsv.gz 
demap.v24Q2.AchillesCommonEssentialControls.csv 


## Import data

**Import essential gene list**

In [3]:
### set file directory
txt_fdiry = file.path(FD_REF, "genome_gene")
txt_fname = "demap.v24Q2.AchillesCommonEssentialControls.csv"
txt_fpath = file.path(txt_fdiry, txt_fname)

### read table
dat = read_csv(txt_fpath, show_col_types = FALSE)

### assign and show
dat_gene_essential_import = dat
print(dim(dat))
fun_display_table(head(dat, 3))

[1] 1247    1


Gene
AAMP (14)
AARS1 (16)
AASDHPPT (60496)


## Arrange table

In [5]:
### split "SYMBOL (ID)" at the space into the gene symbol (Gene) and the parenthesized ID (Note)
dat = dat_gene_essential_import
dat = dat %>% tidyr::separate(col="Gene", into=c("Gene", "Note"), sep = " ")

dat_gene_essential_arrange = dat
print(dim(dat))
fun_display_table(head(dat))

[1] 1247    2


Gene,Note
AAMP,(14)
AARS1,(16)
AASDHPPT,(60496)
ABCB7,(22)
ABCE1,(6059)
ABCF1,(23)
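
The split above leaves the parentheses in `Note`, and that column is dropped in the next cell. If the numeric ID is ever needed, a minimal sketch like the following could extract it with a base R regular expression. The column name `Entrez_ID` is hypothetical, and reading the number as an NCBI Entrez gene ID is an assumption that the input file itself does not state.

```r
# Example strings copied from the imported table above
gene_raw = c("AAMP (14)", "AARS1 (16)", "AASDHPPT (60496)")

# Group 1 captures the symbol, group 2 the digits inside the parentheses
pattern = "^([^ ]+) \\(([0-9]+)\\)$"

# Entrez_ID is a hypothetical column name (assumption: the number is an Entrez gene ID)
dat_split = data.frame(
    Gene      = sub(pattern, "\\1", gene_raw),
    Entrez_ID = as.integer(sub(pattern, "\\2", gene_raw)),
    stringsAsFactors = FALSE
)
print(dat_split)
```

Anchoring the pattern with `^` and `$` means a malformed entry is returned unchanged by `sub()` and becomes `NA` after `as.integer()`, which makes unexpected rows easy to spot.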


In [7]:
### label each gene with its source, drop the Note column, and remove duplicated rows
dat = dat_gene_essential_arrange
dat = dat %>% 
    dplyr::mutate(Source = "demap", Version = "v24Q2", File = "AchillesCommonEssentialControls") %>%
    dplyr::select(Gene, Source, Version, File) %>%
    dplyr::distinct()

dat_gene_essential_label = dat
print(dim(dat))
fun_display_table(head(dat))

[1] 1247    4


Gene,Source,Version,File
AAMP,demap,v24Q2,AchillesCommonEssentialControls
AARS1,demap,v24Q2,AchillesCommonEssentialControls
AASDHPPT,demap,v24Q2,AchillesCommonEssentialControls
ABCB7,demap,v24Q2,AchillesCommonEssentialControls
ABCE1,demap,v24Q2,AchillesCommonEssentialControls
ABCF1,demap,v24Q2,AchillesCommonEssentialControls
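
The row count stays at 1247 after `dplyr::distinct()`, which suggests that no gene symbol is repeated in the list. A minimal sketch of an explicit check is shown below on a toy vector. The vector `genes` is a hypothetical stand-in for `dat_gene_essential_label$Gene`, with one duplicate added on purpose.

```r
# Toy vector standing in for dat_gene_essential_label$Gene (hypothetical values)
genes = c("AAMP", "AARS1", "AASDHPPT", "AARS1")

# duplicated() flags the second and later occurrences of each value
dup = unique(genes[duplicated(genes)])

# Report any symbol that occurs more than once
if (length(dup) > 0) {
    cat("Duplicated symbols:", dup, "\n")
} else {
    cat("No duplicated symbols\n")
}
```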


## Export table

In [8]:
### set file directory
txt_fdiry = file.path(FD_RES, "annotation", "gene_essential")
txt_fname = "demap.v24Q2.AchillesCommonEssentialControls.tsv"
txt_fpath = file.path(txt_fdiry, txt_fname)

### write table
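### create the output directory, including missing parents; stay silent if it already exists
dir.create(txt_fdiry, recursive = TRUE, showWarnings = FALSE)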
dat = dat_gene_essential_label
write_tsv(dat, txt_fpath)
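
After exporting, it can be useful to confirm that the file reads back unchanged. The sketch below illustrates this round trip using only base R and a temporary file, so it does not touch `FD_RES`. The two-row table is a hypothetical stand-in for `dat_gene_essential_label`, and `write.table()` / `read.delim()` are swapped in for the `readr` functions used in the notebook.

```r
# Toy table with the same columns as dat_gene_essential_label (hypothetical rows)
dat = data.frame(
    Gene    = c("AAMP", "AARS1"),
    Source  = "demap",
    Version = "v24Q2",
    File    = "AchillesCommonEssentialControls",
    stringsAsFactors = FALSE
)

# Write to a temporary path rather than FD_RES so nothing real is overwritten
txt_fpath = tempfile(fileext = ".tsv")
write.table(dat, txt_fpath, sep = "\t", quote = FALSE, row.names = FALSE)

# Read the file back and compare it with the in-memory table
dat_back = read.delim(txt_fpath, stringsAsFactors = FALSE)
res = all.equal(dat, dat_back)
print(res)
```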