Add in missing cellType/antibody entries for encodeTFBSmm10 #25
Comments
Oh interesting -- digging a little further, it seems like all of the entries with NAs for cellType and antibody are "RepPeaks" type, meaning they are individual replicates. However, their merged/consensus data ("Peaks") are also present in LOLACore. For most analyses, one probably wouldn't want to be analysing both the individual replicates and their merged data as though they were independent. It might actually be better to drop these from LOLA?
Probably right. But would you rather keep the merged "Peaks" or the individual replicates ("RepPeaks")?
I'd rather keep the consensus than the replicates (i.e. the Peaks rather than the RepPeaks). I've been noticing quite a lot of deviation between replicates, which is presumably why they did them in the first place.
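For anyone who wants to drop the unannotated replicate entries locally in the meantime, here is a minimal sketch in base R. It runs on a toy table, not a real LOLA database: the file names, cell types and antibodies are made up for illustration, and `regionSets` is a hypothetical stand-in for the region sets that sit alongside the annotation rows. With a real database, the region sets stored with `lola.db$regionAnno` would presumably need the same subsetting so that the two stay aligned.

```r
# Toy stand-in for lola.db$regionAnno (hypothetical rows; the real table has more columns).
regionAnno <- data.frame(
  filename = c("tfA_Peaks.narrowPeak", "tfA_RepPeaks_1.narrowPeak",
               "tfA_RepPeaks_2.narrowPeak", "tfB_Peaks.narrowPeak"),
  cellType = c("MEL", NA, NA, "CH12"),
  antibody = c("GATA1", NA, NA, "CTCF"),
  stringsAsFactors = FALSE
)

# Hypothetical stand-in for the region sets, one element per annotation row.
regionSets <- as.list(seq_len(nrow(regionAnno)))

# Flag entries where both cellType and antibody are missing.
isUnannotated <- is.na(regionAnno$cellType) & is.na(regionAnno$antibody)

# Apply the same mask to the annotation and the region sets so they stay aligned.
keptAnno <- regionAnno[!isUnannotated, ]
keptSets <- regionSets[!isUnannotated]

print(keptAnno$filename)
```

Filtering on the NA pattern rather than on the file name is a choice made for this sketch; matching "RepPeaks" in the file name would be an alternative if the naming turns out to be consistent.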
Thanks for reporting this @oneillkza -- I found 45 extra files in there that shouldn't have been. They had already been excluded from the annotation, but because they were left in the folder, they were still getting read (a feature of LOLA, really...). Anyway, I've taken them out now and will update the public core databases soon. Thanks!
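A quick way to spot this kind of leftover is to compare the files on disk with the file names listed in the annotation. The sketch below is base R only and uses a temporary folder with made-up file names. The `annotated` vector is a hypothetical stand-in for the filename column of a collection's annotation; it is not read from a real LOLA database.

```r
# Temporary folder standing in for a collection's regions folder (hypothetical contents).
regionsDir <- tempfile("regions_demo_")
dir.create(regionsDir)
invisible(file.create(file.path(
  regionsDir, c("tfA_Peaks.bed", "tfB_Peaks.bed", "tfA_Rep1.bed")
)))

# Hypothetical file names listed in the annotation.
annotated <- c("tfA_Peaks.bed", "tfB_Peaks.bed")

# Files present in the folder but absent from the annotation.
onDisk <- list.files(regionsDir)
strayFiles <- setdiff(onDisk, annotated)

print(strayFiles)
```

Any file reported in `strayFiles` would still be picked up if the loader reads everything in the folder, which matches the behaviour described above.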
New version is now deployed here: http://cloud.databio.org/regiondb/
Awesome! Thanks!
Hi there
I noticed that a handful of the entries for encodeTFBSmm10 have NA in the cellType and antibody annotations. Fortunately, this data seems to be available here, and I've enclosed a code snippet that adds the missing entries to lola.db$regionAnno to make things easier for you. (Right now I'm using the below code as a workaround for myself.)
Thanks for making and maintaining a very useful tool!