-
Notifications
You must be signed in to change notification settings - Fork 1
/
README.txt
50 lines (38 loc) · 2.67 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
/_README.txt - this file
* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
* *
* Please note, most of the data files contained in this DOI are *
* compressed into GZip files (.gz extension). *
* Mac and Linux OS's can extract this file type natively. *
* Windows OS requires software to extract the archive. 7-Zip *
* (http://www.7-zip.org) is free and open source software that will *
* allow windows PCs to open and decompress the archive. *
* *
* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
This dataset generated by GOMAP is a high-coverage and reproducible functional
annotation set based on Gene Ontology (GO) term assignments that covers all
protein coding gene models in the Joint Genome Institute (JGI) Glycine Max
genome assembly Wm82.a4.v1 (genotype Williams 82, assembly 4.0, gene model
annotation 1.0) with a median of 11 annotations per gene model.
It was created in April of 2019.
For more information about the GOMAP pipeline, please visit
https://dill-picl.org/projects/gomap
Each of the following directories contains its own README.txt. Please refer to
that file for more details.
/0_GOMAP-input The input for GOMAP: the peptide sequences we downloaded,
the script we used on them to generate the input for our
pipeline, and that script's result which is the GOMAP input.
/1_GOMAP-output The functional annotation dataset as output by GOMAP,
follows GO Annotation File 2 (GAF 2) format.
/2_cleanup Scripts and accompanying resources to clean, modify, and
complement the GOMAP-output to generate our final result.
/3_final-result Cleaned annotation dataset. FINAL RESULT.
Follows GO Annotation File 2 (GAF 2) format.
Also contains a locus mapping to Wm82.a2.v1,
see README in that folder for details.
/4_make_GO_db Making An Organism Package From Annotations Available From A Set Of Named Data.Frames.
Testing Gene Ontology annotation for a list of soybean genes
* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
files 0, 1, 2, 3 were downloaded from https://dill-picl.org/projects/gomap/gomap-datasets/
in file 3. The GO Annotation File 2 (GAF 2) was modified to remove errors in orinial one.
file 4 was newly added by myself.