ktym edited this page May 16, 2014 · 7 revisions
  • CDS

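  • The concept of CDS is ambiguous: an INSDC CDS feature lists the coding segments on the genome, which need not coincide with the exons of the corresponding mRNA feature.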

  • INSDC location

http://togows.org/entry/nucleotide/NC_000019 (Human genome GRCh38 chr19)

     gene            complement(2289774..2308156)
                     /gene="LINGO3"
                     /gene_synonym="LERN2; LRRN6B"
                     /note="leucine rich repeat and Ig domain containing 3;
                     Derived by automated computational analysis using gene
                     prediction method: BestRefSeq."
                     /db_xref="GeneID:645191"
                     /db_xref="HGNC:21206"
                     /db_xref="MIM:609792"
     mRNA            complement(join(2289774..2291821,2308075..2308156))
                     /gene="LINGO3"
                     /gene_synonym="LERN2; LRRN6B"
                     /product="leucine rich repeat and Ig domain containing 3"
                     /note="Derived by automated computational analysis using
                     gene prediction method: BestRefSeq."
                     /transcript_id="NM_001101391.1"
                     /db_xref="GI:157426828"
                     /db_xref="GeneID:645191"
                     /db_xref="HGNC:21206"
                     /db_xref="MIM:609792"
     CDS             complement(2289997..2291775)
                     /gene="LINGO3"
                     /gene_synonym="LERN2; LRRN6B"
                     /note="Derived by automated computational analysis using
                     gene prediction method: BestRefSeq."
                     /codon_start=1
                     /product="leucine-rich repeat and immunoglobulin-like
                     domain-containing nogo receptor-interacting protein 3
                     precursor"
                     /protein_id="NP_001094861.1"
                     /db_xref="GI:157426829"
                     /db_xref="CCDS:CCDS45905.1"
                     /db_xref="GeneID:645191"
                     /db_xref="HGNC:21206"
                     /db_xref="MIM:609792"
                     /translation="MTCWLCVLSLPLLLLPAAPPPAGGCPARCECTVQTRAVACTRRR
                     LTAVPDGIPAETRLLELSRNRIRCLNPGDLAALPALEELDLSENAIAHVEPGAFANLP
                     RLRVLRLRGNQLKLIPPGVFTRLDNLTLLDLSENKLVILLDYTFQDLHSLRRLEVGDN
                     DLVFVSRRAFAGLLALEELTLERCNLTALSGESLGHLRSLGALRLRHLAIASLEDQNF
                     RRLPGLLHLEIDNWPLLEEVAAGSLRGLNLTSLSVTHTNITAVPAAALRHQAHLTCLN
                     LSHNPISTVPRGSFRDLVRLRELHLAGALLAVVEPQAFLGLRQIRLLNLSNNLLSTLE
                     ESTFHSVNTLETLRVDGNPLACDCRLLWIVQRRKTLNFDGRLPACATPAEVRGDALRN
                     LPDSVLFEYFVCRKPKIRERRLQRVTATAGEDVRFLCRAEGEPAPTVAWVTPQHRPVT
                     ATSAGRARVLPGGTLEIQDARPQDSGTYTCVASNAGGNDTYFATLTVRPEPAANRTPG
                     EAHNETLAALRAPLDLTTILVSTAMGCITFLGVVLFCFVLLFVWSRGRGQHKNNFSVE
                     YSFRKVDGPAAAAGQGGARKFNMKMI"
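
The location strings in the LINGO3 record above can be unpacked with a few lines of Python. This is a minimal sketch, not a full INSDC location parser: the function names `parse_location` and `spliced_length` are hypothetical, and only plain `a..b` spans wrapped in `join(...)` and/or `complement(...)` are handled (no partial `<`/`>` ends, single-base, or remote locations).

```python
import re

def parse_location(location):
    """Parse a simple INSDC location into (strand, [(start, end), ...]).

    strand is -1 for complement(...) locations and 1 otherwise.
    Coordinates are kept 1-based and inclusive, as in the flat file.
    """
    text = "".join(location.split())  # drop whitespace and line wraps
    strand = 1
    if text.startswith("complement(") and text.endswith(")"):
        strand = -1
        text = text[len("complement("):-1]
    if text.startswith("join(") and text.endswith(")"):
        text = text[len("join("):-1]
    segments = []
    for part in text.split(","):
        match = re.fullmatch(r"(\d+)\.\.(\d+)", part)
        if match is None:
            raise ValueError(f"unsupported location element: {part!r}")
        segments.append((int(match.group(1)), int(match.group(2))))
    return strand, segments

def spliced_length(segments):
    """Total number of bases covered by inclusive (start, end) segments."""
    return sum(end - start + 1 for start, end in segments)

# Locations copied from the LINGO3 mRNA and CDS features above.
lingo3_mrna = parse_location("complement(join(2289774..2291821,2308075..2308156))")
lingo3_cds = parse_location("complement(2289997..2291775)")

print(lingo3_mrna)
print(spliced_length(lingo3_mrna[1]), spliced_length(lingo3_cds[1]))
```

For LINGO3 the CDS is a single span of 1779 bases (593 codons) that lies entirely inside the first listed mRNA segment, so the CDS location alone says nothing about the second exon.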

http://togows.org/entry/nucleotide/NC_000013 (Human genome GRCh38 chr13)

LOCUS       NC_000013          114364328 bp    DNA     linear   CON 03-FEB-2014
DEFINITION  Homo sapiens chromosome 13, GRCh38 Primary Assembly.
ACCESSION   NC_000013 GPC_000001305
VERSION     NC_000013.11  GI:568815585
DBLINK      BioProject: PRJNA168
            Assembly: GCF_000001405.26
KEYWORDS    RefSeq.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 114364328)
  CONSRTM   International Human Genome Sequencing Consortium
  TITLE     Finishing the euchromatic sequence of the human genome
  JOURNAL   Nature 431 (7011), 931-945 (2004)
   PUBMED   15496913

  :

COMMENT     REFSEQ INFORMATION: The reference sequence is identical to
            CM000675.2.
            On Feb 3, 2014 this sequence version replaced gi:224589804.
            Assembly Name: GRCh38 Primary Assembly
            The DNA sequence is composed of genomic sequence, primarily
            finished clones that were sequenced as part of the Human Genome
            Project. PCR products and WGS shotgun sequence have been added
            where necessary to fill gaps or correct errors. All such additions
            are manually curated by GRC staff. For more information see:
            http://genomereference.org.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Version          :: Homo sapiens Annotation Release 106
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 5.2
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..114364328
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /chromosome="13"
     assembly_gap    1..10000
                     /estimated_length=10000
                     /gap_type="telomere"
     assembly_gap    10001..16000000
                     /estimated_length=15990000
                     /gap_type="short_arm"
     centromere      16000001..18051248
                     /note="Linear centromere model derived predominantly from
                     reads generated in PMID: 17803354. This region does not
                     represent an actual centromere sequence, as long-range
                     ordering of repeats and unmapped WGS contigs is not
                     provided by the model. For details of model production,
                     see http://arxiv.org/abs/1307.0035."

       :

     gene            32315480..32399672
                     /gene="BRCA2"
                     /gene_synonym="BRCC2; BROVCA2; FACD; FAD; FAD1; FANCB;
                     FANCD; FANCD1; GLM3; PNCA2"
                     /note="breast cancer 2, early onset; Derived by automated
                     computational analysis using gene prediction method:
                     BestRefSeq."
                     /db_xref="GeneID:675"
                     /db_xref="HGNC:1101"
                     /db_xref="MIM:600185"
     mRNA            join(32315480..32315667,32316422..32316527,
                     32319077..32319325,32325076..32325184,32326101..32326150,
                     32326242..32326282,32326499..32326613,32329443..32329492,
                     32330919..32331030,32332272..32333387,32336265..32341196,
                     32344558..32344653,32346827..32346896,32354861..32355288,
                     32356428..32356609,32357742..32357929,32362523..32362693,
                     32363179..32363533,32370402..32370557,32370956..32371100,
                     32376670..32376791,32379317..32379515,32379750..32379913,
                     32380007..32380145,32394689..32394933,32396898..32397044,
                     32398162..32399672)
                     /gene="BRCA2"
                     /gene_synonym="BRCC2; BROVCA2; FACD; FAD; FAD1; FANCB;
                     FANCD; FANCD1; GLM3; PNCA2"
                     /product="breast cancer 2, early onset"
                     /inference="similar to RNA sequence, mRNA (same
                     species):RefSeq:NM_000059.3"
                     /exception="annotated by transcript or proteomic data"
                     /note="The RefSeq transcript has 1 substitution compared
                     to this genomic sequence; Derived by automated
                     computational analysis using gene prediction method:
                     BestRefSeq."
                     /transcript_id="NM_000059.3"
                     /db_xref="GI:119395733"
                     /db_xref="GeneID:675"
                     /db_xref="HGNC:1101"
                     /db_xref="MIM:600185"
     CDS             join(32316461..32316527,32319077..32319325,
                     32325076..32325184,32326101..32326150,32326242..32326282,
                     32326499..32326613,32329443..32329492,32330919..32331030,
                     32332272..32333387,32336265..32341196,32344558..32344653,
                     32346827..32346896,32354861..32355288,32356428..32356609,
                     32357742..32357929,32362523..32362693,32363179..32363533,
                     32370402..32370557,32370956..32371100,32376670..32376791,
                     32379317..32379515,32379750..32379913,32380007..32380145,
                     32394689..32394933,32396898..32397044,32398162..32398770)
                     /gene="BRCA2"
                     /gene_synonym="BRCC2; BROVCA2; FACD; FAD; FAD1; FANCB;
                     FANCD; FANCD1; GLM3; PNCA2"
                     /inference="similar to AA sequence (same
                     species):RefSeq:NP_000050.2"
                     /exception="annotated by transcript or proteomic data"
                     /note="The RefSeq protein has 1 substitution compared to
                     this genomic sequence; Derived by automated computational
                     analysis using gene prediction method: BestRefSeq."
                     /codon_start=1
                     /product="breast cancer type 2 susceptibility protein"
                     /protein_id="NP_000050.2"
                     /db_xref="GI:119395734"
                     /db_xref="CCDS:CCDS9344.1"
                     /db_xref="GeneID:675"
                     /db_xref="HGNC:1101"
                     /db_xref="MIM:600185"
                     /translation="MPIGSKERPTFFEIFKTRCNKADLGPISLNWFEELSSEAPPYNS
                     EPAEESEHKNNNYEPNLFKTPQRKPSYNQLASTPIIFKEQGLTLPLYQSPVKELDKFK
                     LDLGRNVPNSRHKSLRTVKTKMDQADDVSCPLLNSCLSESPVVLQCTHVTPQRDKSVV
                     CGSLFHTPKFVKGRQTPKHISESLGAEVDPDMSWSSSLATPPTLSSTVLIVRNEEASE
                     TVFPHDTTANVKSYFSNHDESLKKNDRFIASVTDSENTNQREAASHGFGKTSGNSFKV
                     NSCKDHIGKSMPNVLEDEVYETVVDTSEEDSFSLCFSKCRTKNLQKVRTSKTRKKIFH
                     EANADECEKSKNQVKEKYSFVSEVEPNDTDPLDSNVANQKPFESGSDKISKEVVPSLA
                     CEWSQLTLSGLNGAQMEKIPLLHISSCDQNISEKDLLDTENKRKKDFLTSENSLPRIS
                     SLPKSEKPLNEETVVNKRDEEQHLESHTDCILAVKQAISGTSPVASSFQGIKKSIFRI
                     RESPKETFNASFSGHMTDPNFKKETEASESGLEIHTVCSQKEDSLCPNLIDNGSWPAT
                     TTQNSVALKNAGLISTLKKKTNKFIYAIHDETSYKGKKIPKDQKSELINCSAQFEANA
                     FEAPLTFANADSGLLHSSVKRSCSQNDSEEPTLSLTSSFGTILRKCSRNETCSNNTVI
                     SQDLDYKEAKCNKEKLQLFITPEADSLSCLQEGQCENDPKSKKVSDIKEEVLAAACHP
                     VQHSKVEYSDTDFQSQKSLLYDHENASTLILTPTSKDVLSNLVMISRGKESYKMSDKL
                     KGNNYESDVELTKNIPMEKNQDVCALNENYKNVELLPPEKYMRVASPSRKVQFNQNTN
                     LRVIQKNQEETTSISKITVNPDSEELFSDNENNFVFQVANERNNLALGNTKELHETDL
                     TCVNEPIFKNSTMVLYGDTGDKQATQVSIKKDLVYVLAEENKNSVKQHIKMTLGQDLK
                     SDISLNIDKIPEKNNDYMNKWAGLLGPISNHSFGGSFRTASNKEIKLSEHNIKKSKMF
                     FKDIEEQYPTSLACVEIVNTLALDNQKKLSKPQSINTVSAHLQSSVVVSDCKNSHITP
                     QMLFSKQDFNSNHNLTPSQKAEITELSTILEESGSQFEFTQFRKPSYILQKSTFEVPE
                     NQMTILKTTSEECRDADLHVIMNAPSIGQVDSSKQFEGTVEIKRKFAGLLKNDCNKSA
                     SGYLTDENEVGFRGFYSAHGTKLNVSTEALQKAVKLFSDIENISEETSAEVHPISLSS
                     SKCHDSVVSMFKIENHNDKTVSEKNNKCQLILQNNIEMTTGTFVEEITENYKRNTENE
                     DNKYTAASRNSHNLEFDGSDSSKNDTVCIHKDETDLLFTDQHNICLKLSGQFMKEGNT
                     QIKEDLSDLTFLEVAKAQEACHGNTSNKEQLTATKTEQNIKDFETSDTFFQTASGKNI
                     SVAKESFNKIVNFFDQKPEELHNFSLNSELHSDIRKNKMDILSYEETDIVKHKILKES
                     VPVGTGNQLVTFQGQPERDEKIKEPTLLGFHTASGKKVKIAKESLDKVKNLFDEKEQG
                     TSEITSFSHQWAKTLKYREACKDLELACETIEITAAPKCKEMQNSLNNDKNLVSIETV
                     VPPKLLSDNLCRQTENLKTSKSIFLKVKVHENVEKETAKSPATCYTNQSPYSVIENSA
                     LAFYTSCSRKTSVSQTSLLEAKKWLREGIFDGQPERINTADYVGNYLYENNSNSTIAE
                     NDKNHLSEKQDTYLSNSSMSNSYSYHSDEVYNDSGYLSKNKLDSGIEPVLKNVEDQKN
                     TSFSKVISNVKDANAYPQTVNEDICVEELVTSSSPCKNKNAAIKLSISNSNNFEVGPP
                     AFRIASGKIVCVSHETIKKVKDIFTDSFSKVIKENNENKSKICQTKIMAGCYEALDDS
                     EDILHNSLDNDECSTHSHKVFADIQSEEILQHNQNMSGLEKVSKISPCDVSLETSDIC
                     KCSIGKLHKSVSSANTCGIFSTASGKSVQVSDASLQNARQVFSEIEDSTKQVFSKVLF
                     KSNEHSDQLTREENTAIRTPEHLISQKGFSYNVVNSSAFSGFSTASGKQVSILESSLH
                     KVKGVLEEFDLIRTEHSLHYSPTSRQNVSKILPRVDKRNPEHCVNSEMEKTCSKEFKL
                     SNNLNVEGGSSENNHSIKVSPYLSQFQQDKQQLVLGTKVSLVENIHVLGKEQASPKNV
                     KMEIGKTETFSDVPVKTNIEVCSTYSKDSENYFETEAVEIAKAFMEDDELTDSKLPSH
                     ATHSLFTCPENEEMVLSNSRIGKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKA
                     SKSTPDGTIKDRRLFMHHVSLEPITCVPFRTTKERQEIQNPNFTAPGQEFLSKSHLYE
                     HLTLEKSSSNLAVSGHPFYQVSATRNEKMRHLITTGRPTKVFVPPFKTKSHFHRVEQC
                     VRNINLEENRQKQNIDGHGSDDSKNKINDNEIHQFNKNNSNQAAAVTFTKCEEEPLDL
                     ITSLQNARDIQDMRIKKKQRQRVFPQPGSLYLAKTSTLPRISLKAAVGGQVPSACSHK
                     QLYTYGVSKHCIKINSKNAESFQFHTEDYFGKESLWTGKGIQLADGGWLIPSNDGKAG
                     KEEFYRALCDTPGVDPKLISRIWVYNHYRWIIWKLAAMECAFPKEFANRCLSPERVLL
                     QLKYRYDTEIDRSRRSAIKKIMERDDTAAKTLVLCVSDIISLSANISETSSNKTSSAD
                     TQKVAIIELTDGWYAVKAQLDPPLLAVLKNGRLTVGQKIILHGAELVGSPDACTPLEA
                     PESLMLKISANSTRPARWYTKLGFFPDPRPFPLPLSSLFSDGGNVGCVDVIIQRAYPI
                     QWMEKTSSGLYIFRNEREEEKEAAKYVEAQQKRLEALFTKIQEEFEEHEENTTKPYLP
                     SRALTRQQVRALQDGAELYEAVKNAADPAYLEGYFSEEQLRALNNHRQMLNDKKQAQI
                     QLEIRKAMESAEQKEQGLSRDVTTVWKLRIVSYSKKEKDSVILSIWRPSSDLYSLLTE
                     GKRYRIYHLATSKSKSKSERANIQLAATKKTQYQQLPVSDEILFQIYQPREPLHFSKF
                     LDPDFQPSCSEVDLIGFVVSVVKKTGLAPFVYLSDECYNLLAIKFWIDLNEDIIKPHM
                     LIAASNLQWRPESKSGLLTLFAGDFSVFSASPKEGHFQETFNKMKNTVENIDILCNEA
                     ENKLMHILHANDPKWSTPTKDCTSGPYTAQIIPGTGNKLLMSSPNCEIYYQSPLSLCM
                     AKRKSVSTPVSAQMTSKSCKGEKEIDDQKNCKKRRALDFLSRLPLPPPVSPICTFVSP
                     AAQKAFQPPRSCGTKYETPIKKKELNSPQMTPFKKFNEISLLESNSIADEELALINTQ
                     ALLSGSTGEKQFISVSESTRTAPTSSEDYLRLKRRCTTSLIKEQESSQASTEECEKNK
                     QDTITTKKYI"
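
One way to see where the CDS and the mRNA of BRCA2 diverge is to compare the two `join(...)` locations segment by segment. The sketch below is only an illustration: the variable and function names are hypothetical, the location strings are copied from the two features above, and the regular expression assumes plain `a..b` spans on the forward strand.

```python
import re

# Location strings copied from the BRCA2 mRNA and CDS features above.
MRNA_LOCATION = """
join(32315480..32315667,32316422..32316527,
32319077..32319325,32325076..32325184,32326101..32326150,
32326242..32326282,32326499..32326613,32329443..32329492,
32330919..32331030,32332272..32333387,32336265..32341196,
32344558..32344653,32346827..32346896,32354861..32355288,
32356428..32356609,32357742..32357929,32362523..32362693,
32363179..32363533,32370402..32370557,32370956..32371100,
32376670..32376791,32379317..32379515,32379750..32379913,
32380007..32380145,32394689..32394933,32396898..32397044,
32398162..32399672)
"""
CDS_LOCATION = """
join(32316461..32316527,32319077..32319325,
32325076..32325184,32326101..32326150,32326242..32326282,
32326499..32326613,32329443..32329492,32330919..32331030,
32332272..32333387,32336265..32341196,32344558..32344653,
32346827..32346896,32354861..32355288,32356428..32356609,
32357742..32357929,32362523..32362693,32363179..32363533,
32370402..32370557,32370956..32371100,32376670..32376791,
32379317..32379515,32379750..32379913,32380007..32380145,
32394689..32394933,32396898..32397044,32398162..32398770)
"""

def spans(location):
    """Extract inclusive (start, end) pairs from a join(...) location."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def total_length(segments):
    return sum(end - start + 1 for start, end in segments)

exons = spans(MRNA_LOCATION)
coding = spans(CDS_LOCATION)

# CDS segments that are not identical to any mRNA segment (trimmed at a UTR).
trimmed = [seg for seg in coding if seg not in set(exons)]

# mRNA segments that do not overlap any CDS segment (entirely untranslated).
untranslated = [
    (es, ee) for es, ee in exons
    if not any(cs <= ee and es <= ce for cs, ce in coding)
]

print(len(exons), total_length(exons))
print(len(coding), total_length(coding))
print(trimmed)
print(untranslated)
```

With these inputs the mRNA has 27 segments (11386 bases) and the CDS has 26 (10257 bases, a multiple of 3). The first exon is entirely untranslated, and the first and last CDS segments are shortened versions of the second and last exons, so the CDS segments are not simply a subset of the exons.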

http://togows.org/entry/nucleotide/NC_000013.ttl (RDF in Turtle for the NC_000013 entry above)

@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix obo: <http://purl.obolibrary.org/obo/> .
@prefix faldo: <http://biohackathon.org/resource/faldo#> .
@prefix insdc: <http://ddbj.nig.ac.jp/ontologies/sequence/> .

### SO:chromosome

<urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434>
  a obo:SO_0000340, obo:SO_0000987 ;  # SO:chromosome, SO:linear
  rdfs:label "Homo sapiens chromosome 13, GRCh38 Primary Assembly." ;
  faldo:location <urn:uuid:d57a11ec-59a9-4d7b-bac7-56b917b743da> ;  # Location:1..114364328
  insdc:chromosome 13 ;  # (should be a string literal e.g., "X"?)
  insdc:dblink <urn:uuid:79037198-5ab5-47f7-b7b1-591240b6f77a> ;  # BioProject:PRJNA168
  insdc:mol_type "genomic DNA" ;
  insdc:organism "Homo sapiens" ;  # (should be a taxonomy URI?)
  insdc:sequence_date "2014-02-03"^^xsd:date ;
  insdc:sequence_fasta <http://togows.org/entry/nucleotide/NC_000013.11.fasta> ;  # (should point to RefSeq entry?)
  insdc:sequence_length 114364328 ;
  insdc:sequence_version "NC_000013.11" ;
  rdfs:seeAlso <http://identifiers.org/ncbigi/GI:568815585>, <http://identifiers.org/pubmed/11237011>, <http://identifiers.org/pubmed/15057823>, <http://identifiers.org/pubmed/15496913>, <http://identifiers.org/refseq/NC_000013.11>, <http://identifiers.org/taxonomy/9606> .

<urn:uuid:79037198-5ab5-47f7-b7b1-591240b6f77a>
  rdfs:label "PRJNA168" ;
  rdfs:seeAlso <http://identifiers.org/bioproject/PRJNA168> .

<urn:uuid:d57a11ec-59a9-4d7b-bac7-56b917b743da>
  faldo:begin <urn:uuid:4c69e38f-9d53-4878-9e1c-c37feda387ae> ;
  faldo:end <urn:uuid:84873470-b28a-4859-991d-607f0556a73b> ;
  insdc:location "1..114364328" ;
  a faldo:Region .

<urn:uuid:4c69e38f-9d53-4878-9e1c-c37feda387ae>
  faldo:position 1 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

<urn:uuid:84873470-b28a-4859-991d-607f0556a73b>
  faldo:position 114364328 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

### SO:gene

<urn:uuid:111eaf02-aef0-447d-86d3-d08c9503d21a>
  a obo:SO_0000704 ;  # SO:gene
  rdfs:label "BRCA2" ;
  faldo:location <urn:uuid:1fae3244-d357-4d88-8aae-5a85c2f3420d> ;
  insdc:gene "BRCA2" ;
  insdc:gene_synonym "BRCC2; BROVCA2; FACD; FAD; FAD1; FANCB; FANCD; FANCD1; GLM3; PNCA2" ;
  insdc:note "breast cancer 2, early onset; Derived by automated computational analysis using gene prediction method: BestRefSeq." ;
  obo:so_part_of <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;  # SO:chromosome
  rdfs:seeAlso <http://identifiers.org/hgnc/1101>, <http://identifiers.org/ncbigene/675>, <http://identifiers.org/omim/600185> .

<urn:uuid:1fae3244-d357-4d88-8aae-5a85c2f3420d>
  faldo:begin <urn:uuid:285d883a-6b42-498b-b7c1-bc9d27a4244a> ;
  faldo:end <urn:uuid:f0d81998-eccd-4166-88bb-09db666da8be> ;
  insdc:location "32315480..32399672" ;
  a faldo:Region .

<urn:uuid:285d883a-6b42-498b-b7c1-bc9d27a4244a>
  faldo:position 32315480 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

<urn:uuid:f0d81998-eccd-4166-88bb-09db666da8be>
  faldo:position 32399672 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

### SO:mRNA

<urn:uuid:30fc864c-a0f2-4a74-b7bc-551f46c9ae9b>
  a obo:SO_0000234 ;  # SO:mRNA
  rdfs:label "BRCA2" ;
  faldo:location <urn:uuid:e156c6ed-a986-45b3-975b-70756aa702cb> ;
  insdc:exception "annotated by transcript or proteomic data" ;
  insdc:gene "BRCA2" ;
  insdc:gene_synonym "BRCC2; BROVCA2; FACD; FAD; FAD1; FANCB; FANCD; FANCD1; GLM3; PNCA2" ;
  insdc:inference "similar to RNA sequence, mRNA (same species):RefSeq:NM_000059.3" ;
  insdc:note "The RefSeq transcript has 1 substitution compared to this genomic sequence; Derived by automated computational analysis using gene prediction method: BestRefSeq." ;
  insdc:product "breast cancer 2, early onset" ;
  insdc:transcript_id "NM_000059.3" ;
  obo:so_has_part (<urn:uuid:4f5eba89-d098-4cbe-97e3-ed449424a659>
    <urn:uuid:c1a4ee53-bf08-475d-ba23-f725b066f2bf>
    <urn:uuid:edd45122-ebf5-4fc9-8b06-da352446d28d>
    <urn:uuid:62294db5-6d1d-48bb-82bf-273e93b8edfb>
    <urn:uuid:b0e0f008-8fef-4000-ad9f-e243e2d4578c>
    <urn:uuid:be04dc41-8f5b-4f4e-ae25-2d8c69521448>
    <urn:uuid:36749a62-99b8-4d4c-a179-7a6ddf290c8d>
    <urn:uuid:34d1c841-5f36-4829-9585-66f881e64951>
    <urn:uuid:72b59757-3bef-44a0-bac2-dbd6b4731029>
    <urn:uuid:a7e8046a-9f7e-4085-92db-e85d984b5c62>
    <urn:uuid:8dc07828-fef7-4bbe-998b-f9fdf3f337d5>
    <urn:uuid:669ade82-aee3-4503-91a6-a970b788f0f3>
    <urn:uuid:15bf32d9-2c9f-48c2-9435-94f71c1edc5a>
    <urn:uuid:df42ef2c-15eb-49d7-8a17-c1cc35f9ff5d>
    <urn:uuid:9c04bf8a-b443-4a95-83b3-16c31edc96da>
    <urn:uuid:e11809d3-9a77-4d7f-9a59-28ba38ebb4d8>
    <urn:uuid:6bdf18b7-5030-4885-a039-66b83fe3da6c>
    <urn:uuid:f7575eb1-a72c-4b90-8dc5-e6ce8b848efd>
    <urn:uuid:8949a9d9-1fdc-4598-8678-ee17597964ef>
    <urn:uuid:e33e4111-9f20-4eaa-b313-d424b718d685>
    <urn:uuid:801bef0e-91ca-4aa6-b80f-cd9240b6209e>
    <urn:uuid:66fec543-b9ef-419f-867d-16131bf700e3>
    <urn:uuid:eddd68f4-80ab-4e53-bbff-f5b872c92218>
    <urn:uuid:e709c5a8-ff8d-46f3-bf42-6ba2613acbb7>
    <urn:uuid:11e61c32-4d79-456f-aedb-1b0a98dcfea1>
    <urn:uuid:20008d49-bbca-41a4-8c38-91041734e6bd>
    <urn:uuid:dc12e0b0-f45d-4154-a126-bea8006767ce>
  ) ;
  obo:so_part_of <urn:uuid:111eaf02-aef0-447d-86d3-d08c9503d21a> ;  # SO:gene
  rdfs:seeAlso <http://identifiers.org/hgnc/1101>, <http://identifiers.org/ncbigene/675>, <http://identifiers.org/ncbigi/GI:119395733>, <http://identifiers.org/omim/600185> .

<urn:uuid:e156c6ed-a986-45b3-975b-70756aa702cb>
  faldo:begin <urn:uuid:70b063b5-adc4-46f0-b218-5371c6c15b7f> ;
  faldo:end <urn:uuid:649ee5cf-3740-421f-9ad5-8b33587e23fe> ;
  insdc:location "join(32315480..32315667,32316422..32316527,32319077..32319325,32325076..32325184,32326101..32326150,32326242..32326282,32326499..32326613,32329443..32329492,32330919..32331030,32332272..32333387,32336265..32341196,32344558..32344653,32346827..32346896,32354861..32355288,32356428..32356609,32357742..32357929,32362523..32362693,32363179..32363533,32370402..32370557,32370956..32371100,32376670..32376791,32379317..32379515,32379750..32379913,32380007..32380145,32394689..32394933,32396898..32397044,32398162..32399672)" ;
  a faldo:Region .

<urn:uuid:70b063b5-adc4-46f0-b218-5371c6c15b7f>
  faldo:position 32315480 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

<urn:uuid:649ee5cf-3740-421f-9ad5-8b33587e23fe>
  faldo:position 32399672 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

### SO:CDS

<urn:uuid:7c341d83-ac71-4651-b216-7cca60d080b9>
  a obo:SO_0000316 ;  # SO:CDS
  rdfs:label "BRCA2" ;
  faldo:location <urn:uuid:c105c69f-62ea-4f10-93c4-f4effaa36a47> ;
  insdc:codon_start 1 ;
  insdc:exception "annotated by transcript or proteomic data" ;
  insdc:gene "BRCA2" ;
  insdc:gene_synonym "BRCC2; BROVCA2; FACD; FAD; FAD1; FANCB; FANCD; FANCD1; GLM3; PNCA2" ;
  insdc:inference "similar to AA sequence (same species):RefSeq:NP_000050.2" ;
  insdc:note "The RefSeq protein has 1 substitution compared to this genomic sequence; Derived by automated computational analysis using gene prediction method: BestRefSeq." ;
  insdc:product "breast cancer type 2 susceptibility protein" ;
  insdc:translation "MPIGSKERPTFFEIFKTRCNKADLGPISLNWFEELSSEAPPYNSEPAEESEHKNNNYEPNLFKTPQRKPSYNQLASTPIIFKEQGLTLPLYQSPVKELDKFKLDLGRNVPNSRHKSLRTVKTKMDQADDVSCPLLNSCLSESPVVLQCTHVTPQRDKSVVCGSLFHTPKFVKGRQTPKHISESLGAEVDPDMSWSSSLATPPTLSSTVLIVRNEEASETVFPHDTTANVKSYFSNHDESLKKNDRFIASVTDSENTNQREAASHGFGKTSGNSFKVNSCKDHIGKSMPNVLEDEVYETVVDTSEEDSFSLCFSKCRTKNLQKVRTSKTRKKIFHEANADECEKSKNQVKEKYSFVSEVEPNDTDPLDSNVANQKPFESGSDKISKEVVPSLACEWSQLTLSGLNGAQMEKIPLLHISSCDQNISEKDLLDTENKRKKDFLTSENSLPRISSLPKSEKPLNEETVVNKRDEEQHLESHTDCILAVKQAISGTSPVASSFQGIKKSIFRIRESPKETFNASFSGHMTDPNFKKETEASESGLEIHTVCSQKEDSLCPNLIDNGSWPATTTQNSVALKNAGLISTLKKKTNKFIYAIHDETSYKGKKIPKDQKSELINCSAQFEANAFEAPLTFANADSGLLHSSVKRSCSQNDSEEPTLSLTSSFGTILRKCSRNETCSNNTVISQDLDYKEAKCNKEKLQLFITPEADSLSCLQEGQCENDPKSKKVSDIKEEVLAAACHPVQHSKVEYSDTDFQSQKSLLYDHENASTLILTPTSKDVLSNLVMISRGKESYKMSDKLKGNNYESDVELTKNIPMEKNQDVCALNENYKNVELLPPEKYMRVASPSRKVQFNQNTNLRVIQKNQEETTSISKITVNPDSEELFSDNENNFVFQVANERNNLALGNTKELHETDLTCVNEPIFKNSTMVLYGDTGDKQATQVSIKKDLVYVLAEENKNSVKQHIKMTLGQDLKSDISLNIDKIPEKNNDYMNKWAGLLGPISNHSFGGSFRTASNKEIKLSEHNIKKSKMFFKDIEEQYPTSLACVEIVNTLALDNQKKLSKPQSINTVSAHLQSSVVVSDCKNSHITPQMLFSKQDFNSNHNLTPSQKAEITELSTILEESGSQFEFTQFRKPSYILQKSTFEVPENQMTILKTTSEECRDADLHVIMNAPSIGQVDSSKQFEGTVEIKRKFAGLLKNDCNKSASGYLTDENEVGFRGFYSAHGTKLNVSTEALQKAVKLFSDIENISEETSAEVHPISLSSSKCHDSVVSMFKIENHNDKTVSEKNNKCQLILQNNIEMTTGTFVEEITENYKRNTENEDNKYTAASRNSHNLEFDGSDSSKNDTVCIHKDETDLLFTDQHNICLKLSGQFMKEGNTQIKEDLSDLTFLEVAKAQEACHGNTSNKEQLTATKTEQNIKDFETSDTFFQTASGKNISVAKESFNKIVNFFDQKPEELHNFSLNSELHSDIRKNKMDILSYEETDIVKHKILKESVPVGTGNQLVTFQGQPERDEKIKEPTLLGFHTASGKKVKIAKESLDKVKNLFDEKEQGTSEITSFSHQWAKTLKYREACKDLELACETIEITAAPKCKEMQNSLNNDKNLVSIETVVPPKLLSDNLCRQTENLKTSKSIFLKVKVHENVEKETAKSPATCYTNQSPYSVIENSALAFYTSCSRKTSVSQTSLLEAKKWLREGIFDGQPERINTADYVGNYLYENNSNSTIAENDKNHLSEKQDTYLSNSSMSNSYSYHSDEVYNDSGYLSKNKLDSGIEPVLKNVEDQKNTSFSKVISNVKDANAYPQTVNEDICVEELVTSSSPCKNKNAAIKLSISNSNNFEVGPPAFRIASGKIVCVSHETIKKVKDIFTDSFSKVIKENNENKSKICQTKIMAGCYEALDDSEDILHNSLDNDECSTHSHKVFADIQSEEILQHNQNMSGLEKVSKISPCDVSLETSDICKCSIGKLHKSVSSANTCGIFS
TASGKSVQVSDASLQNARQVFSEIEDSTKQVFSKVLFKSNEHSDQLTREENTAIRTPEHLISQKGFSYNVVNSSAFSGFSTASGKQVSILESSLHKVKGVLEEFDLIRTEHSLHYSPTSRQNVSKILPRVDKRNPEHCVNSEMEKTCSKEFKLSNNLNVEGGSSENNHSIKVSPYLSQFQQDKQQLVLGTKVSLVENIHVLGKEQASPKNVKMEIGKTETFSDVPVKTNIEVCSTYSKDSENYFETEAVEIAKAFMEDDELTDSKLPSHATHSLFTCPENEEMVLSNSRIGKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKASKSTPDGTIKDRRLFMHHVSLEPITCVPFRTTKERQEIQNPNFTAPGQEFLSKSHLYEHLTLEKSSSNLAVSGHPFYQVSATRNEKMRHLITTGRPTKVFVPPFKTKSHFHRVEQCVRNINLEENRQKQNIDGHGSDDSKNKINDNEIHQFNKNNSNQAAAVTFTKCEEEPLDLITSLQNARDIQDMRIKKKQRQRVFPQPGSLYLAKTSTLPRISLKAAVGGQVPSACSHKQLYTYGVSKHCIKINSKNAESFQFHTEDYFGKESLWTGKGIQLADGGWLIPSNDGKAGKEEFYRALCDTPGVDPKLISRIWVYNHYRWIIWKLAAMECAFPKEFANRCLSPERVLLQLKYRYDTEIDRSRRSAIKKIMERDDTAAKTLVLCVSDIISLSANISETSSNKTSSADTQKVAIIELTDGWYAVKAQLDPPLLAVLKNGRLTVGQKIILHGAELVGSPDACTPLEAPESLMLKISANSTRPARWYTKLGFFPDPRPFPLPLSSLFSDGGNVGCVDVIIQRAYPIQWMEKTSSGLYIFRNEREEEKEAAKYVEAQQKRLEALFTKIQEEFEEHEENTTKPYLPSRALTRQQVRALQDGAELYEAVKNAADPAYLEGYFSEEQLRALNNHRQMLNDKKQAQIQLEIRKAMESAEQKEQGLSRDVTTVWKLRIVSYSKKEKDSVILSIWRPSSDLYSLLTEGKRYRIYHLATSKSKSKSERANIQLAATKKTQYQQLPVSDEILFQIYQPREPLHFSKFLDPDFQPSCSEVDLIGFVVSVVKKTGLAPFVYLSDECYNLLAIKFWIDLNEDIIKPHMLIAASNLQWRPESKSGLLTLFAGDFSVFSASPKEGHFQETFNKMKNTVENIDILCNEAENKLMHILHANDPKWSTPTKDCTSGPYTAQIIPGTGNKLLMSSPNCEIYYQSPLSLCMAKRKSVSTPVSAQMTSKSCKGEKEIDDQKNCKKRRALDFLSRLPLPPPVSPICTFVSPAAQKAFQPPRSCGTKYETPIKKKELNSPQMTPFKKFNEISLLESNSIADEELALINTQALLSGSTGEKQFISVSESTRTAPTSSEDYLRLKRRCTTSLIKEQESSQASTEECEKNKQDTITTKKYI" ;
  obo:so_has_part (<urn:uuid:a105e0f2-c7dd-47d5-b9cf-9afb85cc48b7>
    <urn:uuid:cec5f214-f49b-4226-b560-f08dcf06b2e5>
    <urn:uuid:a27fe315-b4ae-4b6c-8533-06c729cbc377>
    <urn:uuid:f2931c4a-3224-4483-ae08-4edcf79a9a90>
    <urn:uuid:baa6a449-214e-40f5-bb83-a0daa76a3dd6>
    <urn:uuid:9cf9294f-e183-4d06-8420-589c9feea2a5>
    <urn:uuid:b1f09a10-efb2-42f6-83fd-9e58bcf9d31e>
    <urn:uuid:3043988a-9dc5-4a4f-9d30-a6bd1d59aa0d>
    <urn:uuid:ce41afc0-303d-4a91-8e2c-47d626802a5e>
    <urn:uuid:26f2d784-bd23-4459-8c90-7b7d1c3aee63>
    <urn:uuid:80c4e39a-64b1-4a14-bd80-87bb1fccec17>
    <urn:uuid:6dcac1b5-f589-42c0-9732-62d3652570f7>
    <urn:uuid:94da7ade-8b1b-40eb-8fe9-49c6da70aefc>
    <urn:uuid:7cf1088f-575c-4cf6-beec-7851f251bef6>
    <urn:uuid:c2db2956-b989-448e-85a0-183d166934c6>
    <urn:uuid:03e74f9d-badb-4706-97a0-d286f80a97f0>
    <urn:uuid:7512ba7b-afac-4081-98e7-dd2d49884be7>
    <urn:uuid:1cf56c3b-8dce-41c7-95df-717c0e1527ec>
    <urn:uuid:29931521-3c11-4784-b4b9-ef5b1775c7f7>
    <urn:uuid:7bf26190-4c22-459f-a879-3cfc99c4cccc>
    <urn:uuid:a8bf2410-d7fa-4797-9ed9-a80a7b02b757>
    <urn:uuid:1658fdfc-2b2a-4fdb-9973-797e220cee71>
    <urn:uuid:a2330f3d-f608-42b6-aebe-998e5f47b745>
    <urn:uuid:c3fc0d32-c4e4-4639-adaa-2ba461dbf0a5>
    <urn:uuid:5d0899e9-418f-40c8-90a3-c65332de1e98>
    <urn:uuid:93419b50-ee11-4f57-a6c9-2781970dc21e>
  ) ;
  obo:so_part_of <urn:uuid:111eaf02-aef0-447d-86d3-d08c9503d21a> ;  # SO:gene (-> SO:mRNA? and share the same exon URIs?)
  rdfs:seeAlso <http://identifiers.org/ccds/CCDS9344.1>, <http://identifiers.org/hgnc/1101>, <http://identifiers.org/ncbigene/675>, <http://identifiers.org/ncbigi/GI:119395734>, <http://identifiers.org/ncbiprotein/NP_000050.2>, <http://identifiers.org/omim/600185> .

<urn:uuid:c105c69f-62ea-4f10-93c4-f4effaa36a47>
  faldo:begin <urn:uuid:7abb9350-b799-46d4-8b05-5fc0dbc9e00c> ;
  faldo:end <urn:uuid:b01bb356-b63d-4f82-ad40-13dbd377f565> ;
  insdc:location "join(32316461..32316527,32319077..32319325,32325076..32325184,32326101..32326150,32326242..32326282,32326499..32326613,32329443..32329492,32330919..32331030,32332272..32333387,32336265..32341196,32344558..32344653,32346827..32346896,32354861..32355288,32356428..32356609,32357742..32357929,32362523..32362693,32363179..32363533,32370402..32370557,32370956..32371100,32376670..32376791,32379317..32379515,32379750..32379913,32380007..32380145,32394689..32394933,32396898..32397044,32398162..32398770)" ;
  a faldo:Region .

<urn:uuid:7abb9350-b799-46d4-8b05-5fc0dbc9e00c>
  faldo:position 32316461 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

<urn:uuid:b01bb356-b63d-4f82-ad40-13dbd377f565>
  faldo:position 32398770 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

### SO:exon

<urn:uuid:a105e0f2-c7dd-47d5-b9cf-9afb85cc48b7>
  faldo:begin <urn:uuid:dbcbaadd-407e-4590-932e-2ca5a21e1c9e> ;
  faldo:end <urn:uuid:a81cb138-96a0-458c-b963-cd5b4af0fd01> ;
  obo:so_part_of <urn:uuid:c105c69f-62ea-4f10-93c4-f4effaa36a47> ;
  a faldo:Region, obo:SO_0000147 .  # SO:exon

  :

<urn:uuid:93419b50-ee11-4f57-a6c9-2781970dc21e>
  faldo:begin <urn:uuid:90c43947-789e-4c39-a502-357e539fa71e> ;
  faldo:end <urn:uuid:200030b9-8545-4f0a-bf58-c99f4d76c3fe> ;
  obo:so_part_of <urn:uuid:c105c69f-62ea-4f10-93c4-f4effaa36a47> ;
  a faldo:Region, obo:SO_0000147 .  # SO:exon

<urn:uuid:90c43947-789e-4c39-a502-357e539fa71e>
  faldo:position 32398162 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .

<urn:uuid:200030b9-8545-4f0a-bf58-c99f4d76c3fe>
  faldo:position 32398770 ;
  faldo:reference <urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434> ;
  a faldo:ExactPosition, faldo:ForwardStrandPosition .
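
The region/begin/end pattern used throughout the Turtle above can be reproduced with plain string formatting. This is a rough sketch and not the converter that produced the listing: the function names are hypothetical, fresh `urn:uuid:` identifiers are generated with Python's `uuid` module, the prefixes are assumed to be declared as at the top of the Turtle, and the caller is assumed to decide which coordinate counts as begin and end on the reverse strand.

```python
import uuid

def new_uri():
    """Mint a fresh urn:uuid URI in Turtle angle-bracket form."""
    return f"<urn:uuid:{uuid.uuid4()}>"

def faldo_position(uri, position, reference, strand):
    """Serialize one exact position on the given reference sequence."""
    strand_class = (
        "faldo:ForwardStrandPosition" if strand >= 0
        else "faldo:ReverseStrandPosition"
    )
    return (
        f"{uri}\n"
        f"  faldo:position {position} ;\n"
        f"  faldo:reference {reference} ;\n"
        f"  a faldo:ExactPosition, {strand_class} .\n"
    )

def faldo_region(begin, end, reference, insdc_location, strand=1):
    """Return (region_uri, turtle_text) for one region with two positions."""
    region, begin_uri, end_uri = new_uri(), new_uri(), new_uri()
    region_block = (
        f"{region}\n"
        f"  faldo:begin {begin_uri} ;\n"
        f"  faldo:end {end_uri} ;\n"
        f'  insdc:location "{insdc_location}" ;\n'
        f"  a faldo:Region .\n"
    )
    blocks = [
        region_block,
        faldo_position(begin_uri, begin, reference, strand),
        faldo_position(end_uri, end, reference, strand),
    ]
    return region, "\n".join(blocks)

# Chromosome URI and BRCA2 gene coordinates taken from the listing above.
chromosome = "<urn:uuid:02856d67-76b5-4ac3-8cb2-906a70cf4434>"
region_uri, turtle = faldo_region(
    32315480, 32399672, chromosome, "32315480..32399672"
)
print(turtle)
```

Because the identifiers are random, two runs give different URIs; a real converter would probably want stable identifiers so that, for example, an mRNA and a CDS could share the same exon URIs.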

http://togows.org/entry/nucleotide/NC_000013.gff

TBA