# Identification and selection of the target virus
The dengue virus (DENV) reference genome was identified in the *Genome* resource of the **National Center for Biotechnology Information** (*NCBI*) database (https://www.ncbi.nlm.nih.gov/genome/?term=dengue+virus) (*Sayers et al., 2022*). The reference sequence *NC_001474.2*, which corresponds to DENV serotype 2 (DENV-2), was selected to identify the 10 target proteins covered in this study: <u>3 structural proteins</u> (*ancC*, *prM* and *E*) and <u>7 non-structural proteins</u> (*NS1*, *NS2A*, *NS2B*, *NS3*, *NS4A*, *NS4B* and *NS5*).
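
For later steps it can help to keep the ten target proteins in a small lookup table. The following is a minimal sketch, not part of the original analysis: the names `TARGET_PROTEINS` and `all_targets` are hypothetical, and the protein labels are copied from the paragraph above.

```python
# Hypothetical lookup table of the ten DENV-2 target proteins named in the text.
TARGET_PROTEINS = {
    "structural": ["ancC", "prM", "E"],
    "non_structural": ["NS1", "NS2A", "NS2B", "NS3", "NS4A", "NS4B", "NS5"],
}


def all_targets(groups=TARGET_PROTEINS):
    """Return every target protein label as one flat list, structural first."""
    return [name
            for group in ("structural", "non_structural")
            for name in groups[group]]


print(len(all_targets()))  # 10
```

Keeping the two groups separate makes it easy to filter annotations by class later while still checking that the total matches the 10 proteins stated in the text.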

In [4]:
# Install the 'biopython' package
!pip install biopython

Defaulting to user installation because normal site-packages is not writeable


In [5]:
# Import Biopython (Bio) and the 'Entrez' and 'SeqIO' modules
import Bio
from Bio import Entrez
from Bio import SeqIO

In [6]:
# Set the contact e-mail address sent with Entrez requests
Entrez.email = 'victor.cornejo@unmsm.edu.pe'

In [86]:
# Display a summary of the sequence with id 'NC_001474.2'
handle = Entrez.efetch(db = 'nuccore', #database
                       id = 'NC_001474.2', #identifier
                       rettype = 'gb',
                       retmode = 'text')
whole_sequence_gb = SeqIO.read(handle, 'genbank')
print(whole_sequence_gb)

ID: NC_001474.2
Name: NC_001474
Description: Dengue virus 2, complete genome
Database cross-references: BioProject:PRJNA485481
Number of features: 37
/molecule_type=RNA
/topology=linear
/data_file_division=VRL
/date=11-JUL-2019
/accessions=['NC_001474']
/sequence_version=2
/keywords=['RefSeq']
/source=Dengue virus type 2
/organism=Dengue virus type 2
/taxonomy=['Viruses', 'Riboviria', 'Orthornavirae', 'Kitrinoviricota', 'Flasuviricetes', 'Amarillovirales', 'Flaviviridae', 'Flavivirus']
/references=[Reference(title='Construction of infectious cDNA clones for dengue 2 virus: strain 16681 and its attenuated vaccine derivative, strain PDK-53', ...), Reference(title='Direct Submission', ...), Reference(title='Direct Submission', ...)]
/comment=REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence was derived from U87411.
On Nov 1, 2007 this sequence version replaced NC_001474.1.
The mature peptides were added by the NCBI staff following other
annotations for De
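
The summary above reports 37 features, and the record comment states that mature peptides were added by NCBI staff. One possible way to list them is sketched below. It assumes those annotations are stored as features of type `mat_peptide` carrying a `product` qualifier, which should be checked against the actual record. The function name `mature_peptide_products` and the toy record are hypothetical; the toy record stands in for a parsed record only so that the sketch runs offline.

```python
from types import SimpleNamespace


def mature_peptide_products(record):
    """Collect the 'product' text of every 'mat_peptide' feature in a record."""
    products = []
    for feature in record.features:
        if feature.type == "mat_peptide":
            # Qualifier values are assumed to be lists of strings,
            # as in records parsed from GenBank text.
            products.extend(feature.qualifiers.get("product", []))
    return products


# Toy stand-in for a parsed record (NOT real annotation data).
toy_record = SimpleNamespace(features=[
    SimpleNamespace(type="source", qualifiers={"organism": ["toy virus"]}),
    SimpleNamespace(type="mat_peptide", qualifiers={"product": ["toy protein A"]}),
    SimpleNamespace(type="mat_peptide", qualifiers={"product": ["toy protein B"]}),
])
print(mature_peptide_products(toy_record))
```

With the notebook's own object, the call would be `mature_peptide_products(whole_sequence_gb)`; the resulting names can then be compared with the 10 target proteins.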

In [83]:
# Show the reference sequence of the complete DENV-2 genome (GenBank format)
handle = Entrez.efetch(db = 'nucleotide',
                       id = 'NC_001474.2',
                       rettype = 'gb',
                       retmode = 'text')
print(handle.read())

LOCUS       NC_001474              10723 bp    RNA     linear   VRL 11-JUL-2019
DEFINITION  Dengue virus 2, complete genome.
ACCESSION   NC_001474
VERSION     NC_001474.2
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      Dengue virus type 2
  ORGANISM  Dengue virus type 2
            Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
            Amarillovirales; Flaviviridae; Flavivirus.
REFERENCE   1  (bases 1 to 10723)
  AUTHORS   Kinney,R.M., Butrapet,S., Chang,G.J., Tsuchiya,K.R., Roehrig,J.T.,
            Bhamarapravati,N. and Gubler,D.J.
  TITLE     Construction of infectious cDNA clones for dengue 2 virus: strain
            16681 and its attenuated vaccine derivative, strain PDK-53
  JOURNAL   Virology 230 (2), 300-308 (1997)
   PUBMED   9143286
REFERENCE   2  (bases 1 to 10723)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-NOV-2007) National Center for Biotechnology
            Information, NIH, Bethe

In [84]:
# Save the reference sequence of the complete DENV-2 genome (GenBank format)
handle = Entrez.efetch(db = 'nucleotide', id = 'NC_001474.2', rettype = 'gb', retmode = 'text')
record = SeqIO.read(handle, 'gb')
outputname = '/home/victor/Escritorio/Tesis/RESULTADOS/1. Identificación y selección del virus objetivo/NC_001474_Dengue virus 2.gb'
SeqIO.write(record,outputname,'gb')

1
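
The save cell above writes to a hard-coded absolute path, which fails if the folder does not exist yet. A minimal sketch of building the path with `pathlib` and creating the folder first is shown here. The helper name `output_path` and the demonstration directory are hypothetical and only illustrate the idea.

```python
from pathlib import Path
import tempfile


def output_path(base_dir, accession, label, extension):
    """Build '<base_dir>/<accession>_<label>.<extension>', creating base_dir if needed."""
    directory = Path(base_dir)
    directory.mkdir(parents=True, exist_ok=True)
    return directory / f"{accession}_{label}.{extension}"


# Demonstration in a throwaway directory (hypothetical location).
demo_dir = Path(tempfile.mkdtemp()) / "results" / "target_virus"
gb_path = output_path(demo_dir, "NC_001474", "Dengue virus 2", "gb")
print(gb_path.name)  # NC_001474_Dengue virus 2.gb
```

In the notebook, the result could be passed on as `SeqIO.write(record, str(gb_path), 'gb')`, and the same helper would serve the FASTA file by changing the extension.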

In [80]:
# Show the reference sequence of the complete DENV-2 genome (FASTA format)
handle = Entrez.efetch(db = 'nucleotide',
                       id = 'NC_001474.2',
                       rettype = 'fasta',
                       retmode = 'text')
print(handle.read())

>NC_001474.2 Dengue virus 2, complete genome
AGTTGTTAGTCTACGTGGACCGACAAAGACAGATTCTTTGAGGGAGCTAAGCTCAACGTAGTTCTAACAG
TTTTTTAATTAGAGAGCAGATCTCTGATGAATAACCAACGGAAAAAGGCGAAAAACACGCCTTTCAATAT
GCTGAAACGCGAGAGAAACCGCGTGTCGACTGTGCAACAGCTGACAAAGAGATTCTCACTTGGAATGCTG
CAGGGACGAGGACCATTAAAACTGTTCATGGCCCTGGTGGCGTTCCTTCGTTTCCTAACAATCCCACCAA
CAGCAGGGATATTGAAGAGATGGGGAACAATTAAAAAATCAAAAGCTATTAATGTTTTGAGAGGGTTCAG
GAAAGAGATTGGAAGGATGCTGAACATCTTGAATAGGAGACGCAGATCTGCAGGCATGATCATTATGCTG
ATTCCAACAGTGATGGCGTTCCATTTAACCACACGTAACGGAGAACCACACATGATCGTCAGCAGACAAG
AGAAAGGGAAAAGTCTTCTGTTTAAAACAGAGGATGGCGTGAACATGTGTACCCTCATGGCCATGGACCT
TGGTGAATTGTGTGAAGACACAATCACGTACAAGTGTCCCCTTCTCAGGCAGAATGAGCCAGAAGACATA
GACTGTTGGTGCAACTCTACGTCCACGTGGGTAACTTATGGGACGTGTACCACCATGGGAGAACATAGAA
GAGAAAAAAGATCAGTGGCACTCGTTCCACATGTGGGAATGGGACTGGAGACACGAACTGAAACATGGAT
GTCATCAGAAGGGGCCTGGAAACATGTCCAGAGAATTGAAACTTGGATCTTGAGACATCCAGGCTTCACC
ATGATGGCAGCAATCCTGGCATACACCATAGGAACGACACATTTCCAAAGAGCCCTGATTTTCATCTTAC
TGACAGCTGTCACTCCTTCAATGACAATGCGT
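
Because the printed sequence is cut off in the output, its completeness cannot be judged by eye. A simple check is to count the residues in the FASTA text and compare the total with the 10723 bp reported in the LOCUS line of the GenBank record. The sketch below uses only the standard library; the function name `fasta_sequence_length` is hypothetical, and the toy input is not the real genome.

```python
def fasta_sequence_length(fasta_text):
    """Return (header, length) for a single-record FASTA string."""
    lines = [line.strip() for line in fasta_text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]
    # Sum the residues over all sequence lines after the header.
    length = sum(len(line) for line in lines[1:])
    return header, length


# Toy input (NOT the real genome), only to show the call.
toy_fasta = ">toy_id toy description\nACGTAC\nGT\n"
print(fasta_sequence_length(toy_fasta))  # ('toy_id toy description', 8)
```

To apply it to the downloaded genome, the text would first be stored, for example `text = handle.read()`, and the returned length compared with 10723.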

In [82]:
# Save the reference sequence of the complete DENV-2 genome (FASTA format)
handle = Entrez.efetch(db = 'nucleotide', id = 'NC_001474.2', rettype = 'fasta', retmode = 'text')
record = SeqIO.read(handle, 'fasta')
outputname = '/home/victor/Escritorio/Tesis/RESULTADOS/1. Identificación y selección del virus objetivo/NC_001474_Dengue virus 2.fasta'
SeqIO.write(record,outputname,'fasta')

1