## Work with PATRIC genomes

Mackinac provides functions for working with PATRIC genomes. You can use these genomes to reconstruct models with the PATRIC web service, the ModelSEED web service, or template models.

PATRIC updates genome annotations on a regular schedule, so the data returned by the genome functions can differ depending on when you retrieve it. Check the [PATRIC News](https://docs.patricbrc.org/news/index.html) for details on the changes in each PATRIC release.

In [1]:
import mackinac

Get summary information for a PATRIC genome with `get_genome_summary()`. You can [search](https://www.patricbrc.org/view/DataType/Genomes) the PATRIC website for available organisms. The genome ID [226186.12](https://www.patricbrc.org/view/Genome/226186.12) is for the Gram-negative bacterium Bacteroides thetaiotaomicron VPI-5482. Note that the information available in the summary can differ between genomes, depending on the source of the genome.

In [2]:
mackinac.get_genome_summary('226186.12')

{'_version_': 1585509101369032700,
 'assembly_accession': 'GCA_000011065.1',
 'bioproject_accession': 'PRJNA399',
 'biosample_accession': 'SAMN02604314',
 'brc1_cds': 0,
 'cell_shape': 'Rod',
 'chromosomes': 1,
 'class': 'Bacteroidia',
 'comments': ['Bacteroides thetaiotaomicron strain VPI-5482. This is the type strain for this organism and was isolated from the feces of a healthy adult.'],
 'common_name': 'Bacteroides_thetaiotaomicron_VPI-5482',
 'completion_date': '2003-03-29T00:00:00Z',
 'contigs': 0,
 'date_inserted': '2014-12-08T22:10:24.729Z',
 'date_modified': '2015-03-16T03:17:09.594Z',
 'disease': ['Peritonitis'],
 'document_type': 'genome',
 'family': 'Bacteroidaceae',
 'gc_content': 42.9,
 'genbank_accessions': 'AE015928,AY171301',
 'genome_id': '226186.12',
 'genome_length': 6293399,
 'genome_name': 'Bacteroides thetaiotaomicron VPI-5482',
 'genome_status': 'Complete',
 'genus': 'Bacteroides',
 'gram_stain': '-',
 'habitat': 'Host-associated',
 'isolation_comments': 'isolat
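
Because the fields in a summary can differ between genomes, it may be safer to read them with `dict.get()` than with direct indexing. The following is a minimal sketch, not part of Mackinac: `describe_genome()` is a hypothetical helper, the `summary` dictionary is a hand-written stand-in using a few values from the output above, and `host_name` is an assumed field name chosen only to illustrate a missing key.

```python
# Stand-in for the dictionary returned by mackinac.get_genome_summary().
# The values are copied from the output shown above.
summary = {
    'genome_id': '226186.12',
    'genome_name': 'Bacteroides thetaiotaomicron VPI-5482',
    'genome_status': 'Complete',
    'gc_content': 42.9,
}


def describe_genome(summary, fields):
    """Return the requested fields, using None for any field that is absent.

    Hypothetical helper for illustration; it is not a Mackinac function.
    """
    return {field: summary.get(field) for field in fields}


# 'host_name' is an assumed field name that this stand-in summary lacks.
wanted = ('genome_name', 'genome_status', 'gc_content', 'host_name')
description = describe_genome(summary, wanted)

# Collect the fields that were not present in this summary.
missing = [field for field, value in description.items() if value is None]

print(description)
print('Missing fields:', missing)
```

With a real summary, you would pass the dictionary returned by `get_genome_summary()` in place of the stand-in.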

Get the features for an annotated genome with `get_genome_features()`. Both PATRIC and RefSeq annotations are available. Note that the number of features and details of the features can change as PATRIC updates the annotations.

In [3]:
features = mackinac.get_genome_features('226186.12', annotation='PATRIC')
len(features)

4965

The returned list has detailed information about each feature including the type and DNA sequence. If the feature is a coding sequence, the returned data also includes the amino acid sequence.
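
Since only coding sequences carry an amino acid sequence, one way to pick them out of the list is to test for the `aa_sequence` key. The sketch below is an illustration under stated assumptions: it runs on a small hand-written list rather than real PATRIC data, `coding_features()` is a hypothetical helper, and the `note` key on the second entry is made up to represent a feature that has no amino acid sequence.

```python
# Hand-written stand-in for the list returned by get_genome_features().
# Only 'aa_length' and 'aa_sequence' are key names taken from the real output;
# the sequences are shortened and the 'note' key is made up for illustration.
sample_features = [
    {'aa_length': 5, 'aa_sequence': 'MKIEK'},
    {'note': 'placeholder for a feature without an amino acid sequence'},
    {'aa_length': 3, 'aa_sequence': 'MSQ'},
]


def coding_features(features):
    """Return only the features that include a non-empty amino acid sequence.

    Hypothetical helper for illustration; it is not a Mackinac function.
    """
    return [feature for feature in features if feature.get('aa_sequence')]


coding = coding_features(sample_features)

# Measure each protein directly from its sequence string.
lengths = [len(feature['aa_sequence']) for feature in coding]

print(len(coding), 'of', len(sample_features), 'features have an amino acid sequence')
print('Protein lengths:', lengths)
```

With real data, you would pass the `features` list from the cell above to the helper instead of `sample_features`.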

In [4]:
features[100]

{'aa_length': 1052,
 'aa_sequence': 'MKIEKFYLFLLACFVAIGAYSQDGQQKMTGDEKSQQQSDAKVKITGQVFDESGEGIPGANVTLKSNPTSGTVTDLDGKFILMASPQKDVLVVSFIGYNTQEFPLKGKTNVTIQLSQNVNELDAVEIVAFGTQKKESVIGSITTLSPKSLRVPSSNMTTALAGQVAGIISYQTSGEPGADDASFFVRGIASFGFNTSPLILIDNIESTSTDLGRLNPDDIESFSIMKDAMATALYGSRGANGVVLVKTKEGERGKTKFDVRIEGSNSRPTSNIELADPVTYMKLHNEAILTRDPSAPVMYSDDKIDRTVPGSGSIIYPTNDWRRQLMKNSTWNGRANMSISGGGNSATYYVSLRYTKDQGLLNVDGKNNFNNNINLQTYQMRANVNINVTKTTQVRVNLSGIFDTYEGPIYSGSDIYKMVMKSNPVLFPAVYPTDEQHKYIKHILFGNSDDGSYLNPYAEMVKGYKEYENTTLLATLGVTQDLNFITKGLKFEGFFNVSRKSYYGQTRQYKPYYYALSSYDFMTEKYSIENINPDSGTEYLDFSPGDKTVNNVMTIETRTSYNQTFGDHSVGGLIVTQYIDSKNPNYKTLQESLPSRNMGVSGRFTYAYSDRYFTEFNFGYNASERFDKKHRWGFFPSVGGGWMISNEPFFQPLSSKITKLKLRASYGLVGNDKIGRVDERFLYLSNVNMNAGGASFGYENKYSRPGVNVSRYANPAIGWEKSRKANFALEASFYGFDLIAEYFTEHRTDILQKRASIPSVMGYQADVYANIGETKGHGVDLDLKYQKNLNKNAFLIVRGNLTYAHSEYLKYEDNTYDKEWWKYKIGYSPNQKWGYIAEGLFIDDAEVANSPVQFGDYKAGDIKYRDMNGDGVINSLDQVPIGHPTSPEINYGFGSTFSYKGFDINFQFHGSAQSSFWIDYDKMSPFFKDSKMSQKTNNQLVKFIANSYWSESNRNRYATWPRL