Skip to content

miguelperezenciso/dna2image

Repository files navigation

Computer generation of fruit shapes from DNA sequence

Code to generate images from dna sequence

Citation:

M. Pérez-Enciso, C. Pons, A. Graell, A.J. Monforte, L.M. Zingaretti. Computer generation of fruit shapes from DNA sequence. Biorxiv. submitted

mperezenciso@gmail.com

Summary

The generation of realistic plant and animal images from marker information could be a main contribution of artificial intelligence to genetics and breeding. Since morphological traits are highly variable and highly heritable, this must be possible. However, a suitable algorithm has not been proposed yet. This paper is a proof of concept demonstrating the feasibility of this proposal using ‘decoders’, a class of deep learning architecture. We apply it to Cucurbitaceae, the family harboring the largest variability in fruit shape in the plant kingdom, and to tomato. We generate Cucurbitaceae shapes assuming a hypothetical, but plausible, evolutive path along observed fruit shapes. In tomato, we analyze 129 crosses for which image and genotype data were available. In both instances, a simple decoder was able to recover expected shapes with large accuracy. For the tomato pedigree, we also show that the algorithm can be trained to generate offspring images from their parents’ shapes, fully bypassing genotype information.

Contents

Jupyter notebooks

Folders

data: contains tomato contours, pedigree and genotype data

images: contains cucurbita images

Warning: The img2img code requires file TraditomImgset.pkl (~1Gb), which is available from dropbox link https://www.dropbox.com/s/hvmt1a2qursameq/TraditomImgset.pkl?dl=0

Some relevant sites / documentation used

Relevant image libraries are

Required non standard libraries

About

Code to generate images from dna sequence

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published