Computer generation of fruit shapes from DNA sequence

Code to generate images from DNA sequence.

Citation:

M. Pérez-Enciso, C. Pons, A. Graell, A.J. Monforte, L.M. Zingaretti. Computer generation of fruit shapes from DNA sequence. bioRxiv, submitted.

mperezenciso@gmail.com

Summary

The generation of realistic plant and animal images from marker information could be a major contribution of artificial intelligence to genetics and breeding. Since morphological traits are highly variable and highly heritable, this should be possible. However, no suitable algorithm has been proposed so far. This paper is a proof of concept demonstrating the feasibility of the approach using ‘decoders’, a class of deep learning architecture. We apply it to Cucurbitaceae, the family harboring the largest variability in fruit shape in the plant kingdom, and to tomato. We generate Cucurbitaceae shapes assuming a hypothetical, but plausible, evolutionary path along observed fruit shapes. In tomato, we analyze 129 crosses for which image and genotype data were available. In both cases, a simple decoder was able to recover the expected shapes with high accuracy. For the tomato pedigree, we also show that the algorithm can be trained to generate offspring images from their parents’ shapes, fully bypassing genotype information.
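The decoder idea described above can be illustrated with a minimal sketch: a vector of marker genotypes goes in, and a small image comes out. This is not the architecture used in the notebooks. The layer sizes, the 0/1/2 genotype coding, the helper names `init_decoder` and `decode`, and the random (untrained) weights are all assumptions made for illustration.

```python
import numpy as np


def init_decoder(n_markers, n_hidden, img_shape, seed=0):
    """Create random weights for a two-layer decoder (hypothetical sizes)."""
    rng = np.random.default_rng(seed)
    n_pixels = img_shape[0] * img_shape[1]
    return {
        "W1": rng.normal(0.0, 0.1, (n_markers, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_pixels)),
        "b2": np.zeros(n_pixels),
        "img_shape": tuple(img_shape),
    }


def decode(genotypes, params):
    """Map a (n_samples, n_markers) genotype matrix to (n_samples, H, W) images."""
    hidden = np.maximum(0.0, genotypes @ params["W1"] + params["b1"])  # ReLU layer
    logits = hidden @ params["W2"] + params["b2"]
    pixels = 1.0 / (1.0 + np.exp(-logits))  # sigmoid keeps pixel values in [0, 1]
    return pixels.reshape((-1,) + params["img_shape"])


# Toy input: 5 individuals, 100 markers coded 0/1/2 (assumed coding)
rng = np.random.default_rng(1)
genotypes = rng.integers(0, 3, size=(5, 100)).astype(float)

params = init_decoder(n_markers=100, n_hidden=32, img_shape=(28, 28))
images = decode(genotypes, params)
print(images.shape)  # (5, 28, 28)
```

In a real application the weights would be fitted by minimizing a reconstruction loss between generated and observed images. The notebooks listed below contain the models actually used.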

Jupyter notebooks

Generates 2D and 3D ellipses: https://github.com/miguelperezenciso/dna2image/blob/main/dna2img.ellipse.ipynb
Generates cucurbit shapes: https://github.com/miguelperezenciso/dna2image/blob/main/dna2img.cucurbita.ipynb
Generates tomato shapes: https://github.com/miguelperezenciso/dna2image/blob/main/dna2img.tomato.ipynb
Generates 'offspring' ellipses from 'parents' ellipses: https://github.com/miguelperezenciso/dna2image/blob/main/img2img.ipynb
Generates 'offspring' tomato shapes from 'parents' shapes: https://github.com/miguelperezenciso/dna2image/blob/main/img2img.ipynb

Folders

data: contains tomato contours, pedigree and genotype data

images: contains cucurbita images

Warning: The img2img code requires the file TraditomImgset.pkl (~1 GB), which is available from Dropbox: https://www.dropbox.com/s/hvmt1a2qursameq/TraditomImgset.pkl?dl=0
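Once downloaded, the file can presumably be read with Python's standard `pickle` module. The sketch below assumes the file sits in the working directory. The helper name `load_imgset` is hypothetical, and the structure of the stored object is not documented here, so the example only reports its type. Unpickling may also need the libraries that were used to create the file (for example numpy) to be installed.

```python
import pickle
from pathlib import Path


def load_imgset(path="TraditomImgset.pkl"):
    """Load the pickled image set; fail with a clear message if it is missing."""
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(
            f"{path} not found: download it from the Dropbox link first"
        )
    # Only unpickle files from a source you trust: pickle can run arbitrary code.
    with path.open("rb") as fh:
        return pickle.load(fh)


# Runs only if the file has already been downloaded
if Path("TraditomImgset.pkl").is_file():
    imgset = load_imgset()
    print(type(imgset))
```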

Some relevant sites / documentation used

Relevant image libraries are

skimage: basic image processing (https://scikit-image.org/)
opencv: advanced, classical library (https://pypi.org/project/opencv-python/, https://docs.opencv.org/4.x/d6/d00/tutorial_py_root.html)
PIL: basic operations, saving, rotating (https://pillow.readthedocs.io/en/stable/handbook/index.html)
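As a small illustration of the basic operations mentioned for PIL (Pillow), the sketch below draws a filled ellipse, rotates it, and saves it. The image size, the margin, the file name, and the helper `draw_ellipse` are arbitrary choices for this example and are not taken from the notebooks.

```python
import os
import tempfile

from PIL import Image, ImageDraw


def draw_ellipse(width=64, height=32, margin=4):
    """Return a grayscale image with a white filled ellipse on black."""
    img = Image.new("L", (width, height), 0)
    box = (margin, margin, width - margin - 1, height - margin - 1)
    ImageDraw.Draw(img).ellipse(box, fill=255)
    return img


ellipse = draw_ellipse()
# expand=True enlarges the canvas so the rotated shape is not cropped
rotated = ellipse.rotate(90, expand=True)

# Save to a temporary directory and read the file back
with tempfile.TemporaryDirectory() as tmpdir:
    out_path = os.path.join(tmpdir, "ellipse_rot90.png")
    rotated.save(out_path)
    with Image.open(out_path) as reloaded:
        reloaded_size = reloaded.size

print(ellipse.size, rotated.size, reloaded_size)
```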

Required non-standard libraries

pymrt: used to generate 3D images. Use the version downloaded from the web; do NOT use the version available from pip.
procrustes: https://github.com/theochem/procrustes
