IBM/autotransformer

Auto-encoding transformer

  1. mnistdatadouble.py creates the dataset for the Y-net.
  2. gpuautotransformer.py contains the code for the image autoencoder.
  3. imageprep.py contains the code for the audio autotransformer.
  4. The main code follows the tutorial at https://medium.com/mlearning-ai/vision-transformers-from-scratch-pytorch-a-step-by-step-guide-96c3313c2e0c
  5. The AudioMNIST dataset is available at https://www.kaggle.com/datasets/sripaadsrinivasan/audio-mnist
  6. The MFCC code is adapted from https://github.com/aniruddhapal211316/spoken_digit_recognition/blob/main/dataset.py
  7. Ynet100.py and ynet100a.py handle 100 pairings of one vision input and one audio input, respectively, each with 100 inputs from the cross modality.
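The tutorial referenced in item 4 builds a vision transformer from scratch; its first step is to split each image into a grid of flattened patches that become the transformer's input tokens. Below is a minimal numpy sketch of that patchify step for MNIST-sized images. The function and parameter names are illustrative and not taken from this repository's code.

```python
import numpy as np

def patchify(images, n_patches=7):
    """Split square images (n, c, h, w) into n_patches x n_patches
    flattened patches, the token layout used in from-scratch ViT guides."""
    n, c, h, w = images.shape
    assert h == w, "patchify expects square images"
    p = h // n_patches                                # patch side length
    patches = images.reshape(n, c, n_patches, p, n_patches, p)
    patches = patches.transpose(0, 2, 4, 1, 3, 5)     # (n, ph, pw, c, p, p)
    return patches.reshape(n, n_patches * n_patches, c * p * p)

imgs = np.random.rand(2, 1, 28, 28)   # a batch of two MNIST-sized images
tokens = patchify(imgs)
print(tokens.shape)                   # (2, 49, 16): 49 tokens of length 16
```

Each 28x28 image becomes 49 tokens of 16 values; a learned linear projection and positional embeddings are then applied before the transformer encoder.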
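The MFCC code adapted in item 6 follows the standard recipe: pre-emphasis, framing and windowing, power spectrum, mel filterbank, log, and a DCT to decorrelate. The following self-contained numpy sketch shows that recipe end to end; all parameter values (sample rate, FFT size, filter counts) are illustrative defaults, not the settings used in this repository.

```python
import numpy as np

def mfcc(signal, sr=8000, n_fft=512, n_mels=26, n_ceps=13):
    """Textbook MFCC extraction; parameter values are illustrative."""
    # Pre-emphasis boosts high frequencies
    emph = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])
    # Frame into 25 ms windows with 10 ms hops, apply a Hamming window
    flen, hop = int(0.025 * sr), int(0.010 * sr)
    n_frames = 1 + (len(emph) - flen) // hop
    idx = np.arange(flen)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = emph[idx] * np.hamming(flen)
    # Power spectrum of each frame
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    # Triangular mel filterbank
    mel_pts = np.linspace(0, 2595 * np.log10(1 + (sr / 2) / 700), n_mels + 2)
    hz_pts = 700 * (10 ** (mel_pts / 2595) - 1)
    bins = np.floor((n_fft + 1) * hz_pts / sr).astype(int)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        lo, cen, hi = bins[m - 1], bins[m], bins[m + 1]
        fbank[m - 1, lo:cen] = (np.arange(lo, cen) - lo) / max(cen - lo, 1)
        fbank[m - 1, cen:hi] = (hi - np.arange(cen, hi)) / max(hi - cen, 1)
    # Log filterbank energies, then a DCT-II to decorrelate
    logfb = np.log(power @ fbank.T + 1e-10)
    basis = np.cos(np.pi * np.arange(n_ceps)[:, None]
                   * (2 * np.arange(n_mels) + 1)[None, :] / (2 * n_mels))
    return logfb @ basis.T            # shape: (n_frames, n_ceps)

tone = np.sin(2 * np.pi * 440 * np.arange(8000) / 8000)  # 1 s test tone
print(mfcc(tone).shape)               # (98, 13)
```

The resulting (frames x coefficients) matrix is the kind of fixed-size spectral feature an audio autotransformer can consume as input.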