dataset format for training new model #1

@alexander-ratzan

Hi there, I'm trying to train a krakencoder model from scratch on a few HCP-YA parcellations that were not included in the original model. I have all the data stored in TSV files as in /example_data and have organized them into /example_data/mydata_4S456Parcels_FCcorrhpf.mat etc. I'm currently matching the format of /example_data/exampledata_10subj_fs86_inputs.mat, but it looks like this is not the format expected for the training input data. To check, I tried training a model on /example_data/exampledata_10subj_fs86_inputs.mat itself, and it produced the same error.

Command:  run_training.py --subjectfile example_data/subject_splits_993subj_683train_79val_196test_retestInTest.mat --inputdata [FCcorr_fs86_hpf]@FC=example_data/exampledata_10subj_fs86_inputs.mat [SCifod2act_fs86_volnorm]@SC=example_data/exampledata_10subj_fs86_inputs.mat --dataflavors SCifod2act FCcorr --datagroups SCFC --latentsize 128 --latentunit --transformation pca256 --dropout .5 --losstype correye+enceye.w10+neidist+encdist.w10+mse.w1000+latentsimloss.w10000 --trainvalsplitfrac .8 --valsplitfrac .1 --outputprefix mykraken_example --epochs 10 --checkpointepochsevery 1 --displayepochs 2

Loading subject splits from example_data/subject_splits_993subj_683train_79val_196test_retestInTest.mat
	993 subjects
	683 subjidx_train
	79 subjidx_val
	196 subjidx_test
input_file: example_data/exampledata_10subj_fs86_inputs.mat
groupname: FC
inputfield: None
connfield: None
None of the following fields were found in the input file example_data/exampledata_10subj_fs86_inputs.mat: ['data', 'encoded', 'FC', 'SC', 'C', 'volnorm']
Traceback (most recent call last):
  File "/scratch/*/neuroinformatics/krakencoder/run_training.py", line 1185, in <module>
    run_training_command(sys.argv[1:])
  File "/scratch/*/neuroinformatics/krakencoder/run_training.py", line 516, in run_training_command
    conndata_alltypes[xc]=load_input_data(os.path.expanduser(input_file),group=groupname)
                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/scratch/*/neuroinformatics/krakencoder/krakencoder/data.py", line 1060, in load_input_data
    raise Exception("Input type not found")
Exception: Input type not found
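
For anyone debugging the same error, here is a minimal sketch for checking which top-level fields a .mat file actually contains and whether any of them match the names listed in the error message. The helper names `list_mat_fields` and `matching_fields` are hypothetical (not part of krakencoder), and exact name matching is an assumption: the log above does not show how the loader compares field names.

```python
# Field names copied from the error message above. How the loader
# chooses among them is not shown in this issue (assumption: exact match).
EXPECTED_FIELDS = ['data', 'encoded', 'FC', 'SC', 'C', 'volnorm']


def list_mat_fields(path):
    """Return the top-level variable names stored in a MATLAB .mat file."""
    # Imported lazily so matching_fields() works without SciPy installed.
    from scipy.io import whosmat
    # whosmat reads only the file header: one (name, shape, class) per variable.
    # It does not support v7.3 (HDF5) files; those would need h5py instead.
    return [name for name, shape, cls in whosmat(path)]


def matching_fields(available, expected=EXPECTED_FIELDS):
    """Return the expected names that appear in `available`, in expected order."""
    present = set(available)
    return [name for name in expected if name in present]


# Example usage (path taken from the command above):
# fields = list_mat_fields("example_data/exampledata_10subj_fs86_inputs.mat")
# print("top-level fields:", fields)
# print("expected fields present:", matching_fields(fields))
```

If `matching_fields` returns an empty list for a file, that would be consistent with the "None of the following fields were found" message, though it does not by itself say what structure the training loader expects.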

Could you please describe the expected data format or, even better, add an example training data file so that I can match my inputs to that structure? Thanks for the help and great package!
