At this line, why do we combine the negative dataset (assumed negatives, retrieved from UniProt) with the inactive dataset from MIC? The combined dataset will become highly imbalanced, and those assumed negative sequences from UniProt are easier to predict as negative.
Hi, in the script mic_classifier_training_prodecure.ipynb, there are about 3000 samples in mic_x_train and about 10000 samples in negatives_x_train.
But why does the training output say 'Train on 20457 samples, validate on 1312 samples'?
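To make the mismatch concrete, here is a minimal sketch of the count I would expect if the two arrays were simply concatenated. The helper describe_training_set is hypothetical (it is not in the notebook), and the placeholder arrays only mimic the approximate sizes quoted above; the feature width of 10 is an arbitrary assumption.

```python
import numpy as np


def describe_training_set(mic_x_train, negatives_x_train):
    """Count the samples in each source array and report the class balance.

    Hypothetical helper for illustration; not part of the notebook.
    """
    n_mic = len(mic_x_train)
    n_neg = len(negatives_x_train)
    total = n_mic + n_neg
    return {
        "mic": n_mic,
        "negatives": n_neg,
        "total": total,
        "negative_fraction": n_neg / total,
    }


# Placeholder arrays with the approximate sizes from the question.
# The second dimension (10) is an assumption, not the real feature size.
mic_x_train = np.zeros((3000, 10))
negatives_x_train = np.zeros((10000, 10))

summary = describe_training_set(mic_x_train, negatives_x_train)
print(summary)
```

With these placeholder sizes the total is 13000, not the 20457 reported by the training output, so I assume some other step (for example additional data or resampling) changes the number of training samples.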
Thank you for your time