# Generate Cross-Validation with Spatial Leave-Half-Stand-Out (SLOSO)

In [1]:
from MuseoToolBox.vectorTools import crossValidationSelection

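inVector = '../data/train.gpkg'  # vector file holding the samples
levelField = 'Class'  # field storing the class label
standField = 'uniqueFID'  # field identifying the stand (polygon) of each sample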

## Select a sampling method
In crossValidationSelection, the samplingMethods class provides one function per sampling method.
Here we choose standCV, which generates a cross-validation by stand (that is, by group or polygon).
Set SLOO=True to keep just one stand/polygon for validation, or leave SLOO=False to use 50% of the stands for training and 50% for validation.
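
To make the 50/50 stand split concrete, here is a minimal standard-library sketch. It is not MuseoToolBox's implementation: the function `half_stand_split` and the toy data are hypothetical, and it assumes the split is drawn separately within each class. The point it illustrates is that all samples of a given stand end up on the same side of the split.

```python
import random
from collections import defaultdict


def half_stand_split(classes, stands, seed=None):
    """Hypothetical helper: per class, send half of the stands to training."""
    rng = random.Random(seed)

    # Collect the distinct stands of each class, keeping first-seen order.
    stands_by_class = defaultdict(list)
    for cls, stand in zip(classes, stands):
        if stand not in stands_by_class[cls]:
            stands_by_class[cls].append(stand)

    # Randomly pick half of the stands (rounded down) of each class for training.
    train_stands = set()
    for cls, cls_stands in stands_by_class.items():
        shuffled = cls_stands[:]
        rng.shuffle(shuffled)
        for stand in shuffled[: len(shuffled) // 2]:
            train_stands.add((cls, stand))

    # A sample follows its stand: training if the stand was picked, else validation.
    pairs = list(zip(classes, stands))
    train = [i for i, pair in enumerate(pairs) if pair in train_stands]
    valid = [i for i, pair in enumerate(pairs) if pair not in train_stands]
    return train, valid


# Toy data (assumed for illustration): 8 samples, 2 classes, 2 stands per class.
classes = [1, 1, 1, 1, 2, 2, 2, 2]
stands = [10, 10, 11, 11, 20, 20, 21, 21]
train, valid = half_stand_split(classes, stands, seed=12)
print(train, valid)
```

Because whole stands are assigned to one side, samples from the same polygon never appear in both training and validation, which is what limits spatial leakage between the two sets.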

In [2]:
samplingMethod = crossValidationSelection.samplingMethods.standCV(
    standField, SLOO=False, maxIter=3, seed=12)
crossValidation = crossValidationSelection.sampleSelection(
    inVector, levelField, samplingMethod)

The crossValidation object is now ready to be computed. You have two options:
### Generate the Cross-Validation for Scikit-Learn

In [3]:
CV = crossValidation.getCrossValidationForScikitLearn()
# Print the training indices, then the validation indices, of each iteration
for tr,vl in CV:
    print(tr)
    print(vl)

[ 0  1  7  4  5 14 10 11]
[ 2  3  8  6  9 15 16 12 13]
[ 1  3  7  5  9 15 12 13]
[ 0  2  8  4  6 14 16 10 11]
[ 1  2  8  5  6 15 10 11]
[ 0  3  7  4  9 14 16 12 13]
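
As a quick sanity check, the index pairs printed above can be verified to be disjoint and to cover every sample in each iteration. The sketch below copies the printed indices into plain lists; `check_folds` is a hypothetical helper, not part of MuseoToolBox.

```python
# Index pairs copied from the output printed above: (training, validation).
folds = [
    ([0, 1, 7, 4, 5, 14, 10, 11], [2, 3, 8, 6, 9, 15, 16, 12, 13]),
    ([1, 3, 7, 5, 9, 15, 12, 13], [0, 2, 8, 4, 6, 14, 16, 10, 11]),
    ([1, 2, 8, 5, 6, 15, 10, 11], [0, 3, 7, 4, 9, 14, 16, 12, 13]),
]


def check_folds(folds):
    """Hypothetical helper: check each fold and return the number of samples."""
    n_samples = len(folds[0][0]) + len(folds[0][1])
    for train, valid in folds:
        # No sample may be used for both training and validation.
        assert not set(train) & set(valid)
        # Together, the two sets must contain every sample exactly once.
        assert sorted(train + valid) == list(range(n_samples))
    return n_samples


n_samples = check_folds(folds)
print(n_samples)  # 17
```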


### Save the Cross-Validation in as many files as there are training/validation iterations

Because the cross-validation is generated on demand, you have to reinitialize the process before using it again. Make sure you have defined a seed so that you get exactly the same CV.
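
The following sketch illustrates why on-demand generation calls for both a reinitialization and a seed. It assumes the cross-validation behaves like a Python generator, which is an assumption made for illustration and not a description of MuseoToolBox's internals; `make_splits` is a hypothetical stand-in.

```python
import random


def make_splits(n_samples, max_iter, seed):
    """Hypothetical stand-in: lazily yield (train, valid) index lists."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    half = n_samples // 2
    for _ in range(max_iter):
        shuffled = indices[:]
        rng.shuffle(shuffled)
        yield shuffled[:half], shuffled[half:]


cv = make_splits(6, 3, seed=12)
first_pass = list(cv)   # consumes the generator
second_pass = list(cv)  # nothing left: the generator is exhausted

# Re-creating the generator with the same seed reproduces the same splits.
rebuilt = list(make_splits(6, 3, seed=12))
print(len(first_pass), len(second_pass), rebuilt == first_pass)
```

Without a fixed seed, the rebuilt splits would generally differ from the first pass, so indices used for one step and files saved in another would no longer describe the same cross-validation.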

In [4]:
CV = crossValidation.saveVectorFiles('../data/cv.sqlite')
for tr,vl in CV:
    print(tr,vl)

../data/cv_train_0.sqlite ../data/cv_valid_0.sqlite
../data/cv_train_1.sqlite ../data/cv_valid_1.sqlite
../data/cv_train_2.sqlite ../data/cv_valid_2.sqlite
