The correspondence between the training dataset and the testing dataset #10

tangqideng · 2024-01-04T15:13:01Z

Should we train one model on train1.csv and test it on test1.csv, and then train a separate model on train2.csv and test it on test2.csv?
I ask because train1.csv and test1.csv may follow the same distribution, while train2.csv and test2.csv may follow a different one.
Concept drift is common in time series data, so if train1.csv and test2.csv do not share the same distribution, a model trained on train1.csv will perform significantly worse when tested on test2.csv.
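
One rough way to check this concern before deciding how to pair the files is to compare the distribution of a feature across the splits. The sketch below is only an illustration: it uses synthetic numbers as stand-ins for one numeric column from train1.csv, test1.csv and test2.csv (the real column names and values are not known here), and a hand-written two-sample Kolmogorov-Smirnov statistic rather than any method from this repository.

```python
import random
from bisect import bisect_right


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the empirical CDFs."""
    a_sorted, b_sorted = sorted(a), sorted(b)
    max_gap = 0.0
    # The largest gap between two step functions occurs at one of the sample points.
    for x in a_sorted + b_sorted:
        cdf_a = bisect_right(a_sorted, x) / len(a_sorted)
        cdf_b = bisect_right(b_sorted, x) / len(b_sorted)
        max_gap = max(max_gap, abs(cdf_a - cdf_b))
    return max_gap


rng = random.Random(0)

# Hypothetical stand-ins for one numeric feature from each file (assumed, not real data).
train1 = [rng.gauss(0.0, 1.0) for _ in range(500)]
test1 = [rng.gauss(0.0, 1.0) for _ in range(500)]  # same distribution as train1
test2 = [rng.gauss(3.0, 1.0) for _ in range(500)]  # shifted mean, mimicking drift

same_pair = ks_statistic(train1, test1)
cross_pair = ks_statistic(train1, test2)

print(f"train1 vs test1: KS = {same_pair:.3f}")
print(f"train1 vs test2: KS = {cross_pair:.3f}")
```

A statistic near 0 suggests the two samples look alike, while a value near 1 suggests they differ strongly. If the real train1/test2 comparison comes out much larger than train1/test1, that would support training and testing each pair separately.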
