createDataPartition creates approximate splits #284
Comments
Yes, if you turn off the default stratified sampling by setting the number of
Thank you, Vadim.
I, too, was just bitten by this. I understand the rationale for the splitting procedure to respect the structure in the outcome variable, but it's counterintuitive to a new user that
Hi,
createDataPartition creates correct splits for 100%, 80% and 0%, but approximate (inaccurate) splits for 70%, 50% and 10%. I did not test all the values with apply, but I am sure that for 70% it should return 70 instead of 72, unless that is a feature and not a bug.
Is there a way to create accurate splits?
Cheers
Tobias
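
For anyone who needs exact sizes and does not need stratification: a minimal sketch using only base R, not createDataPartition itself. The vector `y`, the fraction `p` and the seed are made-up values for illustration. Because the draw is a plain random sample, it will not try to balance the distribution of the outcome between the two sets.

```r
# Exact-size random split with base R only (no stratification).
set.seed(42)            # arbitrary seed, only for reproducibility
y <- rnorm(100)         # hypothetical outcome vector with 100 observations
p <- 0.7                # desired training fraction

# Fix the training-set size up front, then draw that many row indices.
n_train   <- round(p * length(y))
train_idx <- sample(seq_along(y), size = n_train)
test_idx  <- setdiff(seq_along(y), train_idx)

length(train_idx)       # 70
length(test_idx)        # 30
```

The size is decided once for the whole vector rather than per group, which is why the count comes out exact here.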