Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About training and test data split #14

Open
PSYZ1234 opened this issue Jul 3, 2020 · 2 comments
Open

About training and test data split #14

PSYZ1234 opened this issue Jul 3, 2020 · 2 comments

Comments

@PSYZ1234
Copy link

PSYZ1234 commented Jul 3, 2020

Hi, thanks for your code!
I have a question about training and test data split in generate_training_tuples_baseline.py. Line 30, you set p1 = [5735712.768124,620084.402381], p2 = [5735611.299219,620540.270327], p3 = [5735237.358209,620543.094379], and p4 = [5734749.303802,619932.693364]. I think they are the mean value of "northing" and "easting".
I just want to know how to get the four different mean values. And why you choose four mean values not five or more.
Thanks for your help in advance!

@yopi1838
Copy link

yopi1838 commented Apr 8, 2021

Pardon for my lack of understanding but I think that's just her way of splitting the datasets. you can see in the definition of check_in_test function. If the specified northing and easting from the datasets are within the defined points (+- x_width and y_width), the point cloud is included into the test dataset, else it will be added into the train dataset. I think you can add more points if you wanted to filter more of the raw dataset into the test dataset.

@yopi1838
Copy link

yopi1838 commented Apr 8, 2021

Pardon for my lack of understanding but I think that's just her way of splitting the datasets. you can see in the definition of check_in_test function. If the specified northing and easting from the datasets are within the defined points (+- x_width and y_width), the point cloud is included into the test dataset, else it will be added into the train dataset. I think you can add more points if you wanted to filter more of the raw dataset into the test dataset.

image
This was further shown in her paper where she stated that the regions were randomly selected.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants