Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some puzzles about dataset processing #11

Open
yvfengZhong opened this issue Sep 14, 2021 · 5 comments
Open

Some puzzles about dataset processing #11

yvfengZhong opened this issue Sep 14, 2021 · 5 comments

Comments

@yvfengZhong
Copy link

I once encountered a problem when preprocessing the scannetv2 dataset. I tried to solve this problem, but I'm not sure whether my solution is reasonable. I'd like to discuss it with you.

When I execute the command python batch_ load_ scannet_ data.py, an error occurred.

p1

I read the file batch_load_scannet_data.py and found that the function of the file is to select the corresponding folder in the directory data/scannet/scans/ for data processing according to the directory name in the file data/scannet/meta_data/scannetv2.txt and save the generated results in the directory data/scannet/scannet_data/.

p2

I don't know if my understanding is correct.

Then, I read the file data/scannet/meta_data/scannetv2.txt and found that it contains 806 scenes. Directory data/scannet/scans/ contains only 706 scenes for train and val. I think the problem is that there is a mismatch between the two.

So I copied all the files in directory data/scannet/scans_test/ to directory data/scannet/scans/. At this point, executing the command python batch_load_scannet_data.py can work normally.

I want to know, am I right in this way? Looking forward to your reply.

@yvfengZhong
Copy link
Author

I have noticed that your batch_load_scannet_data.py is modified from votenet, so I have checked votenet and found that it did use scannet_train.txt.

Uploading 屏幕快照 2021-09-20 下午11.03.54.png…

I have looked at many other codes that also use scannet_train.txt in batch_load_scannet_data.py. I think you may have made a mistake for some reason.

In addition, I found that your OBJ_CLASS_IDS is different from others. Why?

@ga92xug
Copy link

ga92xug commented May 27, 2022

Had this issue as well. It can be ignored for now. The behavior might change later. OBJ_CLASS_IDS is intentionally different.

@Coobiw
Copy link

Coobiw commented Dec 7, 2022

Actually, you can put all the scans including both train-val and test to the data-dir because the code will distinguish them automatically. If you process test scans, the code will just sampling 50k points and not process the label info because of the lack of manual annotations.
As for the category of classes, this dataset just take part of the ScanNetv2 classes into consideration.

@cactusycy
Copy link

So how should I deal with this issue? I'm right now so puzzled with this problem

@yvfengZhong
Copy link
Author

You can replace the "scannetv2.txt" with "scannetv2_trainval.txt".

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants