Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to extract overlays from images for the U-net mass detector? #22

Open
guyucowboy opened this issue Aug 5, 2017 · 8 comments
Open

Comments

@guyucowboy
Copy link

Hi, julian,
How do you extract overlays from images for the U-net mass detector?
The overlays in "resources\segmenter_traindata\ *_o.png". How to generate those files?
Thank you!

@juliandewit
Copy link
Owner

Hello,
I labeled them by hand.
They are in the resources file.
You don't need this part of the solution to get a good score.
It improves everything just a little.

@guyucowboy
Copy link
Author

Thanks for your reply!

@guyucowboy
Copy link
Author

@juliandewit
Hi,julian.
How do you know or discover the "mass"feature (or other feature ) of nodule detection which improves everything just a little?
How to get this conclusion?
Thanks!

@guyucowboy guyucowboy reopened this Aug 11, 2017
@guyucowboy
Copy link
Author

hi,julian.
Another simple question: when you submit the final_submission.csv to kaggle competition, do you keep as it is? Or change the "cancer" value to 1 if it is bigger than 0.5 and vice versa ?
Thanks!

@juliandewit
Copy link
Owner

Keep it as it is.
You can still submit to Kaggle and see the results.

@juliandewit
Copy link
Owner

I did local cross validation + cross validation against leaderboard.
BOTH needed to show improvements.
I tried around 50 features. Almost none gave consistent improvements.
Mainly due to the outlier-leaderboard but we did not know that at that time.

@guyucowboy
Copy link
Author

Hi, julian. Thank you for your reply!
I am newcomer to Kaggle. I find the public and private leaderboard, what is the outlier-leaderboard?
Thanks again.

@guyucowboy
Copy link
Author

Hi, julian. what are the 50 features? Are they the bottleneck features of the CNN network? Or some of the features are irrelevant with CNN network and are just calculated by some formulas ? Do you use some feature selection algorithms ? Maybe the combination of some features indicated by feature selection algorithm would improve the result even if each feature make a little effect. But I am not sure about that.
Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants