Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate Validation Set #7

Closed
JoshVarty opened this issue Apr 10, 2019 · 4 comments
Closed

Investigate Validation Set #7

JoshVarty opened this issue Apr 10, 2019 · 4 comments

Comments

@JoshVarty
Copy link
Owner

Currently our train set overperforms the test set. Why is this? How can we improve?

@JoshVarty
Copy link
Owner Author

Train

  • 4,970 clips from Freesound
  • 19,815 clips from Flickr

Test

  • 1,120 from Freesound

This suggests to me that we should:

  1. Choose validation examples from Freesound
  2. Ensure the classes are balanced? What does the balance look like anyway?

@JoshVarty
Copy link
Owner Author

So we've created it, now we need to test the validation scores against the test scores. Are they close? Does increasing validation score also increase test score?

@JoshVarty
Copy link
Owner Author

Taken from #25

Curated Training Set

image

Noisy Training Set

image

Almost all clips are 15 seconds long.

Test Set

image

So it looks to me like the test set is taken from roughly the same distribution as the curated training set.

@JoshVarty
Copy link
Owner Author

I'm feeling pretty good about the results of our validation lwlwrap compared to our test lwlwrap. Whenever we see an increase in validation score, we get an increase in LB score. See #26

The gap between our validation score and test score may be due to:

  1. The test clips are slightly longer. (Fewer clips are less than 2 seconds long)
  2. The leaderboard score may not be calculated across all results, perhaps to make it harder to game?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant