Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset checks I #413

Merged
merged 13 commits into from
Feb 3, 2020
Merged

Dataset checks I #413

merged 13 commits into from
Feb 3, 2020

Conversation

tylerganter
Copy link
Contributor

@tylerganter tylerganter commented Jan 28, 2020

Adding functionality to validate labels and datasets for common issues that can be automated away. This functionality includes:

  • ensuring the data filename is stored in the labels file
  • fixing underscores versus spaces and conforming capitalization to a provided schema (syntax checker)

Start with dataset level functions, which have been added to eta/core/datasets/standardize.py

Then refer to eta/core/image.py and eta/core/video.py for support at the labels level (only for syntax checker)

@tylerganter tylerganter added the feature Work on a feature request label Jan 28, 2020
@tylerganter tylerganter requested a review from a team January 28, 2020 20:21
@tylerganter tylerganter self-assigned this Jan 28, 2020
@tylerganter tylerganter marked this pull request as ready for review January 29, 2020 17:40
@tylerganter tylerganter changed the title WIP: Dataset checks Dataset checks I Jan 29, 2020
@tylerganter tylerganter changed the base branch from develop to dataset-cleaning January 31, 2020 19:05
@tylerganter tylerganter merged commit aac57a3 into dataset-cleaning Feb 3, 2020
@tylerganter tylerganter deleted the dataset-checks branch February 3, 2020 18:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature Work on a feature request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant