[Datumaro] Mean and std for dataset #1734

Merged
merged 4 commits into from
Jun 22, 2020

Conversation

zhiltsov-max
Contributor

Motivation and context

When training computer vision models, the per-channel image mean and standard deviation are often used for normalization to improve model quality. This PR adds a CLI command that computes them for a dataset.

datum import <path>
datum stats
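
For illustration only, the statistic itself can be sketched in Python with NumPy as follows. This is not the code added by this PR: the function name `channel_mean_std` is hypothetical, and the sketch assumes images arrive as H x W x C arrays that share the same channel count and that the population (not sample) standard deviation is wanted.

```python
import numpy as np


def channel_mean_std(images):
    """Return per-channel (mean, std) over all pixels of all images.

    `images` is any iterable of H x W x C arrays; image sizes may differ
    as long as the channel count C is the same. (Hypothetical helper,
    not part of Datumaro.)
    """
    count = 0          # total number of pixels seen so far
    total = None       # per-channel sum of pixel values
    total_sq = None    # per-channel sum of squared pixel values
    for image in images:
        pixels = np.asarray(image, dtype=np.float64)
        pixels = pixels.reshape(-1, pixels.shape[-1])  # (H*W, C)
        if total is None:
            total = np.zeros(pixels.shape[1])
            total_sq = np.zeros(pixels.shape[1])
        count += pixels.shape[0]
        total += pixels.sum(axis=0)
        total_sq += (pixels ** 2).sum(axis=0)
    if count == 0:
        raise ValueError("no images given")
    mean = total / count
    # Population variance E[x^2] - E[x]^2, clipped to guard against
    # small negative values caused by floating-point rounding.
    var = np.maximum(total_sq / count - mean ** 2, 0.0)
    return mean, np.sqrt(var)
```

Accumulating sums and squared sums keeps memory use constant, so the dataset never has to be loaded at once; for very large datasets a numerically more stable update (for example Welford's method) may be preferable.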

How has this been tested?

Unit test

Checklist

License

  • I submit my code changes under the same MIT License that covers the project.
    Feel free to contact the maintainers if that's a concern.
  • I have updated the license header for each file (see an example below)
# Copyright (C) 2020 Intel Corporation
#
# SPDX-License-Identifier: MIT


coveralls commented Jun 17, 2020

Pull Request Test Coverage Report for Build 5871

  • 35 of 56 (62.5%) changed or added relevant lines in 2 files are covered.
  • 6 unchanged lines in 2 files lost coverage.
  • Overall coverage decreased (-0.05%) to 65.769%

Changes Missing Coverage                             Covered Lines   Changed/Added Lines   %
datumaro/datumaro/components/operations.py           32              34                    94.12%
datumaro/datumaro/cli/contexts/project/__init__.py   3               22                    13.64%

Files with Coverage Reduction                New Missed Lines   %
datumaro/datumaro/components/extractor.py    1                  91.01%
cvat/apps/engine/media_extractors.py         5                  75.88%

Totals Coverage Status
Change from base Build 5855: -0.05%
Covered Lines: 10945
Relevant Lines: 16237

💛 - Coveralls

nmanovic self-requested a review as a code owner June 22, 2020 13:51
Contributor

nmanovic left a comment

LGTM

nmanovic added this to Reviewer approved in Dataset framework (Datumaro) via automation Jun 22, 2020
nmanovic merged commit 12f7855 into develop Jun 22, 2020
Dataset framework (Datumaro) automation moved this from Reviewer approved to Done Jun 22, 2020
nmanovic deleted the zm/dm-statistics branch June 22, 2020 13:52
frndmg pushed a commit to signatrix/cvat that referenced this pull request Aug 5, 2020
* Add meanstd

* Add stats cli

* Update changelog

Co-authored-by: Nikita Manovich <40690625+nmanovic@users.noreply.github.com>