Was any preprocessing/cleaning/labeling of the data done (e.g., discretization or bucketing, tokenization, part-of-speech tagging, SIFT feature extraction, removal of instances, processing of missing values)? If so, please provide a description. If not, you may skip the remainder of the questions in this section. #82

Closed
nocotan opened this issue Jul 8, 2021 · 2 comments · Fixed by #140

Comments

Contributor

nocotan commented Jul 8, 2021

The following question should be answered:

Was any preprocessing/cleaning/labeling of the data done (e.g., discretization or bucketing, tokenization, part-of-speech tagging, SIFT feature extraction, removal of instances, processing of missing values)? If so, please provide a description. If not, you may skip the remainder of the questions in this section.

@nocotan nocotan added documentation Improvements or additions to documentation datasheet labels Jul 8, 2021
@wildsnowman wildsnowman self-assigned this Aug 22, 2021
Collaborator

wildsnowman commented Aug 22, 2021

We extracted CNN features from the images and used them as the input data in our image-based tasks. As a result, the dataset contains these features but not the raw photos, which keeps the images anonymized.
The CNN is the official pre-trained VGG16, and we use the output of the 'fc6' layer, taken before ReLU is applied, as the feature.
We used the Chainer implementation to extract the CNN features. For more information on it, please refer to the reference page:
https://docs.chainer.org/en/v7.8.0/reference/generated/chainer.links.VGG16Layers.html
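
For readers who want to reproduce something similar, here is a minimal sketch of one way to obtain pre-ReLU 'fc6' activations. It is not the script used for this dataset. The helper name `fc6_pre_relu` is hypothetical, and the sketch assumes the model behaves like `chainer.links.VGG16Layers`: an `extract(images, layers=[...])` method that returns a dict of layer outputs, and an `fc6` link that can be called directly. It also assumes that the named 'fc6' output in Chainer already includes ReLU, which is why the linear link is applied to 'pool5' by hand.

```python
def fc6_pre_relu(model, images):
    """Return fc6 activations before ReLU for a list of images.

    Assumption: `model` exposes the same interface as
    chainer.links.VGG16Layers (an `extract` method and an `fc6` link).
    """
    # 'pool5' is the last layer before fc6. extract() takes care of
    # resizing the images and subtracting the mean.
    pool5 = model.extract(images, layers=["pool5"])["pool5"]
    # Apply only the linear part of fc6, so that no ReLU (or dropout)
    # is applied to the returned activations.
    return model.fc6(pool5)
```

With Chainer and Pillow installed, this could be called as `fc6_pre_relu(chainer.links.VGG16Layers(), [PIL.Image.open(path)])`, where `path` is a placeholder for an image file. The result would be a `chainer.Variable` holding one 4096-dimensional vector per image.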

Collaborator

We excluded outfits that contain fewer than four items. Other than that, we did not remove any instances when creating the dataset. However, some data were excluded in individual tasks. For details, please refer to each task description.
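
The outfit-level filter described above can be sketched as follows. This is an illustration only: the record layout (an `outfit_id` and a list of `items`) and the function name `filter_outfits` are hypothetical, not the dataset's actual schema or code.

```python
# From the comment above: outfits with fewer than four items are excluded.
MIN_ITEMS = 4


def filter_outfits(outfits, min_items=MIN_ITEMS):
    """Keep only outfits that have at least `min_items` items."""
    return [o for o in outfits if len(o["items"]) >= min_items]


# Hypothetical records, for illustration only.
outfits = [
    {"outfit_id": 1, "items": ["shirt", "jeans", "shoes"]},
    {"outfit_id": 2, "items": ["shirt", "jeans", "shoes", "hat"]},
    {"outfit_id": 3, "items": ["coat", "dress", "boots", "bag", "scarf"]},
]

kept = filter_outfits(outfits)
print([o["outfit_id"] for o in kept])
```

Any further exclusions made for individual tasks would be applied on top of this filter and are not shown here.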

@wildsnowman wildsnowman reopened this Aug 22, 2021
@wildsnowman wildsnowman linked a pull request Aug 24, 2021 that will close this issue