Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

commit.labels and refactoring inconsistencies #29

Open
shehan opened this issue Jan 23, 2022 · 1 comment
Open

commit.labels and refactoring inconsistencies #29

shehan opened this issue Jan 23, 2022 · 1 comment

Comments

@shehan
Copy link

shehan commented Jan 23, 2022

In the commit collection there are some records that do not have a refactoring_codebased label even though there is a record in the refactoring collection for that specific commit.

Example:
Query the commit and refactoring collections for the Commit.id = "5e84c0b6770ce37ee9d708b3"
There are 21 records in the refactoring collection
However, in the commit collection, the labels field is empty. Shouldn't the labels field be populated?

A few more commit id's exhibiting this phenomenon:
5e84c208770ce37ee9d771dd
5e84c0f9770ce37ee9d71a89

I am using Smartshark version 2.1 (full db)

@atrautsch
Copy link
Contributor

You are right, the labels are missing and should be there. There was an error with labelSHARK which writes the labels which I missed in the collection phase. Project maven is the only one that is impacted by this.

If you like, you can import the labels separately.

wget http://141.5.100.155/commit_labels_maven.json.gz
gunzip commit_labels_maven.json.gz
mongoimport --db=smartshark_2_1 --collection=commit --mode=merge --upsertFields=_id commit_labels_maven.json

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants