Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Data & UI] I would like to expose all activity and influence index for all users and repos #297

Closed
1 task done
frank-zsy opened this issue Mar 10, 2022 · 12 comments · Fixed by #301
Closed
1 task done

Comments

@frank-zsy
Copy link
Contributor

Type of Issue

  • Feature Request

Introduction to the issue

I would like to expose all activity and influence index for all users and repos, we may need new diagrams to show the data in hypercrx like trending line charts.

@tyn1998
Copy link
Member

tyn1998 commented Mar 10, 2022

Maybe in Code Tab, we can present the latest data such like current activity & incluence. And if users seek for more detailed info such as trend or history of the indices, they can find in Perceptor Tab.

Just a little idea.

@frank-zsy
Copy link
Contributor Author

I could be done like this but I think maybe users want to see the trending right on the repo's home page, I am not sure which one is better. Maybe we can implement an edition and find some users to choose.

@frank-zsy
Copy link
Contributor Author

frank-zsy commented Mar 12, 2022

I have put the data online for anyone to use. As there are too many repos and users in the history, almost 40 million, I just put the users and repos with recent activity. So the amount is about 1 million users and 1 million repos. The data will update monthly to update the users and repos which have active events in last month.

The users info URL is like:
https://hypertrons-oss.x-lab.info/activity_influence/actors/f/frank-zsy.json , to avoid too many files in one directory, I add the subdirectory before the final file which uses the lower case of the first character of the login which is f for frank-zsy, and l for LinuxSuRen.

The repos info URL is like:
https://hypertrons-oss.x-lab.info/activity_influence/repos/hypertrons/hypertrons-crx.json , this one is as simple as it looks like, the path will be repos/${owner}/${repo}.json.

Right now only activity and influence data are provided for all users and repos from 2015-1 to 2021-12 group by month.

@tyn1998
Copy link
Member

tyn1998 commented Mar 12, 2022

Got it! Thank you.

@tyn1998
Copy link
Member

tyn1998 commented Mar 18, 2022

@frank-zsy, why 404 NOT FOUND "https://hypertrons-oss.x-lab.info/activity_influence/actors/x/xiaoya-Esther.json"? Is this data not computed?

@frank-zsy
Copy link
Contributor Author

@frank-zsy, why 404 NOT FOUND "https://hypertrons-oss.x-lab.info/activity_influence/actors/x/xiaoya-Esther.json"? Is this data not computed?

That maybe because she did not have any activity duration 2021.12, so her data is not included. Since I recalculate the influence data, I will export a new version with all active users and repos in last 6 months. And I am consider to remove the redundant prefix dir x and flatten all the files.

@tyn1998
Copy link
Member

tyn1998 commented Mar 23, 2022

Got it~

@frank-zsy
Copy link
Contributor Author

frank-zsy commented Mar 23, 2022

For performance consideration, I may reduce the data set to all users and repos which have at least 10 activity score in last 6 month which will remove lots of inactive users and repos. Does this make sense for you?

@tyn1998
Copy link
Member

tyn1998 commented Mar 23, 2022

It is not a problem because currently the Trend charts component just won't show if corresponding data is not found.

and something to confirm: the time span of the data is still 2015~2021, right?

@frank-zsy
Copy link
Contributor Author

The time span will be 2015 to last month now. For now it is 2022.2 and 2022.3 for next month.

@tyn1998
Copy link
Member

tyn1998 commented Mar 23, 2022

That would be nice!

@frank-zsy
Copy link
Contributor Author

The new data has been uploaded under a new domain: https://oss.x-lab.info/hypercrx_actor/tyn1998.json, the repo data is like: https://oss.x-lab.info/hypercrx_repo/hypertrons/hypertrons-crx.json .

As the data will be exposed to public, we will use oss.x-lab.info in the future, is that OK with you? @tyn1998

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants