Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Import 2017 gsoc org data #59

Open
jayvdb opened this issue Dec 18, 2017 · 3 comments
Open

Import 2017 gsoc org data #59

jayvdb opened this issue Dec 18, 2017 · 3 comments

Comments

@jayvdb
Copy link
Member

jayvdb commented Dec 18, 2017

In 2017, coala was gsoc org 5817061024464896.

As this repository is generic, the tool must find the project identifier using only the org_name which is exposed in the community app.

Scrape the list of org projects from that page , e.g. 5154725527814144, pull in relevant data, such as student and mentor display name.

The scraper must be part of the django system so that any GSOC org can load their 2017 data using the scraper.

@jayvdb
Copy link
Member Author

jayvdb commented Dec 18, 2017

The extracted data should be cached in the generated website, like the GCI data, with correct timestamps, and only regenerated if the data is missing.

The fact that no data was found also needs to be cached, to avoid running the scraper in every build on non-gsoc orgs.

@jayvdb
Copy link
Member Author

jayvdb commented Dec 18, 2017

See also https://gitlab.com/coala/GSoC-2017/

@pradeepgangwar
Copy link

@jayvdb Please assign this to me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

Successfully merging a pull request may close this issue.

3 participants