GitHub Backup #62

Open
plehegar opened this issue Jul 11, 2017 · 7 comments

@plehegar
Member

We do back up repos and their data. I'm not sure how well that is documented, however. Needs some fact-finding.

@tripu
Member

tripu commented Jul 11, 2017

I can point you to our current approach to GH backups.

(BTW, we have this public project, gh-backup, which has gone almost two years without maintenance.
Either we take it up, or we discontinue it and make that clear on the project page.)

@dontcallmedom
Member

Having looked at the backups, they are indeed incomplete when it comes to non-git data:

  • we only have the latest 30 issues per repo (corresponding to the default page size of the GitHub API)
  • we only capture data about issues, nothing about the comments made on the issues (where most of the value actually resides)

It would probably be useful to thoroughly review the tool we use for the backup, and in particular to compare what it captures with the full range of data that GitHub exposes via its API:
https://developer.github.com/v3/
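
To illustrate the two gaps above, here is a minimal sketch (not the tool we actually use) of following the API's `Link: <...>; rel="next"` pagination headers so that a backup is not cut off at the first page, and of fetching issue comments alongside the issues. The function names (`next_link`, `http_fetch`, `fetch_all`, `backup_issue_data`) are made up for this example. Authentication and rate-limit handling are left out, so as written it would only suit small public repos.

```python
import json
import re
import urllib.request

API = "https://api.github.com"


def next_link(link_header):
    """Return the URL tagged rel="next" in a Link header, or None."""
    if not link_header:
        return None
    for match in re.finditer(r'<([^>]+)>\s*;\s*rel="([^"]+)"', link_header):
        if match.group(2) == "next":
            return match.group(1)
    return None


def http_fetch(url):
    """Fetch one page; return (parsed JSON list, Link header or None)."""
    # Assumption: unauthenticated access. A real backup would send a token.
    request = urllib.request.Request(
        url, headers={"Accept": "application/vnd.github+json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response), response.headers.get("Link")


def fetch_all(url, fetch=http_fetch):
    """Follow rel="next" links until the last page and collect every item."""
    items = []
    while url:
        page, link = fetch(url)
        items.extend(page)
        url = next_link(link)
    return items


def backup_issue_data(owner, repo, fetch=http_fetch):
    """Collect all issues and all issue comments for one repository."""
    base = f"{API}/repos/{owner}/{repo}"
    return {
        # state=all includes closed issues; this endpoint also returns pull requests.
        "issues": fetch_all(f"{base}/issues?state=all&per_page=100", fetch),
        # Repository-wide listing of comments made on issues.
        "comments": fetch_all(f"{base}/issues/comments?per_page=100", fetch),
    }
```

`fetch` is injectable so the pagination logic can be checked without touching the network. Calling `backup_issue_data(owner, repo)` with real values would return a dict that can be dumped to JSON next to the git mirror.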

@plehegar
Member Author

I wonder, don't we have the issues/comments through pheme?

@dontcallmedom
Member

We have a subset of the data on issues and comments via pheme, but pheme is not a backup system and is not built for long-term reliability.

@xfq
Member

xfq commented May 26, 2018

The recently announced User Migration API seems helpful.

@r12a

r12a commented Jun 7, 2018

Could the systems team please clarify for the staff the current situation wrt GitHub backups, and say something about plans for ongoing improvements? I thought I heard last week that we have backups of most of the data, but not in an accessible form. It sounds like we should be working on that as a pretty high priority, given that much of our institutional knowledge is affected.

@vivienlacourba
Member

Hi @r12a, W3C Systeam already has backups of our GitHub data in place. We will communicate on this soon.

@vivienlacourba vivienlacourba self-assigned this Jun 7, 2018