Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write a tool to export data from bpo #4

Closed
12 tasks done
ezio-melotti opened this issue Nov 11, 2020 · 1 comment
Closed
12 tasks done

Write a tool to export data from bpo #4

ezio-melotti opened this issue Nov 11, 2020 · 1 comment
Assignees

Comments

@ezio-melotti
Copy link
Member

ezio-melotti commented Nov 11, 2020

In order to import data into GitHub we need to export bpo data in a format compatible with the importer tool.

There are at least 5 ways to do this:

  1. Using the Roundup Python API to directly access the db (see below);
  2. Using roundup-admin to export the data and then parsing the output;
  3. Using the REST API;
  4. Using the XMLRPC interface;
  5. Accessing the PostreSQL DB directly.

The first option is likely the easiest solution. The script that generates the weekly "Summary of Python tracker issue" does something similar to access the database and extract data about the issues. The Roundup documentation has a table that summarizes the available functions.

By using one of these solutions, we can write a tool that extracts the data from bpo and rearranges them in the right format. The tool will also need reformat the issues (see #3), rearrange the labels, and possibly make other changes. The first version of the tool doesn't need to include these changes -- they can be added once we solved the other issues.

We should also take care of exporting attachments such as patches, sample scripts, screenshots, etc..

Update (2021-09-16)
I'm writing a tool using the first option above:

  • write a tool to extract data from Roundup
  • write a tool to download a sample repo
  • export data in a compatible format:
  • automate archive creation
    • delete test repo
    • create archive
@ezio-melotti ezio-melotti self-assigned this Nov 11, 2020
@ezio-melotti ezio-melotti changed the title Export data from bpo Write a tool to export data from bpo Nov 13, 2020
@ezio-melotti ezio-melotti added this to To do in GitHub migration via automation Nov 13, 2020
@ezio-melotti ezio-melotti moved this from To do to In progress in GitHub migration Apr 6, 2021
@ezio-melotti
Copy link
Member Author

This is now done: all relevant items are exported. For files/attachments, we decided to keep hosting them on bpo, and simply add a direct link to them.

Some issue fields still need some tweaking (see #5). Some messages still need link/ref rewriting (see #3).

GitHub migration automation moved this from In progress to Done Mar 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Development

No branches or pull requests

1 participant