This is a project I made for importing files from Confluence into Outline.
I wanted to migrate from Atlassian Confluence to Outline without paying for the Confluence importer. Confluence can export an entire space as HTML files, and Outline can import HTML files. The Confluence export dumps every file into one folder, but it includes a sitemap, also in HTML, that records the tree structure. I made this program to parse the sitemap into a tree and import the files in the correct order and structure.
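To illustrate the sitemap step, here is a minimal sketch of turning nested HTML lists into a tree. It is not the project's actual parser: it assumes the sitemap is a set of nested `<ul>`/`<li>` elements with one `<a href="...">` per page, and the names `Node`, `SitemapParser`, and `parse_sitemap` are made up for this example. It only uses the Python standard library.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser


@dataclass
class Node:
    """One page in the sitemap (hypothetical structure for this sketch)."""
    title: str
    href: str
    children: list["Node"] = field(default_factory=list)


class SitemapParser(HTMLParser):
    """Builds a tree from nested <ul>/<li>/<a> elements (assumed layout)."""

    def __init__(self):
        super().__init__()
        self.root = Node(title="root", href="")
        self._parents = [self.root]  # one entry per currently open <ul>
        self._last = self.root       # most recently created node
        self._current = None         # node whose link text is being read

    def handle_starttag(self, tag, attrs):
        if tag == "ul":
            # Links inside this list are children of the last link seen.
            self._parents.append(self._last)
        elif tag == "a" and len(self._parents) > 1:
            href = dict(attrs).get("href") or ""
            node = Node(title="", href=href)
            self._parents[-1].children.append(node)
            self._last = node
            self._current = node

    def handle_endtag(self, tag):
        if tag == "ul" and len(self._parents) > 1:
            # Leaving a list: later siblings attach to the same parent again.
            self._last = self._parents.pop()
        elif tag == "a" and self._current is not None:
            self._current.title = self._current.title.strip()
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self._current.title += data


def parse_sitemap(html: str) -> Node:
    """Return a synthetic root node whose children are the top-level pages."""
    parser = SitemapParser()
    parser.feed(html)
    parser.close()
    return parser.root
```

Calling `parse_sitemap(open("index.html", encoding="utf-8").read())` would return the root of the tree. The file name `index.html` is also an assumption, so check what your export actually contains.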
- Parse an HTML sitemap to extract document structure.
- Clean up HTML files by removing unnecessary elements.
- Import documents into Outline collections using the API.
- Progress bar and detailed logging for import operations.
- CLI for parsing, importing, and making API requests.
- Run the CLI and choose an operation (parse sitemap, import collection, API requests).
- Provide the path to your HTML sitemap or collection folder.
- The tool parses the sitemap, cleans up each HTML file, and imports the files into Outline, preserving the hierarchy.
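The reason import order matters is that a child document can only be attached to a parent that already exists in Outline. The sketch below shows one way to get that order, a depth-first walk that uploads each parent before its children and passes the parent's new id down. It is an illustration under assumptions, not the tool's real code: `Page`, `import_tree`, and the `upload` callback are hypothetical names, and `upload` stands in for whatever Outline API request creates a document and returns its id.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Page:
    """One page from the parsed sitemap (hypothetical structure)."""
    title: str
    href: str
    children: list["Page"] = field(default_factory=list)


# Hypothetical uploader: takes a page and the id of its parent document
# (None for top-level pages) and returns the id of the created document.
Uploader = Callable[[Page, Optional[str]], str]


def import_tree(
    pages: list[Page],
    upload: Uploader,
    parent_id: Optional[str] = None,
) -> dict[str, str]:
    """Upload pages depth-first so every parent exists before its children.

    Returns a mapping from each page's href to the id of its new document.
    """
    created: dict[str, str] = {}
    for page in pages:
        doc_id = upload(page, parent_id)
        created[page.href] = doc_id
        # Children are uploaded only after their parent has an id.
        created.update(import_tree(page.children, upload, doc_id))
    return created
```

Keeping the network call behind a callback makes the traversal easy to try out with a fake uploader before pointing it at a real Outline instance.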