Substack Archiver

Substack Archiver is a simple Python program that allows users to archive the contents of their Substack newsletter. The program saves all the contents in a separated HTML file and also creates multiple JSON dumps. It is important to note that the HTML is raw, without any CSS or JS. It just provides a basic structure for reading the content. However you can use the Firefox Reader Viewer or Chromes Reader Mode, to get a pretty view of the webcontent. Note that direct video or audio files from Substack are not supported yet.

Getting Started

To use Substack Archiver, you will need to fill out the "config.env" file with your Substack login information. Once you have done this, you will need to install the requirements by running the following command:

pip install -r requirements.txt

To run the Substack Archiver, simply enter the following command in your terminal:

python substack_archiver.py "substack_name" "url_of_substack"

TODO

Add support for audio files
Add support for video files

Output

The program will save each Substack post in a separated HTML file and also create multiple JSON dumps. The HTML files will contain all the articles, images, and other content from your Substack newsletter. The JSON dumps will contain metadata such as the title, date, author, and tags for each article.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
README/images		README/images
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
colors.py		colors.py
config.env		config.env
requirements.txt		requirements.txt
substack_archiver.py		substack_archiver.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README/images

README/images

.gitignore

.gitignore

LICENSE.md

LICENSE.md

README.md

README.md

colors.py

colors.py

config.env

config.env

requirements.txt

requirements.txt