@ArchiveBox

ArchiveBox

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Pinned

  1. ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    Python · 22k stars · 1.2k forks

  2. abx-dl Public

    ⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screensh…

    Python · 15 stars · 1 fork

  3. archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript · 242 stars · 21 forks

  4. pydantic-pkgr Public

    📦 Modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python · 13 stars

  5. internet-archiving-talk Public

    Forked from pirate/internet-archiving-talk

    🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

    JavaScript 14 1

  6. good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    317 8

Repositories

Showing 10 of 17 repositories
  • ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

  • pydantic-pkgr Public

    📦 Modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 13 MIT 0 0 0 Updated Oct 29, 2024
  • abx-dl Public

    ⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git srcs, and more...

    Python 15 MIT 1 0 0 Updated Oct 21, 2024
  • docker-archivebox Public

    Home of the official docker image for ArchiveBox

    47 GPL-3.0 12 1 1 Updated Oct 16, 2024
  • pip-archivebox Public archive

    Official Python package for ArchiveBox, the self-hosted internet archiving solution.

    13 GPL-3.0 2 0 7 Updated Oct 5, 2024
  • homebrew-archivebox Public archive

    Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

    Ruby 26 GPL-3.0 3 0 0 Updated Oct 5, 2024
  • debian-archivebox Public archive

    Home of the official apt/deb package for Ubuntu/Debian-based systems.

    Python 17 GPL-3.0 5 0 1 Updated Oct 5, 2024
  • docs Public

    Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.

    CSS 14 4 0 1 Updated Oct 5, 2024
  • readability-extractor Public

    JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.

    JavaScript 36 13 0 2 Updated Sep 16, 2024
  • archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 242 MIT 21 19 0 Updated Jul 12, 2024

Sponsors

  • @jgillman
