Skip to content
@ArchiveBox

ArchiveBox

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Pinned

  1. ArchiveBox ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    Python 20k 1.1k

  2. archivebox-browser-extension archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 165 13

  3. archivebox-proxy archivebox-proxy Public

    Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

    Python 9

  4. internet-archiving-talk internet-archiving-talk Public

    Forked from pirate/internet-archiving-talk

    🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

    JavaScript 12 1

  5. good-karma-kit good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    297 8

  6. pydantic-pkgr pydantic-pkgr Public

    A modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 6

Repositories

Showing 10 of 16 repositories
  • pydantic-pkgr Public

    A modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 6 MIT 0 0 0 Updated May 23, 2024
  • ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

  • archivebox-spreadsheet-bot Public

    This is a bot that provides ArchiveBox integration with Google Sheets for new URL ingestion, archived URL management, and automated QA (optionally AI-powered).

    0 GPL-3.0 0 0 0 Updated May 21, 2024
  • pip-archivebox Public

    Official Python package for ArchiveBox, the self-hosted internet archiving solution.

    14 GPL-3.0 2 0 2 Updated May 21, 2024
  • debian-archivebox Public

    Home of the official apt/deb package for Ubuntu/Debian-based systems.

    Python 18 GPL-3.0 5 0 1 Updated May 20, 2024
  • good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    297 MIT 8 0 0 Updated May 11, 2024
  • docs Public

    Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

    CSS 12 3 0 4 Updated May 7, 2024
  • readability-extractor Public

    Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

    JavaScript 33 13 0 0 Updated Apr 11, 2024
  • archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 165 MIT 13 15 0 Updated Apr 11, 2024
  • community Public

    A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

    4 0 0 0 Updated Feb 21, 2024