Cain

A naively simple personal web resource archive system.

Introduction

Cain is an extremely simple tool that lets you download and organize online resources (mainly web pages) for offline viewing. It takes a URL, downloads the resources it references, and places them in a local directory. This forms one record. Records are organized in a category tree, which is just a directory structure under a predefined root directory (specified in the config file).
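
To make the layout concrete, here is a minimal sketch of how a category and a title could map to a record directory under the root. The helper name `record_dir` and the choice to name the record directory after the title are assumptions for illustration only, not necessarily how Cain does it.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical helper: build the directory for one record.
/// `category` uses "/" as a separator, e.g. "category/subcategory".
fn record_dir(root: &Path, category: &str, title: &str) -> PathBuf {
    let mut dir = root.to_path_buf();
    // Each non-empty category component becomes one directory level.
    for part in category.split('/').filter(|p| !p.is_empty()) {
        dir.push(part);
    }
    // Assumption: the record directory is named after the title.
    dir.push(title);
    dir
}

fn main() {
    let dir = record_dir(
        Path::new("/some/path"),
        "category/subcategory",
        "Some Title",
    );
    println!("{}", dir.display());
}
```

Under these assumptions, moving a record to another category would amount to moving its directory.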

As the web has evolved, for better or for worse, archiving a web page is no longer as simple as running “wget” on its URL. Some “modern” web pages are actually empty, with their content filled in on the client side by JavaScript code. Web developers usually do this because it makes the website work like an app, where the frontend is not necessarily HTML and relies on some API to communicate with the server. This is really bad for archiving, because in this case we cannot just download the HTML, which again is empty. We need to access the content via the API, which is usually private or behind authentication.

Twitter and (the new) Reddit are two of the most infamous examples. As of now, Cain supports Twitter, but not other such sites.

For normal web pages, Cain uses Monolith to archive them.

Installation

Install with Cargo.

Monolith is needed to archive normal web pages.

Usage

Create a config file at ~/.config/cain/config.toml with the following content:

root_dir = "/some/path"

As of now, root_dir is the only config option.
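
Since the config holds a single key, reading it can be sketched in a few lines. The function `parse_root_dir` below is hypothetical and hand-rolled so the example has no dependencies; it only understands a `root_dir = "..."` line, and a real implementation would more likely use a proper TOML parser.

```rust
/// Hypothetical helper: extract `root_dir` from the config file text.
/// Only handles a plain `root_dir = "..."` line; this is not a TOML parser.
fn parse_root_dir(config: &str) -> Option<String> {
    for line in config.lines() {
        let line = line.trim();
        // Skip comment lines.
        if line.starts_with('#') {
            continue;
        }
        if let Some((key, value)) = line.split_once('=') {
            if key.trim() == "root_dir" {
                // Require the value to be wrapped in double quotes.
                return value
                    .trim()
                    .strip_prefix('"')?
                    .strip_suffix('"')
                    .map(String::from);
            }
        }
    }
    None
}

fn main() {
    let config = "root_dir = \"/some/path\"\n";
    match parse_root_dir(config) {
        Some(root) => println!("root_dir is {}", root),
        None => println!("root_dir is not set"),
    }
}
```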

Run

cain record -c "category/subcategory" "Some Title" https://google.com/

to archive the Google main page under category/subcategory.
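
For a normal web page, the archiving step comes down to handing the URL to Monolith. The sketch below builds such an invocation with Monolith's `-o` output option. The helper name `monolith_command` and the `index.html` file name are assumptions for illustration; the way Cain actually calls Monolith may differ.

```rust
use std::path::Path;
use std::process::Command;

/// Hypothetical helper: build (but do not run) a Monolith invocation that
/// saves `url` as a single HTML file inside `record_dir`.
fn monolith_command(url: &str, record_dir: &Path) -> Command {
    // Assumption: the page is stored as `index.html` in the record directory.
    let output = record_dir.join("index.html");
    let mut cmd = Command::new("monolith");
    cmd.arg(url).arg("-o").arg(output);
    cmd
}

fn main() {
    let cmd = monolith_command(
        "https://google.com/",
        Path::new("/some/path/category/subcategory/Some Title"),
    );
    // Print the command instead of running it, since Monolith may not be installed.
    println!("{:?}", cmd);
}
```

Calling `status()` on the returned command would run Monolith, provided it is installed and on the PATH.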

MetroWind/cain
