archive.today

About

archivetoday is a golang package for archiving web pages via archive.today.

Includes two command-line tools: archive.today for creating new captures and archive.today-snapshots for finding existing captures.

(See "Command-line programs" section below for further details.)

Please be mindful and responsible, and go easy on the site. We want archive.today to last forever, and we don't want to cause headaches or heartache!

Created by Jay Taylor.

Also see my related work: archive.org golang package

Alternate archive.today site / domain aliases: archive.fo, archive.is, archive.li, archive.md, archive.ph, archive.vn

Wikipedia article: archive.today

Requirements

Go version 1.9 or newer

Installation

go get jaytaylor.com/archive.today/...

Usage

Command-line programs

`archive.today <url>`

Archive a fresh new copy of an HTML page

`archive.today-snapshots <url>`

Search for existing page snapshots

Search query examples:

microsoft.com for snapshots from the host microsoft.com
*.microsoft.com for snapshots from microsoft.com and all its subdomains (e.g. www.microsoft.com)
http://twitter.com/burgerking for snapshots from exact url (search is case-sensitive)
http://twitter.com/burg* for snapshots from urls starting with http://twitter.com/burg
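To make the four query forms concrete, here is a small local sketch of the matching rules as listed above. It does not contact archive.today and is not how the site implements search. `matchesQuery` is a hypothetical helper, and comparing hosts case-insensitively is an assumption on my part.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// matchesQuery is a hypothetical helper that mimics the four query forms
// locally: host, wildcard subdomain, exact URL, and URL prefix.
func matchesQuery(query, rawURL string) bool {
	// Queries containing "://" are treated as URL queries.
	if strings.Contains(query, "://") {
		if strings.HasSuffix(query, "*") {
			// Trailing "*" means "URLs starting with this prefix".
			return strings.HasPrefix(rawURL, strings.TrimSuffix(query, "*"))
		}
		return rawURL == query // exact match, case-sensitive
	}

	u, err := url.Parse(rawURL)
	if err != nil {
		return false
	}
	// Assumption: hosts are compared case-insensitively.
	host := strings.ToLower(u.Hostname())
	query = strings.ToLower(query)

	// "*.example.com" matches the domain itself and all its subdomains.
	if strings.HasPrefix(query, "*.") {
		domain := strings.TrimPrefix(query, "*.")
		return host == domain || strings.HasSuffix(host, "."+domain)
	}
	return host == query
}

func main() {
	fmt.Println(matchesQuery("*.microsoft.com", "https://www.microsoft.com/en-us"))            // true
	fmt.Println(matchesQuery("microsoft.com", "https://www.microsoft.com/en-us"))              // false
	fmt.Println(matchesQuery("http://twitter.com/burg*", "http://twitter.com/burgerking"))     // true
	fmt.Println(matchesQuery("http://twitter.com/burgerking", "http://twitter.com/BurgerKing")) // false
}
```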

Go package interfaces

Capture URL HTML Page Content

capture.go:

package main

import (
	"fmt"

	"github.com/jaytaylor/archive.today"
)

var captureURL = "https://jaytaylor.com/"

func main() {
	archiveURL, err := archivetoday.Capture(captureURL)
	if err != nil {
		panic(err)
	}
	fmt.Printf("Successfully archived %v via archive.today: %v\n", captureURL, archiveURL)
}

// Output:
//
// Successfully archived https://jaytaylor.com/ via archive.today: https://archive.today/i2PiW

Search for Existing Snapshots

search.go:

package main

import (
	"fmt"
	"time"

	"github.com/jaytaylor/archive.today"
)

var searchURL = "https://jaytaylor.com/"

func main() {
	// Search for existing snapshots, giving up after 10 seconds.
	snapshots, err := archivetoday.Search(searchURL, 10*time.Second)
	if err != nil {
		panic(err)
	}
	// %#v prints the result in Go syntax.
	fmt.Printf("%#v\n", snapshots)
}

// Output:
//
//

Running the test suite

go test ./...

TODO

Add timeout to .Capture.
Consider unifying into a single binary.

License

Permissive MIT license, see the LICENSE file for more information.
