Aranea

A general purpose web crawler.

Usage: php index.php -u [url] [options]...

Arguments:
  -d
  --debug
      Print debug messages.

  -h
  --help
      Print this help message.

  --ignore-nofollow
      Ignore robots.txt and rel="nofollow" on links

  -H
  --span-hosts
      Enable spanning across hosts when doing recursive retrieving.

  -l <depth>
  --level <depth>
      Specify maximum recursion depth level.

	--max-redirects <number>
      Follow no more than <number> redirects per page.

	--max-urls <number>
      Terminate after having found <number> URLs.

  -o <directory>
  --output-directory <directory>
      Log retrieved data to files in a directory.

  -q
  --quiet
      Turn off regular output.

  -r
  --recursive
      Turn on recursive retrieving. The default maximum depth is 5.

  -T <seconds>
  --timeout <seconds>
      Set the network timeout to <seconds> seconds.

  --connect-timeout <seconds>
      Set the connect timeout to <seconds> seconds.

  -u <url>
  --url <url>
      Retrieve a URL.

  -v
  --verbose
      Turn on verbose output.

  -w <seconds>
  --wait <seconds>
      Wait the specified number of seconds between the retrievals.

Example output:

$ php index.php -u https://mozilla.org -r
200 63.245.215.20 https://www.mozilla.org/en-US/
200 63.245.215.20 https://www.mozilla.org/en-US/mission/
200 63.245.215.20 https://www.mozilla.org/en-US/about/
200 63.245.215.20 https://www.mozilla.org/en-US/products/
200 63.245.215.20 https://www.mozilla.org/en-US/contribute/
...

Using Docker

$ docker run --rm aliasio/aranea -u https://mozilla.org -r

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
src/Aranea		src/Aranea
.Dockerignore		.Dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
composer.json		composer.json
index.php		index.php

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aranea

About

Releases

Packages

Languages

License

AliasIO/Aranea

Folders and files

Latest commit

History

Repository files navigation

Aranea

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages