
crawler

Just a primitive web crawler that collects all the "usable" URLs from a set of starting pages.
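The README doesn't show how "usable" URLs are extracted (the project itself is Clojure). As a rough, hypothetical sketch of the idea in Python: pull `href` values out of anchor tags and keep only full absolute URLs, as the Examples section below requires.

```python
from html.parser import HTMLParser

class LinkCollector(HTMLParser):
    """Collects href values from <a> tags."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                # Keep only full (absolute) URLs, as the README requires.
                if name == "href" and value and value.startswith("http"):
                    self.links.append(value)

def usable_urls(html):
    """Return the absolute URLs found in a page's HTML."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links

page = '<a href="http://www.example.com/a">A</a> <a href="/relative">B</a>'
print(usable_urls(page))  # the relative link is dropped
```

The real crawler's filtering rules may differ; `usable_urls` and `LinkCollector` are illustrative names, not part of this project.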

Installation

No real installation: just clone the repository and build it with Leiningen.

Usage

Before you start, add the URLs you want to crawl to the file "urls.txt". Then run it with lein; the URLs it finds are written to the file "out__urls.txt".

$ cd MyWayToClojureProjects/crawler
$ lein run <depth>

Options

--depth

How many times the crawler should process a batch of URLs, following the links found in each pass. That's it: the number of waves!
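The wave idea can be sketched as a breadth-first loop (hypothetical Python, not the actual Clojure code; `fetch_links` stands in for whatever the crawler uses to download a page and extract its URLs):

```python
def crawl(seed_urls, depth, fetch_links):
    """Breadth-first crawl: each wave follows the links found in the previous one.

    fetch_links(url) -> list of URLs found on that page (a stand-in here).
    """
    seen = set(seed_urls)
    frontier = list(seed_urls)
    for _ in range(depth):              # depth = number of waves
        next_wave = []
        for url in frontier:
            for link in fetch_links(url):
                if link not in seen:    # avoid re-crawling a URL
                    seen.add(link)
                    next_wave.append(link)
        frontier = next_wave
    return seen                         # everything found, ready for out__urls.txt

# Tiny fake "web" to demonstrate the waves:
web = {"a": ["b", "c"], "b": ["d"], "c": [], "d": ["a"]}
print(sorted(crawl(["a"], 2, lambda u: web.get(u, []))))  # -> ['a', 'b', 'c', 'd']
```

With depth 2, the first wave discovers "b" and "c" from "a", and the second wave discovers "d" from "b"; "a" is never revisited.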

Examples

1) Add this URL to the file "urls.txt":

http://www.example.com

and save the file. Use only full URLs, like the one in the example above.

$ cd MyWayToClojureProjects/crawler
$ lein run 3
  1. Open the file "out__urls.txt"

  2. ????

  3. PROFIT!!!!

Bugs

¯\_(ツ)_/¯

What You Think

Law doesn't work, Authorities are illegal, Society is rotting.

License

Copyright © 2015 FIXME

Distributed under the Eclipse Public License either version 1.0 or (at your option) any later version.

About

How to crawl with clojure? Who knows...
