Skip to content
This repository has been archived by the owner on Oct 2, 2020. It is now read-only.

nikoksr/web-monkey

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

89 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web Monkey

Small web-crawler created with the imagination of monkeys jumping from tree to tree.

Idea

  • Url Monkey

    A monkey jumps on a tree which represents an url. On his way climbing up the tree the monkey discovers many branches which visualize sub-urls of the main-url.

    Example

    All of them are being noted on a list of known trees, but only if they aren't already known and noted at this point in time. On his way to the treetop the monkey eventually discovers branches which don't belong to the tree he's climbing, so he notes these aswell and he keeps on climbing towards the top. Reaching the top the monkey has discovered every single branch surrounding his tree. He decides to take a look on his list of noted trees, chooses the first tree on the list following his current tree and jumps over to it.

    He repeats this process over and over again.

    web-monkey example
  • Plot Monkey

    A monkey that takes a database, created by an UrlMonkey, and creates plots based on the data. He tries to visualize the connections and distribution of urls found under a given url.

    Crawl URLs til Ctrl-C and save the results afterwards:

    web-monkey save

    Create the plot:

    plot-monkey example

    The resulting plot:

    plot-monkey result
  • Pic Monkey

    A monkey jumps on a tree and starts discovering its branches where he finds leafs. Leafs represent sub-objects of urls; pictures in this case. The monkey notes every leaf on his list and after discovering every leaf the tree has to offer, the monkey returns the list to the user. The user now has a list of all pictures found under the given url. The user can give the monkey a search-tag to specify his search for leafs. The monkey is only going to return leafs which match the wanted search-tag.

    You can search pictures by tag. This searches the given tag in the src and alt attributes of all images.

    pic-monkey example 1 pic-monkey example 2

    Or you can search without a tag which will give you a list of all images on that page.

    pic-monkey example 3

About

Small webcrawler created with the imagination of monkeys jumping from tree to tree.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages