H-N41K/Tarantula

Tarantula

An elegant search engine


Our System

  • We built a custom web crawler that recursively crawls the web in depth-first (DFS) order.
  • The extracted data is stored in a database, then pre-processed and indexed.
  • Stored pages are assigned scores using metrics such as location score, frequency score, inbound link score, distance score, and PageRank score.
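
The pipeline above (crawl, index, score) can be illustrated with a minimal Python sketch. This is not the project's actual code: the in-memory `PAGES` dictionary stands in for real HTTP fetching and the database, and every function name here (`fetch`, `crawl_dfs`, `build_index`, `frequency_score`, `location_score`, `inbound_link_score`) is hypothetical. The scoring formulas are one common way to define these metrics and may differ from what Tarantula uses. Distance and PageRank scores are left out for brevity.

```python
from collections import Counter

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
# A real crawler would fetch and parse HTML instead.
PAGES = {
    "a": ("spider web crawler", ["b", "c"]),
    "b": ("web search engine web", ["c"]),
    "c": ("search ranking", ["a"]),
    "d": ("unreachable page", []),
}


def fetch(url):
    """Stand-in for an HTTP request plus HTML parsing."""
    return PAGES[url]


def crawl_dfs(start, max_depth=3):
    """Visit pages depth-first from `start` and return them in visit order."""
    visited, order = set(), []

    def visit(url, depth):
        if url in visited or depth > max_depth or url not in PAGES:
            return
        visited.add(url)
        order.append(url)
        _, links = fetch(url)
        for link in links:
            visit(link, depth + 1)  # go deep before trying sibling links

    visit(start, 0)
    return order


def build_index(urls):
    """Build an inverted index: word -> {url: [word positions]}."""
    index = {}
    for url in urls:
        text, _ = fetch(url)
        for pos, word in enumerate(text.split()):
            index.setdefault(word, {}).setdefault(url, []).append(pos)
    return index


def frequency_score(index, word):
    """Occurrences per page, normalised so the best page scores 1.0."""
    counts = {url: len(p) for url, p in index.get(word, {}).items()}
    top = max(counts.values(), default=1)
    return {url: c / top for url, c in counts.items()}


def location_score(index, word):
    """Earlier first occurrence scores higher; the best page scores 1.0."""
    first = {url: p[0] for url, p in index.get(word, {}).items()}
    best = min(first.values(), default=0)
    return {url: (best + 1) / (pos + 1) for url, pos in first.items()}


def inbound_link_score(urls):
    """Number of crawled pages linking to each page, normalised to 1.0."""
    counts = Counter()
    for url in urls:
        _, links = fetch(url)
        counts.update(link for link in set(links) if link in urls)
    top = max(counts.values(), default=1)
    return {url: counts[url] / top for url in urls}


crawled = crawl_dfs("a")
index = build_index(crawled)
```

In this toy graph, page "d" is never reached because nothing links to it, so it is absent from both the crawl order and the index. A final ranking would typically combine the per-metric scores as a weighted sum. The weights are a tuning choice that this sketch does not attempt to guess.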

Features

  • Web Search
  • Image Search
  • Torrent Search
  • Quick Search (I'm feeling lucky)
  • Voice Input

Research Paper

http://ijics.com/gallery/25-may-1190.pdf


Presentation (PDF)

Tarantula_Presentation.pdf


Demo

https://www.youtube.com/watch?v=sjpkbDCQqLE


Future Scope

  • Support for semantic search.
  • Making the crawler more intelligent.

Contributing

We welcome enhancements and bug fixes 😄
