Keenbo

A simple to use search engine for search data.

Project consist of different modules:

crawler: crawls the pages and save in database
search-engine: search API on stored data
forward-extractor: run a mapreduce for calculate incomming link for a website to have better results

Getting Started

For run this project on your own local machine or server you should install Zookeeper and hbase and hadoop and elasticsearch and kafka.

Prerequisites

For installing dependencies for this project, read wikis and configure it properly depend on your servers.

Running

For running application, you can use .sh files inside bin folder.

Running the tests

Test all projects with below command:

mvn test

Built With

Spark - Used to run mapreduces
Kafka - Used to handling links queue
ElasticSearch - Used to run search queries
Redis - Used to check duplicated pages
HBase - Used to store data
DropWizard - Used to monitoring
JSoup - Used to parse the pages
Caffeine - Used to store requested urls to send request politely
Jackson - Used to serializing objects
Maven - Used to Dependency Management

Authors

Amin Borjian - github
Danial Erfanian - github
Ehsan Karimi - github
MohammadReza Pakzadian - github

See also the list of contributors who participated in this project.

Name		Name	Last commit message	Last commit date
Latest commit History 825 Commits
.travis		.travis
backward-extractor		backward-extractor
classifier		classifier
common		common
crawler		crawler
hbase-count		hbase-count
page-collector		page-collector
pagerank		pagerank
search		search
shuffler		shuffler
site-graph		site-graph
word-graph		word-graph
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Keenbo

Getting Started

Prerequisites

Running

Running the tests

Built With

Authors

About

Releases

Packages

Contributors 5

Languages

nimbo3/Keenbo

Folders and files

Latest commit

History

Repository files navigation

Keenbo

Getting Started

Prerequisites

Running

Running the tests

Built With

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages