Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
A job search engine built using Elasticsearch to illustrate the role of tracking, measurement and evaluation for search quality
Fetching latest commit…
Cannot retrieve the latest commit at this time.
|Type||Name||Latest commit message||Commit time|
|Failed to load latest commit information.|
OVERVIEW - A job search engine with simple front-end built in PHP using Elasticsearch. Crawler is written in Perl. The purpose is to illustrate the role of tracking, measurement and evaluation for search quality. The entire process is described in the blog post https://wilsonwong.co/2016/10/06/search-engineering-101. INSTALLATION - ActivePerl for Windows - PHP - Elasticsearch - MySQL CONFIGURATION - add the following to elasticsearch.yml http.cors.allow-origin: "*" http.cors.enabled: true node.master: true index.max_result_window: 200000 - configure the values in the ./define.php file and the ./crawler/conf.pl for database connections, Elasticsearch index, target URL for crawling, etc. - all the crawled jobs are first stored in ./crawler/rawcontent/