Skip to content

[Proof of concept][WIP] The UI for scrappers cluster management

Notifications You must be signed in to change notification settings

fargelus/omniscrapper_hq

 
 

Repository files navigation

OmniScrapper HQ

Web application for OmniScrapper management.

It main goal is to provide:

  • Ability to define scrappers in UI
  • Ability to test defined scrappers from UI
  • Ability to schedule scrapping schedule

Setup

Demo

If you want just to see how the application works, you can use dockerized setup to start project with a single command.

Run

docker-compose up -d postgres redis kafka web then run docker-compose up -d scheduler runner

in the directory of project, and they visit localhost:2300.

Dockerized version has already defines scrapper for HackerNews website, which will be started right after services start.

Start

  • Copy env variables cp .env.development.sample .env.development
  • Run make db-prepare it will create db, run migrations and fill database with data
  • To start the server run make server

Seeds

Run rake db:seed to fill the database with sample data ready for testing.

It will add one real site, schema and task for scrapping. Feel free to go to tasks page and run sample scrapping.

About

[Proof of concept][WIP] The UI for scrappers cluster management

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Ruby 87.4%
  • HTML 9.9%
  • CSS 1.8%
  • Makefile 0.3%
  • Dockerfile 0.3%
  • JavaScript 0.2%
  • Shell 0.1%