CHANGELOG.md

27/01/2020

  • Readme.md, Changelog.md and .gitignore added to project
  • Add rules for Scrapy LinkExtractors
  • Add command line arguments to project
  • Add license

03/02/2020

  • Some changes to spiders
  • Some changes to README
  • SQLAlchemy models created
  • Postgres Client object created
  • Pipeline feeds Postgres client with data
  • init_db script created to fill database with required models

06/02/2020

  • Some changes to kariera.gr crawler
  • Some changes to project pipeline
  • Check if job posting record already exists in Postgres
  • Docker configuration added for Scrapy project
  • README.md changed

12/02/2020

  • Use Docker Selenium and WebDriver for Chrome
  • Middleware that uses Selenium now downloads JS-protected content
  • indeed, kariera and skywalker crawlers tested