Skip to content

A simple web scraper that gets the number of classified ads with a specific search term on finn.no. The result is saved in a .csv-file and uploaded to a S3 bucket.

Notifications You must be signed in to change notification settings

StaaleA/FinnScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

FinnScraper

A simple web scraper that gets the number of classified ads with a specific search term on finn.no. The result is saved in a .csv-file and uploaded to a S3 bucket. The script is running daily on a EC2 instance with help of crontab.

Dependencies

###lxml

sudo yum install python-devel
sudo yum install libxml2-devel
sudo yum install libxslt-devel  
sudo yum install gcc        
sudo easy_install pip
sudo pip install lxml

###Boto3

sudo pip install boto3

###Flask

If you want to have a virtual enviroment:

sudo pip install virtualenv
# In a project folder:
virtualenv venv
# Activate the enviroment:
. venv/bin/activate
pip install Flask

About

A simple web scraper that gets the number of classified ads with a specific search term on finn.no. The result is saved in a .csv-file and uploaded to a S3 bucket.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages