Scrapping Job Offers in Craigslist

This project scrape data about Engineering job offers in New York City from craigslist.com using Scrapy, a framework for extracting the data from websites.

The project has one spider able to scrape the textual and image data of all Engineering job offers in New York City.

The textual data is available in jobs.csv file. The extracted images are in images folder.

How to use

You will need Python 3.x to run the scripts. Python can be downloaded here.

You have to install Scrapy framework:

In command prompt/Terminal: pip install scrapy
If you are using Anaconda Python distribution: conda install -c conda-forge scrapy

Once you have installed Scrapy framework, just clone/download this project, access the folder in command prompt/Terminal and run the following command:

scrapy crawl jobs -o jobs.csv

You can change the output format to JSON or XML by change the output file extension (ex: jobs.json).

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
craigslist		craigslist
images		images
README.md		README.md
jobs.csv		jobs.csv
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrapping Job Offers in Craigslist

How to use

About

Languages

alynnebc/scrapping-craigslist

Folders and files

Latest commit

History

Repository files navigation

Scrapping Job Offers in Craigslist

How to use

About

Topics

Resources

Stars

Watchers

Forks

Languages