Three web-scraper projects below were designed mainly using Python, BeautifulSoup, Requests, and Flask.
Github doesn't support flask. Check the result & code on : https://replit.com/@BoyoonChoi/SuperScrapper#main.py
- Takes an input, and search for remote job opening on three websites:
-
- Stackoverflow. f"https://stackoverflow.com/jobs?r=true&q={word}"
-
- WeWorkRemotely. "https://weworkremotely.com"
-
- RemoteOK. f"https://remoteok.io/remote-dev+{word}-jobs"
-
- Results are shown on the report.html page, and user can download csv file by clicking on a button
we scrape Python job listings on both Indeed and StackOverflow, mainly using
BeautifulSoup
code example below
def extract_job(html):
title = html.find("span", title=True).string
company = html.find("span",{"class":"companyName"}).string
location = html.select_one("pre>div").text
job_id=html['data-jk']
return {
'title':title,
'company':company,
'location':location,
'link':f"https://www.indeed.com/viewjob?jk={job_id}"}
we scrape job listing from each company, and save them in individual csv files.
you can run my code here: https://replit.com/@BoyoonChoi/AlbaHeaven-Scraper#main.py