A bot that posts job openings at Reuters News
-
Updated
Nov 16, 2024 - Python
A bot that posts job openings at Reuters News
Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
Web scraper which aggregates pre-print academic economics papers from 20+ sources; presents titles, abstracts, authors and hyperlinks on an online dashboard. Auto-updates daily.
Sharesansar Nepal NEPSE daily share price data scraping with Python. Scrapes all daily floor sheet from sharesansar site.
Personal Finance Data Pipeline & Dashboard
A discord bot to notify the new courses from Courspora
Extracting the "dot plot" economic projections posted online by the Federal Open Market Committee
Test scraped job details from companies websites against peviitor.ro
📊 Blazing fast Python framework for web crawling, scraping, testing, and reporting. Supports pytest. Stealth abilities: UC Mode and CDP Mode.
A very simple news crawler with a funny name
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Starter project for using Steel with Python SDK and Selenium.
Python toolkit for preprocessing data for the City Controller's Gun Violence Dashboard
Scrapy, a fast high-level web crawling & scraping framework for Python.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
Python utility for fetching any historical data using caching. Suitable for news, posts, weather, etc.
AgentQL is an AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries are self-healing as UI changes and work across similar sites. Users can define structured data output, making AgentQL versatile for developers and data scientists.
This project uses requests and BeautifulSoup to scrape articles from Google News in categories like Sports, Entertainment, Health, Technology, Business, and Law. Selenium is used to extract detailed content for sentiment analysis, categorizing articles as Positive, Negative, or Neutral with TextBlob. For Entertainment news, Llama is used to generat
An API wrapper for Scrappey.com written in Python (cloudflare, datadome bypass & solver)
NBA Stats API via Basketball Reference
Add a description, image, and links to the web-scraping topic page so that developers can more easily learn about it.
To associate your repository with the web-scraping topic, visit your repo's landing page and select "manage topics."