web-scrapping
Here are 211 public repositories matching this topic...
Some of my scraping tools and scripts
Updated Feb 15, 2025 - JavaScript
AgentQL is an AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries are self-healing as UI changes and work across similar sites. Users can define structured data output, making AgentQL versatile for developers and data scientists.
Updated Feb 18, 2025 - Python
Popular Node.js Libraries for Developing Data-Driven AI Applications, with Sample Code
Updated Feb 13, 2025 - JavaScript
A Python-based web scraper that collects accommodation leads from suedtirol.info for a given region.
Updated Feb 6, 2025 - Python
📌 "Automated tool to match Apple Podcasts with Spotify using API & fuzzy matching."
Updated Feb 3, 2025 - Python
Repository designed to help freshers easily grasp the basics of web scripting, offering simple guides and examples to build a strong foundation.
Updated Jan 28, 2025 - Python
Insight Platter: A comprehensive platform offering actionable insights from restaurant reviews through web scraping, sentiment analysis, and data visualization.
Updated Jan 28, 2025 - Python
Node scrapper: A Puppeteer bot that reports the number of PS5 and Xbox Series X consoles sold per day on eBay.
Updated Jan 28, 2025 - JavaScript
The Xing Job Application Assistant automates job applications on the Xing platform, making it easier for users to search and apply based on their preferences. After a secure manual login, the assistant uses saved session cookies to apply to jobs according to the job preferences set by the user.
Updated Jan 28, 2025 - Python
This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.
Updated Jan 15, 2025 - Python
A beginner-level scraping project for learning purposes.
Updated Dec 29, 2024 - Jupyter Notebook
Exception Handling, Logging, Assertions, File Handling, Pickling and Unpickling, OOP Concepts, Garbage Collection, Multithreading, Regular Expressions, Web Scraping, Shallow Copy vs Deep Copy, Python Database Connectivity, Pydoc, Python Unit Testing with Selenium, pytest with Selenium.
Updated Dec 27, 2024 - Python
Web Application for real-time Stock Market Price Prediction and Analysis. This web app demonstrates the integration of data engineering, machine learning, and user-centric design. Its robust architecture and reliable predictions offer significant utility for forecasting tasks and highlight practical implementation skills.
Updated Dec 25, 2024 - Python
This repository contains a project to analyze customer feedback data for British Airways (BA) and uncover insights about the airline. It includes code, data, and documentation for data scraping and preparation, data analysis, and presentation of results and recommendations.
Updated Dec 11, 2024 - Jupyter Notebook
"Test-driven web development with Python" book in PDF format.
Updated Dec 6, 2024 - Python
Webcrawl is a Python web crawler that recursively follows links from a starting URL to extract and print unique HTTP links. Using 'requests' and 'BeautifulSoup', it avoids revisiting pages, handles errors, and supports a configurable crawling depth. Ideal for gathering and analyzing web links.
Updated Nov 19, 2024 - Python
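The crawling approach described for Webcrawl (follow links recursively, skip pages already visited, stop at a depth limit) can be sketched roughly as follows. This is not the repository's code: it swaps 'requests' and 'BeautifulSoup' for the standard library's urllib and html.parser so that it runs without extra installs, and the names crawl, fetch, LinkParser, max_depth and fetch_page are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch(url):
    """Default fetcher: download a page and return its body as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, max_depth=2, fetch_page=fetch):
    """Return the unique HTTP(S) links found, in discovery order."""
    visited = set()  # pages already fetched, so none is fetched twice
    links = {}       # a dict keeps insertion order and drops duplicates

    def visit(url, depth):
        if depth > max_depth or url in visited:
            return
        visited.add(url)
        try:
            html = fetch_page(url)
        except Exception as exc:  # network or lookup failure: skip the page
            print(f"skipping {url}: {exc}")
            return
        parser = LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop the #fragment part.
            link = urldefrag(urljoin(url, href)).url
            if link.startswith(("http://", "https://")):
                links.setdefault(link, None)
                visit(link, depth + 1)

    visit(start_url, 0)
    return list(links)
```

Calling crawl("https://example.com", max_depth=1) would fetch the start page and every page it links to directly. Passing a different fetch_page callable (for instance, a dictionary lookup over canned HTML) lets the traversal be exercised without network access.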
A RAG application that facilitates communication with web pages and PDF content, extracting information and letting users interact with the extracted data. This tool is designed for researchers, content analysts, and anyone needing to process and interact with web and PDF content efficiently.
Updated Nov 16, 2024 - Python