web-scrapping

Here are 211 public repositories matching this topic...

agentql

AgentQL is an AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries self-heal when the UI changes and work across similar sites. Users can define structured data output, making AgentQL versatile for developers and data scientists.

  • Updated Feb 18, 2025
  • Python
Web-Scraping-Starter-Kit

A repository designed to help beginners grasp the basics of web scraping, offering simple guides and examples to build a strong foundation.

  • Updated Jan 28, 2025
  • Python

Insight Platter: A comprehensive platform offering actionable insights from restaurant reviews through web scraping, sentiment analysis, and data visualization.

  • Updated Jan 28, 2025
  • Python

The Xing Job Application Assistant automates job applications on the Xing platform, making it easier for users to search and apply based on their preferences. After a secure manual login, the assistant uses saved session cookies to apply to jobs according to the job preferences set by the user.

  • Updated Jan 28, 2025
  • Python

This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.

  • Updated Jan 15, 2025
  • Python

Exception Handling, Logging, Assertions, File Handling, Pickling and Unpickling, OOP Concepts, Garbage Collection, Multithreading, Regular Expressions, Web Scraping, Shallow Copy vs Deep Copy, Python Database Connectivity, Pydoc, Python unit testing with Selenium, pytest with Selenium.

  • Updated Dec 27, 2024
  • Python

Web Application for real-time Stock Market Price Prediction and Analysis. This web app demonstrates the integration of data engineering, machine learning, and user-centric design. Its robust architecture and reliable predictions offer significant utility for forecasting tasks and highlight practical implementation skills.

  • Updated Dec 25, 2024
  • Python

This repository contains a project to analyze customer feedback data for British Airways (BA) and uncover insights about the airline. It includes code, data, and documentation for data scraping and preparation, data analysis, and presentation of results and recommendations.

  • Updated Dec 11, 2024
  • Jupyter Notebook

Webcrawl is a Python web crawler that recursively follows links from a starting URL to extract and print unique HTTP links. Using 'requests' and 'BeautifulSoup', it avoids revisits, handles errors, and supports configurable crawling depth. Ideal for gathering and analyzing web links.

  • Updated Nov 19, 2024
  • Python
