You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
This repo is a part of blog series on several web scraping projects where we will explore scraping techniques to crawl data from simple websites to websites using advanced protection.
Our project leverages data mining techniques, including Apify, Airflow, BERT, PostgreSQL, and Power BI, to conduct in-depth sentiment analysis for bank branches. We successfully identify customer sentiments, highlight positive and negative aspects, and provide valuable insights for enhancing the customer experience and gaining a competitive edge.
Our project's mission is to build a resilient data warehouse solution, facilitating comprehensive analysis of customer feedback for each CIH Bank branch. In the ever-competitive banking landscape, harnessing customer sentiment is paramount for service improvement, elevated satisfaction, and data-informed decisions