Skip to content

rajputpriyankaa/playwright-dynamic-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎭 Web Scraping with Playwright

Python Playwright License

Python-based web scraper built with Microsoft Playwright β€” designed specifically for modern websites with dynamic content, JavaScript rendering, and infinite scroll pagination.


πŸ“Œ What This Project Does

  • Scrapes JavaScript-heavy websites that Selenium or BeautifulSoup can't handle
  • Handles infinite scroll pages by simulating user scrolling
  • Waits for dynamic elements to load before extracting data
  • Exports clean, structured data to CSV or JSON

🧰 Tech Stack

Tool Purpose
Python 3.8+ Core language
Playwright (sync) Browser automation for dynamic sites
Pandas Data processing & export


πŸ’‘ Why Playwright Over Selenium?

Feature Selenium Playwright
Async support ❌ βœ…
Auto-wait for elements ❌ βœ…
Infinite scroll handling Manual Built-in
Speed Slower Faster
Modern JS sites Partial Full support

πŸ™‹ Author

Priyanka Rajput β€” GitHub

About

Web scraping for sites with dynamic content and infinite scroll pagination.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages