Shariful Islam pythonicshariful

Web Scraping & ML Enthusiast

🕸️ What I Do

Web scraping at scale with Python (Selenium, Playwright, Scrapy, BeautifulSoup)
Automation pipelines for data collection, cleaning, and storage
API integrations (REST/GraphQL) and browser automation
Data wrangling with Pandas, exporting to CSV/JSON/DB
Learning ML & AI to build smarter data products

🔧 Tech Stack

✨ Highlights

Built bots that extract thousands of pages/day with rotating proxies & retries
Designed resilient anti-bot bypass flows (stealth drivers, human-like waits, CAPTCHA solving via third-party services)
Delivered clean datasets ready for analysis & model training
Currently exploring feature engineering, vector databases, and LLM-powered scraping assistants
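
The "rotating proxies & retries" idea from the highlights can be sketched roughly as follows. This is a minimal illustration, not the code behind the bots mentioned above: `fetch_with_retries`, its parameters, and the injected `fetch` callable are all hypothetical names, and the exponential backoff schedule is an assumption.

```python
import itertools
import time
from typing import Callable, Optional, Sequence


def fetch_with_retries(
    url: str,
    fetch: Callable[[str, str], str],
    proxies: Sequence[str],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch(url, proxy), switching to the next proxy after each failure.

    `fetch` is a hypothetical callable supplied by the caller (for example a
    thin wrapper around an HTTP client); it is injected so the retry logic
    can be exercised without network access.
    """
    if not proxies:
        raise ValueError("at least one proxy is required")

    proxy_cycle = itertools.cycle(proxies)  # round-robin over the proxy pool
    last_error: Optional[Exception] = None

    for attempt in range(max_attempts):
        proxy = next(proxy_cycle)
        try:
            return fetch(url, proxy)
        except Exception as exc:  # a real scraper would catch narrower errors
            last_error = exc
            if attempt < max_attempts - 1:
                # Assumed schedule: 0.5s, 1s, 2s, ... between attempts.
                sleep(base_delay * 2 ** attempt)

    raise RuntimeError(f"all {max_attempts} attempts failed for {url}") from last_error
```

Passing `sleep` as a parameter keeps the function easy to test; production code would typically also add random jitter to the delay.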

📊 GitHub Stats

🧪 ML & AI Learning Journey

🎯 Current focus: data labeling, feature engineering, small ML models for classification/regression
🧠 Next up: LLM-assisted scraping, RAG for document-heavy sites, agent workflows
📚 Notes & experiments live here → /labs

🗂️ Example Services I Offer

Full-site data extraction (anti-bot aware) → CSV/JSON/DB
PDF/image capture & text extraction (OCR)
API discovery & reverse engineering for private endpoints
Dashboard/API to deliver data (FastAPI + simple UI)
Ongoing monitoring for price changes, stock, new listings
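
As a rough illustration of the monitoring service in the list above, the sketch below compares two scraped snapshots and reports price changes, new listings, and removed items. The snapshot shape (item ID mapped to price) and the function name `diff_snapshots` are assumptions made for the example.

```python
from typing import Dict, List, Tuple

Snapshot = Dict[str, float]  # assumed shape: item ID -> price


def diff_snapshots(old: Snapshot, new: Snapshot) -> Dict[str, list]:
    """Compare two snapshots and summarize what changed between them."""
    new_listings: List[str] = sorted(k for k in new if k not in old)
    removed: List[str] = sorted(k for k in old if k not in new)
    # Each entry is (item ID, old price, new price).
    price_changes: List[Tuple[str, float, float]] = sorted(
        (k, old[k], new[k]) for k in old if k in new and old[k] != new[k]
    )
    return {
        "new_listings": new_listings,
        "removed": removed,
        "price_changes": price_changes,
    }


yesterday = {"sku-1": 10.0, "sku-2": 5.0}
today = {"sku-1": 12.0, "sku-3": 7.0}
report = diff_snapshots(yesterday, today)
```

Here `report["price_changes"]` holds `("sku-1", 10.0, 12.0)`, `sku-3` appears as a new listing, and `sku-2` as removed.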

💌 Need data? Open an issue or reach out!

💬 Connect

🐍 Fun

Made with ❤️, Python, and a lot of headless browsers.

Popular repositories

insurance-charge-predictor Public

This project predicts medical insurance charges based on personal details such as age, gender, BMI, number of children, smoking habits, and region. It uses a Machine Learning model trained on the i…

Jupyter Notebook 3 1
phone-number-extractor Public

A Python script that extracts phone numbers from images using Tesseract OCR and Regex. Automatically organizes processed images into success and failed folders, and saves results to a CSV file.

Python 1
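
The regex step of phone-number-extractor might look something like the sketch below. It is not taken from the repository: the pattern, the 8 to 15 digit filter, and the name `extract_phone_numbers` are assumptions, and the OCR step is skipped by treating the Tesseract output as an ordinary string.

```python
import re
from typing import List

# Assumed pattern: optional "+", a digit, then digits or common separators,
# ending on a digit. Real-world formats may need a stricter pattern.
PHONE_PATTERN = re.compile(r"\+?\d[\d\s().-]{6,}\d")


def extract_phone_numbers(ocr_text: str) -> List[str]:
    """Return unique, normalized phone-like numbers found in OCR output."""
    numbers: List[str] = []
    for match in PHONE_PATTERN.finditer(ocr_text):
        raw = match.group()
        digits = re.sub(r"\D", "", raw)  # drop spaces, dashes, parentheses
        if 8 <= len(digits) <= 15:  # assumed plausibility filter
            normalized = ("+" if raw.startswith("+") else "") + digits
            if normalized not in numbers:
                numbers.append(normalized)
    return numbers


sample = "Call +880 1712-345678 or (555) 123-4567 today. Order #12345."
found = extract_phone_numbers(sample)
```

For the sample text, `found` contains the two phone numbers in normalized form and skips the short order number.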
pythonicshariful Public