pythonicshariful/README.md


🕸️ What I Do

  • Web scraping at scale with Python (Selenium, Playwright, Scrapy, BeautifulSoup)
  • Automation pipelines for data collection, cleaning, and storage
  • API integrations (REST/GraphQL) and browser automation
  • Data wrangling with Pandas, exporting to CSV/JSON/DB
  • Learning ML & AI to build smarter data products
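
As a rough illustration of the cleaning and export step, here is a minimal sketch. It uses only the standard library `csv` and `json` modules instead of Pandas to stay dependency-free, and the field names (`title`, `url`) and helper names are assumptions made for the example, not taken from any real project.

```python
import csv
import io
import json


def clean_records(rows):
    """Strip whitespace, drop rows without a title, and de-duplicate by URL.

    Assumes every row is a dict with hypothetical "title" and "url" keys.
    """
    seen_urls = set()
    cleaned = []
    for row in rows:
        record = {key: str(value).strip() for key, value in row.items()}
        if not record.get("title") or record.get("url") in seen_urls:
            continue
        seen_urls.add(record.get("url"))
        cleaned.append(record)
    return cleaned


def to_csv(records, fieldnames):
    """Serialize records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_json(records):
    """Serialize records to pretty-printed JSON text."""
    return json.dumps(records, indent=2, ensure_ascii=False)
```

The same cleaned list feeds both exporters, so adding a database writer later would only need one more function with the same input.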

🔧 Tech Stack


✨ Highlights

  • Built bots that extract thousands of pages/day with rotating proxies & retries
  • Designed resilient anti-bot bypass flows (stealth drivers, human-like waits, CAPTCHA solving via third-party services)
  • Delivered clean datasets ready for analysis & model training
  • Currently exploring feature engineering, vector databases, and LLM-powered scraping assistants
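
The "rotating proxies & retries" idea can be sketched roughly as follows. This is a generic pattern under stated assumptions, not the code behind the bots above: `fetch_with_retries` and its parameters are hypothetical names, and the actual HTTP call is passed in as a `fetch(url, proxy)` callable so the sketch stays independent of any particular client library.

```python
import itertools
import time
from typing import Callable, Sequence


def fetch_with_retries(
    url: str,
    fetch: Callable[[str, str], str],
    proxies: Sequence[str],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch(url, proxy), rotating proxies and backing off between failures."""
    if not proxies:
        raise ValueError("at least one proxy is required")
    proxy_cycle = itertools.cycle(proxies)
    last_error = None
    for attempt in range(max_attempts):
        proxy = next(proxy_cycle)  # move to the next proxy on every attempt
        try:
            return fetch(url, proxy)
        except Exception as exc:  # a real scraper would catch narrower errors
            last_error = exc
            if attempt < max_attempts - 1:
                # Exponential backoff: base_delay, 2x, 4x, ...
                sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"all {max_attempts} attempts failed for {url}") from last_error
```

Injecting `sleep` makes the backoff schedule easy to check in tests without actually waiting.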



📊 GitHub Stats


🧪 ML & AI Learning Journey

  • 🎯 Current focus: data labeling, feature engineering, small ML models for classification/regression
  • 🧠 Next up: LLM-assisted scraping, RAG for document-heavy sites, agent workflows
  • 📚 Notes & experiments live here → /labs

🗂️ Example Services I Offer

  • Full-site data extraction (anti-bot aware) → CSV/JSON/DB
  • PDF/image capture & text extraction (OCR)
  • API discovery & reverse engineering for private endpoints
  • Dashboard/API to deliver data (FastAPI + simple UI)
  • Ongoing monitoring for price changes, stock, new listings
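
For the monitoring service, one simple approach is to compare the latest scrape against the previous one. The sketch below assumes each snapshot is a dict mapping a hypothetical item ID to a price in cents; the function name and data shape are illustrative only.

```python
def diff_snapshots(previous, current):
    """Compare two {item_id: price} snapshots and report what changed."""
    changes = {"new": [], "removed": [], "price_changed": []}
    for item_id, price in current.items():
        if item_id not in previous:
            changes["new"].append(item_id)
        elif previous[item_id] != price:
            # Record the old and new price so alerts can show the difference.
            changes["price_changed"].append((item_id, previous[item_id], price))
    changes["removed"] = [item_id for item_id in previous if item_id not in current]
    return changes
```

Running this after each scheduled scrape gives a compact change report that can be emailed, posted to a webhook, or stored alongside the raw data.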

💌 Need data? Open an issue or reach out!


💬 Connect


🐍 Fun

snake animation


Made with ❤️, Python, and a lot of headless browsers.

Popular repositories

  1. insurance-charge-predictor (Public)

    This project predicts medical insurance charges based on personal details such as age, gender, BMI, number of children, smoking habits, and region. It uses a Machine Learning model trained on the i…

    Jupyter Notebook 3 1

  2. phone-number-extractor (Public)

    A Python script that extracts phone numbers from images using Tesseract OCR and Regex. Automatically organizes processed images into success and failed folders, and saves results to a CSV file.

    Python 1

  3. pythonicshariful (Public)