Skip to content

ScrapeIt is a web scraping application built with ReactJS and TailwindCSS. It enables the extraction of HTML content or text from web pages by providing a URL.

License

Notifications You must be signed in to change notification settings

singodiyashubham87/ScrapeIt

Repository files navigation

ScrapeIt - Web Scraper App

ScrapeIt is a web scraper app that allows you to extract text or HTML content from web pages by providing a URL.

Tech Stack

  • React: Frontend framework for building user interfaces.
  • Auth0: Authentication and authorization management.
  • Axios: HTTP client for making API requests.
  • Tailwind: For styling the components.
  • DOMPurify: Library for sanitizing and preventing XSS vulnerabilities.
  • html2pdf.js: Library for generating PDFs from HTML content.
  • API Ninjas: Third-party API used for web scraping.

Getting Started

Prerequisite

Installation

  1. Clone the repository:
git clone https://github.com/singodiyashubham87/ScrapeIt.git
cd ScrapeIt
  1. Install dependencies:
npm install
  1. Edit the .env file like this and add your Auth0 Credentials and API Ninjas API key in the placeholders:
VITE_AUTH0_DOMAIN="AUTH0_DOMAIN"
VITE_AUTH0_CLIENT_ID="AUTH0_CLIENT_ID"
VITE_AUTH0_REDIRECT_URL="http://localhost:5173"
VITE_API_NINJAS_X_API_KEY="API_NINJAS_X_API_KEY"
  1. Start the app:
npm run dev

Features

  • Log in or log out using Auth0 authentication.
  • Enter a URL to scrape and choose between extracting text or HTML content.
  • Download scraped content as a PDF.

Credits

API Ninjas (https://api.api-ninjas.com/): For providing the web scraping API.

Author

Contributors

License

This project is licensed under the MIT License.

Support

Support the project by starring the repository.

About

ScrapeIt is a web scraping application built with ReactJS and TailwindCSS. It enables the extraction of HTML content or text from web pages by providing a URL.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published