Skip to content

aydin-elif/web-scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web Scraping News Project

A Spring Boot based web scraping project with Jsoup and MongoDB.

📌 Overview

This project is a Spring Boot + Jsoup + MongoDB application that collects AI/Technology news from websites, stores them in MongoDB, and exposes them via a REST API.

✨ Features

  • 🔎 Web scraping with Jsoup
  • 🔄 ETL pipeline (Extract - Transform - Load)
  • 💾 Store scraped news in MongoDB
  • 🌐 Expose data via REST API
  • 📖 Swagger (OpenAPI) documentation

🛠️ Technologies

  • Java 21
  • 🚀 **Spring Boot 3.55
  • 📰 Jsoup
  • 🗄️ MongoDB
  • 📑 Swagger (OpenAPI)

🚀 Getting Started

Prerequisites

  • Java 21
  • Maven 3.55
  • MongoDB (local or Docker)

About

A Spring Boot based web scraping project with Jsoup and MongoDB.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •