Big Data projects for beginners
-
Updated
Jul 15, 2024 - Java
Big Data projects for beginners
Solving Big Data Problems using Spark framework in Java. Running the Project on HDFS clusters (BigData@Polito) to get the results.
基于 Spark Streaming 的电影推荐系统
Hive & Spark SQL extension for Visual Studio Code
The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark
Estimation surge pricing and traffic congestion using Kafka,Hadoop,Spark,Spark-Streaming,My-SQL
End to End Usecase Swagger UI -->Spring MicroServices -->Kafka -->Spark Consumer -->Cassandra DB
SFaker is one data generator.
Big Data Pipeline | Querying Data from Hive Table Phase
This repository contains a project showcasing the use of Big Data technologies in processing and visualizing real-time data from an eCommerce electronics store using tools such as Apache Kafka, Spark Streaming, Spark SQL, HBase, and Plotly
Spark in Action, 2nd edition - chapter 11 - Working with SQL
it is trials for integrating spring boot with spark
An ETL application which is written in Quarkus, Spark SQL Streaming, Neo4j and various types of Databases and stores. It also covers the devops frameworks like Jenkins CI/CD, docker and Kubernetes.
A Health Monitor to simulate receiving and processing large amounts of health metrics from many clients with the goal of efficiently finding aggregate statistics
Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.
To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."