A bash script to install and configure Hadoop DFS, YARN, MapReduce, Apache Hive and Spark on CentOS.
-
Updated
Jun 25, 2017 - Shell
A bash script to install and configure Hadoop DFS, YARN, MapReduce, Apache Hive and Spark on CentOS.
Exploring details of Motor Vehicle Collisions in New York City provided by the Police Department (NYPD).
DCL-700: Big Data Essentials
DoC Spark on minikube from Mac with Docker Desktop
Examples of using Apache Spark MLlib Pipelines and Structured Streaming on version 2.4.0
A rudimentary command line utility for contrasting Apache Spark event logs.
🏆 Spark4You Design patterns
This project integrates real-time data processing and analytics using Apache NiFi, Kafka, Spark, Hive, and AWS services for comprehensive COVID-19 data insights.
Hands-on workshop with Apache Iceberg
bigdatatutorial
Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.
To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."