#

spark-dataframes

Here are 10 public repositories matching this topic...

Thomas-George-T / Movies-Analytics-in-Spark-and-Scala

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

scala movies big-data spark hadoop analytics movielens-data-analysis shell-script dataframes movielens-dataset rdd case-study spark-sql spark-programs spark-dataframes big-data-analytics spark-scala big-data-projects spark-rdd

Updated May 19, 2021
Scala

spider-123-eng / Spark

Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .

streaming consumer parquet kafka-producer spark-sql spark-kafka-integration spark-streaming-data spark-transformations spark-to-cassandra-connection spark-dataframes spark-joins spark-hive-context spark-jdbc-connection spark-with-mangodb spark-aggregations-using-dataframe spark-use-cases cassandra-installation spark-datadog spark-mangodb spark-catalog-api

Updated Nov 16, 2022
Scala

yennanliu / spark-etl-pipeline

Various data stream/batch process demo with Apache Scala Spark 🚀

docker dockerfile scala twitter spark apache-spark sbt pipeline stream-processing sbt-plugin spark-streaming sbt-assembly spark-sql spark-dataframes spark-batch spark-rdd

Updated Feb 28, 2020
Scala

rajeshsantha / MonitoredStructuredStreaming

Repository for Spark structured streaming use case implementations.

scala kafka apache-spark spark-streaming spark-dataframes spark-streaming-kafka spark-structured-streaming

Updated Apr 13, 2020
Scala

thenickben / SplitCSV-Spark

Big Data - Split a large CSV file into N smaller ones and save them into the local disk

scala big-data spark spark-dataframes

Updated Nov 3, 2018
Scala

NashTech-Labs / spark-dataframes-meetup

meetup scala spark sbt spark-dataframes knoldus

Updated Apr 4, 2016
Scala

afzals2000 / spark-bigquery-parallel

Spark BigQuery Parallel

bigquery spark apache-spark pyspark google-cloud-platform spark-sql spark-dataframes spark-scala pyspark-python

Updated Jan 24, 2019
Scala

jrgito / SparkApp

make easier the use of columnar spark files

spark easy-to-use spark-dataframes dataframes-api sparkapp spark-method easy-spark

Updated Jan 2, 2018
Scala

codyle50 / spark-bigquery-parallel

bigquery spark apache-spark pyspark google-cloud-platform spark-sql spark-dataframes pyspark-python

Updated Oct 23, 2023
Scala

SevakAvet / spark-session-enricher

Calculate user sessions & stats on top of them for imaginary ecom site using Spark sql & aggregations

ecommerce scala spark pet-project spark-sql scala-spark spark-dataframes petproject spark-dataset sessionize

Updated Sep 9, 2019
Scala

Improve this page

Add a description, image, and links to the spark-dataframes topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spark-dataframes topic, visit your repo's landing page and select "manage topics."