dataframe
Here are 42 public repositories matching this topic...
Spark implementation of haversine formula
-
Updated
Sep 14, 2021 - Scala
A better UDF API for Spark SQL
-
Updated
Sep 4, 2020 - Scala
A Spark/Scala custom function to count nulls, nans and blank values in all the columns of a given table or dataframe.
-
Updated
May 7, 2020 - Scala
Using property testing to feel out the input space of "delimited text file"
-
Updated
Jan 12, 2022 - Scala
Spark example project, written in Scala. It aggregates electrical energy consumption in a given time window.
-
Updated
Jun 2, 2021 - Scala
Easy implementation of Apache Spark Streaming for dataframes
-
Updated
May 1, 2019 - Scala
A library for using Camunda DMN in Big Data projects with Apache Spark
-
Updated
Apr 21, 2023 - Scala
Update PubMed articles daily on HDFS by using Spark Cluster
-
Updated
Nov 18, 2022 - Scala
🛠️ Template to do data processing with Scala and Apache Spark ✨
-
Updated
Dec 31, 2021 - Scala
Some simple, kinda introductory projects based on Apache Spark to be used as guides in order to make the whole DataFrame data management look less weird or complex.
-
Updated
May 11, 2021 - Scala
Work with a set of Tweets about US airlines and examine their sentiment polarity.The aim is to learn to classify Tweets as either “positive”, “neutral”, or “negative” by using two classifiers and pipelines for pre-processing and model building.
-
Updated
Aug 7, 2019 - Scala
an SBT controlled project for Spark in Scala template
-
Updated
Aug 7, 2022 - Scala
Improve this page
Add a description, image, and links to the dataframe topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dataframe topic, visit your repo's landing page and select "manage topics."