Here are 35 public repositories matching this topic...

Simple and Distributed Machine Learning
Updated Jun 6, 2024 · Scala

State of the Art Natural Language Processing
Updated Jun 7, 2024 · Scala

Sparkling Water provides H2O functionality inside Spark cluster
Updated May 27, 2024 · Scala

Big Data Processing Framework - Unified Data API or SQL on Any Storage
Updated Nov 23, 2023 · Scala

Isolation Forest on Spark
Updated Nov 11, 2022 · Scala

Apache Spark Connector for Azure Cosmos DB
Updated May 20, 2024 · Scala

A library that provides useful extensions to Apache Spark and PySpark.
Updated May 31, 2024 · Scala

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Updated Feb 27, 2024 · Scala

An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Updated Apr 27, 2023 · Scala

A library for Spark DataFrame using MinIO Select API
Updated Sep 27, 2019 · Scala

Real-world Spark pipelines examples
Updated Feb 27, 2018 · Scala

jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
Updated Feb 13, 2019 · Scala

A connector for Apache Spark and PySpark to Dgraph databases.
Updated Jun 5, 2024 · Scala

Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Updated Apr 12, 2023 · Scala

Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Updated May 27, 2024 · Scala

FITS data source for Spark SQL and DataFrames
Updated Apr 12, 2023 · Scala

Updated Jun 22, 2020 · Scala

A repository of Apache Spark projects, training projects, and tutorials, in both Scala and Python.
Updated Sep 15, 2021 · Scala

Spark implementation of Slowly Changing Dimension type 2
Updated Jan 8, 2019 · Scala

Updated Apr 21, 2023 · Scala