11 stars written in Scala
Apache Spark - A unified analytics engine for large-scale data processing

Scala · 40,692 stars · 28,539 forks · Updated Mar 7, 2025

A Scala API for Apache Beam and Google Cloud Dataflow.

Scala · 2,583 stars · 513 forks · Updated Mar 5, 2025

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Scala · 1,443 stars · 439 forks · Updated Mar 7, 2025

High performance data store solution

Scala · 1,435 stars · 703 forks · Updated Mar 2, 2025

A Time Series Library for Apache Spark

Scala · 1,014 stars · 185 forks · Updated Jul 3, 2020

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala · 1,006 stars · 315 forks · Updated Oct 5, 2022

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

Scala · 661 stars · 123 forks · Updated Feb 6, 2014

Simplifying robust end-to-end machine learning on Apache Spark.

Scala · 470 stars · 117 forks · Updated Apr 18, 2017

Advanced Analytics Engine for NoSQL Data

Scala · 402 stars · 64 forks · Updated Nov 15, 2013

Distributed decision tree ensemble learning in Scala

Scala · 392 stars · 50 forks · Updated Jan 9, 2019

Lightweight, functional and correct time-series library for Scala. Easy manipulation, filtering and combination of time-series data.

Scala · 30 stars · 4 forks · Updated Jan 13, 2022