
- All languages
- AsciiDoc
- C
- C#
- C++
- CSS
- CartoCSS
- Clojure
- CoffeeScript
- Cython
- Dart
- Dockerfile
- EJS
- Elixir
- Elm
- Go
- HCL
- HTML
- Haskell
- Java
- JavaScript
- Jinja
- Jupyter Notebook
- Lua
- MDX
- Makefile
- Mako
- Markdown
- Nim
- Nix
- Objective-C++
- PHP
- PLpgSQL
- Perl
- PowerShell
- Python
- R
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Starlark
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
Starred repositories
Free and Open Source, Distributed, RESTful Search Engine
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…
Apache Druid: a high performance real-time analytics database.
Tink is a multi-language, cross-platform, open source library that provides cryptographic APIs that are secure, easy to use correctly, and hard(er) to misuse.
OpenRefine is a free, open source power tool for working with messy data and improving it
Apache Beam is a unified programming model for Batch and Streaming data processing.
Open source routing engine for OpenStreetMap. Use it as Java library or standalone web server.
Open Location Code is a library to generate short codes, called "plus codes", that can be used as digital addresses where street addresses don't exist.
Supplementary resources for the AWS Lambda Developer Guide
A distributed in-memory data store for the cloud
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
Open Source ML Model Versioning, Metadata, and Experiment Management
Multi Model Server is a tool for serving neural net models for inference
An extensible distributed system for reliable nearline data streaming at scale
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.
AWS libraries/modules for working with Kinesis aggregated record data
The Kinesis Scaling Utility is designed to give you the ability to scale Amazon Kinesis Streams in the same way that you scale EC2 Auto Scaling groups – up or down by a count or as a percentage of …
Access to the AWS IAM accounts via LDAP
Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, processes and aggregates them, and ingests the result to an Amaz…
Java implementation of Thompson sampling to solve the multi-armed bandit problem
The Lambda function S3ObjectLambdaDecompression, is equipped to decompress objects stored in S3 in one of six compressed file formats including bzip2, gzip, snappy, zlib, zstandard and ZIP.
A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark