Kubernetes-native platform to run massively parallel data/streaming jobs
-
Updated
Jul 16, 2024 - Go
Kubernetes-native platform to run massively parallel data/streaming jobs
Platypus is a programming language for Observability Data Pipeline
OpenSource data platform to build event-driven systems. It's like Deebezium for golang :)
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.
GenQL is a generic querying language fully written in Go
Go library to create and manage data pipelines on your machine
Welcome to the Real-time Data Processing using Apache Kafka project! In this project, we will explore the capabilities of Apache Kafka, a powerful and distributed streaming platform, to build a real-time data processing system. Whether you're a beginner or have some experience with Kafka.
Go library for efficient data accumulation and processing.
YAML Runner Go is an application that executes commands based on the rules defined in a YAML file. It provides the flexibility to run commands either once or as a daemon at specific intervals.
A set of plugins (mappers, sinks, etc.) for Numaflow pipelines
The Bhojpur Space is a software-as-a-service product used as a Space Engine based on Bhojpur.NET Platform for application delivery.
go-adflib is designed to simplify adaptive signal processing tasks with golang.
This is the STRM Privacy Command Line Interface, to define and manage your privacy streams, data schemas, event contracts and much more.
Terraform provider for easy and clean data processing (JQ, YQ, Go plugins...).
Grabs all of the audio files from all of the Blinkist books
📦 Service to export companies list
Golang and Python implementations of big data analytics algorithms for finding frequent itemsets from large datasets. These algorithms consists of Apriori and PCY algorithms. These implementations are part of a course project for the Big Data Analytics course at UOIT.
Add a description, image, and links to the data-processing topic page so that developers can more easily learn about it.
To associate your repository with the data-processing topic, visit your repo's landing page and select "manage topics."