Skip to content
View holdenk's full-sized avatar

Organizations

@sparklingpandas @high-performance-spark @scalingpythonml @PigsCanFlyLabs

Block or report holdenk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Encrypt files uploaded to a Django application.

Python 7 1 Updated Jun 19, 2022

Let's RAG it RAW without fancy frameworks

Jupyter Notebook 26 2 Updated Sep 15, 2024

A collection of learning resources for curious software engineers

Python 47,466 3,769 Updated Mar 28, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,352 524 Updated May 3, 2024

pyspark methods to enhance developer productivity 📣 👯 🎉

Python 667 99 Updated Mar 6, 2025

Apache Spark Connect Client for Golang

Go 199 38 Updated Mar 24, 2025

A Python Library to support running data quality rules while the spark job is running⚡

Python 180 47 Updated Mar 18, 2025

A tool to validate data, built around Apache Spark.

Scala 101 34 Updated Mar 28, 2025

8-bit CUDA functions for PyTorch, modified to build on Jetson Xavier

C 14 9 Updated Apr 26, 2023

LLM finetuned for medical question answering

Python 517 63 Updated Sep 7, 2023

English SDK for Apache Spark

Python 859 130 Updated Jun 12, 2024

Python Stream Processing

Python 1,684 76 Updated Mar 27, 2025

A modular implementation of timely dataflow in Rust

Rust 3,403 279 Updated Mar 28, 2025

State of the Art Natural Language Processing

Scala 3,948 722 Updated Mar 30, 2025

Your self-hosted, globally interconnected microblogging community

Ruby 48,013 7,133 Updated Mar 30, 2025

A POC for multilingual UDFs in KSQL

Shell 3 Updated Mar 16, 2019

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Go 38,175 5,899 Updated Mar 29, 2025

Prototype implementation of Service-Level Fault Injection Testing in Python.

Python 70 2 Updated Nov 5, 2022

Replaces the factory firmware on the SwitchBot Plug Mini via OTA, enabling the use of Tasmota without disassembling the unit.

C 117 18 Updated Jul 21, 2024

A Label Printer Application

C 255 34 Updated Mar 24, 2025

lakeFS - Data version control for your data lake | Git for data

Go 4,603 370 Updated Mar 30, 2025
Scala 13 Updated Sep 20, 2023

Java imap nio client that is designed to scale well for thousands of connections per machine and reduce contention when using large number of threads and cpus.

Java 60 50 Updated Feb 11, 2025

Inofficial Qualcomm Firehose / Sahara / Streaming / Diag Tools :)

Python 1,814 417 Updated Mar 23, 2025

Reverse Engineering Furby Connect's Bluetooth Protocol and Update Format

JavaScript 511 83 Updated Jan 16, 2024

Open source version of Arrow Connect Platform developed by Arrow Electronics

Java 6 1 Updated Jan 12, 2023

A PowerDNS pipe dynamic backend to serve dnswall style A, AAAA and PTR DNS records for any given CIDR ranges.

Python 23 10 Updated Aug 5, 2024

Main repository for the Howlr application

JavaScript 47 14 Updated Feb 26, 2022
Kotlin 4 1 Updated Oct 29, 2020
Next
Showing results