Skip to content
View soldni's full-sized avatar
🏳️‍🌈
vibing!
🏳️‍🌈
vibing!

Organizations

@Georgetown-IR-Lab @allenai

Block or report soldni

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Create and run high-performance macOS and Linux VMs on Apple Silicon, with built-in support for AI agents.

Python 2,612 49 Updated Mar 28, 2025

PyTorch building blocks for the OLMo ecosystem

Python 178 31 Updated Mar 29, 2025

GhoulBoii's Firefox Dots

CSS 6 Updated Jan 15, 2025

OLMost every training recipe you need to perform data interventions with the OLMo family of models.

Python 16 5 Updated Mar 29, 2025

Curated list of datasets and tools for post-training.

2,885 252 Updated Jan 29, 2025

Versatile typeface for code, from code.

JavaScript 20,020 598 Updated Mar 29, 2025

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 28,930 762 Updated Mar 28, 2025

😸 Soothing pastel theme for the high-spirited!

TypeScript 16,251 298 Updated Mar 29, 2025

A more intuitive version of du in rust

Rust 9,543 210 Updated Mar 11, 2025

A curated list of resources and examples of ASCII Art

110 8 Updated Apr 24, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,518 204 Updated Feb 12, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,656 713 Updated Mar 28, 2025

LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.

C 540 56 Updated Mar 29, 2025

Large Language Model (LLM) module for the Spezi Ecosystem

Swift 214 23 Updated Mar 19, 2025

BPE modification that implements removing of the intermediate tokens during tokenizer training.

Python 25 1 Updated Nov 25, 2024

A curated list of awesome model based RL resources (continually updated)

1,050 60 Updated Feb 17, 2025

Dockerized iCloud Client - make a local copy of your iCloud documents and photos, and keep it automatically up-to-date.

Python 1,372 55 Updated Mar 24, 2025

Tools for shrinking fastText models (in gensim format)

Jupyter Notebook 178 13 Updated May 3, 2024
Rust 3 2 Updated Jun 10, 2024

[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".

Python 224 20 Updated Aug 28, 2024

GitHub Action to build and push Docker images with Buildx

TypeScript 4,664 594 Updated Mar 12, 2025

Fast bare-bones BPE for modern tokenizer training

Python 151 3 Updated Oct 21, 2024

A javascript text differencing implementation.

JavaScript 8,442 509 Updated Mar 28, 2025

scroll two or more areas simultaneously

JavaScript 234 32 Updated Apr 17, 2015

A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.

Python 271 16 Updated Mar 27, 2025

Run a TryCloudflare tunnel to your flask app right from code.

Python 37 17 Updated Feb 9, 2025

A PyTorch native library for large model training

Python 3,506 325 Updated Mar 28, 2025

Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 637 58 Updated Dec 13, 2024

Wget-compatible web downloader and crawler.

HTML 579 77 Updated Apr 29, 2024

Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no dependencies on Chromium and is meant to run as a command line pr…

Go 67 17 Updated Sep 26, 2024
Next
Showing results