GoDjMike
📈
Putting the ML back in AI

Stars

📏 ML Eval

Benchmark, compare, measure, etc.
34 repositories

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,610 880 Updated Jun 10, 2025

LLM Benchmark for Throughput via Ollama (Local LLMs)

Python 345 41 Updated Jan 17, 2026

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,097 235 Updated Oct 16, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,361 840 Updated Dec 15, 2025

Framework for evaluating ANNS algorithms on billion scale datasets.

Jupyter Notebook 424 136 Updated Dec 17, 2025

Things you can do with the token embeddings of an LLM

Python 1,453 50 Updated Dec 1, 2025

Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, implementing, and testing new search methods. Baguetter support…

Python 208 10 Updated Aug 31, 2024

⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍

Python 656 31 Updated Aug 7, 2025

A language for constraint-guided and efficient LLM programming.

Python 4,156 218 Updated May 22, 2025

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 2,888 252 Updated Mar 2, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 54,216 9,197 Updated Nov 12, 2025

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 926 227 Updated Mar 6, 2026

Facebook AI Performance Evaluation Platform

Python 394 87 Updated Feb 20, 2026

Benchmarking suite for popular AI APIs

Python 88 16 Updated Feb 6, 2025

Website with current metrics on the fastest AI models.

Astro 43 8 Updated Nov 13, 2024

Benchmarking LLMs with Challenging Tasks from Real Users

Python 246 52 Updated Nov 3, 2024

Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search

Python 346 69 Updated Oct 7, 2024

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,210 2,523 Updated Mar 5, 2026

PyTorch library for fast transformer implementations

Python 1,763 191 Updated Mar 23, 2023

Visual inference exploration & experimentation playground

Svelte 97 5 Updated Nov 29, 2024

GPU Trace Visualizer

C++ 878 100 Updated Jan 14, 2026

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,746 1,327 Updated Mar 3, 2026

Evaluation and Tracking for LLM Experiments and AI Agents

Python 3,137 250 Updated Mar 6, 2026

Chat language model that can use tools and interpret the results

Python 1,591 118 Updated Dec 3, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,867 644 Updated Mar 5, 2026

A website where you can compare every AI Model ✨

TypeScript 415 41 Updated Mar 6, 2026

AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.

Python 824 53 Updated Mar 6, 2026

Science-driven chatbot development

Python 63 8 Updated May 5, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 7,104 793 Updated Mar 6, 2026

MLflow automatic tracing for txtai

Python 11 1 Updated Jun 22, 2025