Stars
Build resilient language agents as graphs.
Get your documents ready for gen AI
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
An ONNX-based implementation of the CLIP model that doesn't depend on torch or torchvision.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
The fastest way to create an HTML app
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
A massively parallel, high-level programming language
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.
Create web-based user interfaces with Python. The nice way.
An open source implementation of CLIP.
The OpenTF Manifesto expresses concern over HashiCorp's switch of the Terraform license from open-source to the Business Source License (BSL) and calls for the tool's return to a truly open-source …
Extracts the historic word occurrence of a search term in academic papers
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
♾️ CML - Continuous Machine Learning | CI/CD for ML
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
PyTorch implementation of some learning rate schedulers for deep learning researchers.
PyTorch implementation of the U-Net for image semantic segmentation with high-quality images
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
a.k.a. Octo-Bouncer. Highly precise stepper motor driving with a Teensy 4.0 and a custom pulse-generating algorithm, plus PC-based image processing, with the goal of getting a machine to juggle a ping po…
A minimal Docker, RabbitMQ, Python example.
Find, verify, and analyze leaked credentials
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.