Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
Retrieval and Retrieval-augmented LLMs
yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor
Uncomplicated Observability for Python and beyond! 🪵🔥
sentence-transformers to onnx 让sbert模型推理效率更快
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
The GPT-based Universal Web Scraper MVP is a solution that leverages GPT models and web scraping libraries to generate scraper code based on user input and website analysis, simplifying the web scr…
Seamlessly integrate LLMs into scikit-learn.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
the container setup MLflow Tracking backend with docker compose.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
SGPT: GPT Sentence Embeddings for Semantic Search
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Making large AI models cheaper, faster and more accessible
Open source platform for the machine learning lifecycle
Source code for Twitter's Recommendation Algorithm
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Managing your machine learning lifecycle with MLflow and Amazon SageMaker
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".