- San Francisco Bay Area
-
11:44
- 7h behind - in/chrisjcarini
Data Science / ML / AI
A complete daily plan for studying to become a machine learning engineer.
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Primary Kite repo — private bits replaced with XXXXXXX
Apache Druid: a high performance real-time analytics database.
Whisper as a Service (GUI and API with queuing for OpenAI Whisper)
Robust Speech Recognition via Large-Scale Weak Supervision
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
Roadmapper - A Roadmap as Code (Rac) python library. Generate professional roadmap diagram using python code.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Democratizing Internet-scale financial data.
Chapyter: ChatGPT Code Interpreter in Jupyter Notebooks
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
Open-source tool to visualise your RAG 🔮
Hacky repo to see what the Copilot extension sends to the server
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.