A small and humble repo trying to collect awesome resources about Large Language Models and more!
work in constant progress...
- HuggingFace Models
- xAI
- Mistral
open
Mixtral 8x7Bopen
Mistral 7B
- Anthropic
- OpenAI
- Meta
- Best practices for prompt engineering with the OpenAI API
- GPT-4 Turbo in the OpenAI API
- Knowledge in GPTs
- Retrieval Augmented Generation (RAG) and Semantic Search for GPTs
- OpenAI's Embeddings
cookbook
OpenAI Cookbook- Moderation API
- LangChain - LangChain is a framework for developing applications powered by language models.
docs
LlamaIndex - LlamaIndex is a data framework for LLM-based applications which benefit from context augmentation.
- Unstructured
github
unstructured - Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
docs
LangSmith Docs LangSmith is a platform for building production-grade LLM applications.- Instructor, Generating Structure from LLMs - Instructor makes it easy to reliably get structured data like JSON from Large Language Models
- Tavily AI - Tavily Search API is a search engine optimized for LLMs and RAG, aimed at efficient, quick and persistent search results
docs
Tavily APIdocs
GPT Researchergithub
GPT Researcher GitHub page
- tiktoken - fast BPE tokenizer created by OpenAI.
- Byte pair encoding
tool
ChunkViz v0.1
article
Best 16 Vector Databases for 2024- Vector databases - OpenAI Cookbook
- The Top 5 Vector Databases - DataCamp Article
- A gentle introduction to Vector Databases - Weaviate Blog
- Weaviate.io
- Milvus
- chroma
- qdrant
- Stanford CS25: V3 I Retrieval Augmented Language Models - YT
- RAFT: Adapting Language Model to Domain Specific RAG
- Massive Text Embedding Benchmark from Hugging Face
- Needle In A Haystack - Pressure Testing LLMs
video
Is RAG Really Dead? Testing Multi Fact Retrieval & Reasoning in GPT4-128kgithub
Multi Needle In A Haystack Evaluation + LangSmith - rendered IPyNotebook on Github
- OpenAI Python API library
- OpenAI Node API Library
- tiktoken - tiktoken is a fast BPE tokeniser for use with OpenAI's models.
- A Survey of Large Language Models
- RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
- RAFT: Adapting Language Model to Domain Specific RAG
- RAGAS: Automated Evaluation of Retrieval Augmented Generation
- Attention Is All You Need
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
- Efficient Streaming Language Models with Attention Sinks
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
- A General Theoretical Paradigm to Understand Learning from Human Preferences
- Prolego is an elite team of AI engineers and creative technologists that has been transforming the world's largest companies since 2017.
report
LLM Optimization Playbook
article
RAG makes LLMs better and equal- Advanced RAG Techniques
article
Command R: Retrieval-Augmented Generation at Production Scale - Coherearticle
Introducing Command R+: A Scalable LLM Built for Businessdocs
Command R
- Building a Research Assistant from Scratch - LangChain YT Channel
- Hugging Face + Langchain in 5 mins | Access 200k+ FREE AI models for your AI apps - Jason AI
- All You Need To Know About Running LLMs Locally
...to be continued.