Awesome LLMs

A small and humble repo trying to collect awesome resources about Large Language Models and more!

work in constant progress...

Models

HuggingFace Models
xAI
- open Grok-1
- Grok-1.5
Mistral
- open Mixtral 8x7B
- open Mistral 7B
Anthropic
- docs Welcome to Claude - Anthropic
- api API reference
Google
OpenAI
Meta
- Meta Llama on HuggingFace

How GPT works?

What Is ChatGPT Doing … and Why Does It Work?

Prompt Engineering

Prompt Engineering Guide by DAIR.AI

OpenAI's resources

Frameworks

LangChain - LangChain is a framework for developing applications powered by language models.
- book LangChain AI Handbook By James Briggs & Francisco Ingham - Pinecone
- tutorials LangChain Tutorial page
- cookbook LangChain Cookbook
- LangChain Templates
docs LlamaIndex - LlamaIndex is a data framework for LLM-based applications which benefit from context augmentation.

Tools

Unstructured
- github unstructured - Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Development, testing and deploying AI systems

docs LangSmith Docs LangSmith is a platform for building production-grade LLM applications.
- LangSmith Pricing
Instructor, Generating Structure from LLMs - Instructor makes it easy to reliably get structured data like JSON from Large Language Models

for AI Agents

Tavily AI - Tavily Search API is a search engine optimized for LLMs and RAG, aimed at efficient, quick and persistent search results
- docs Tavily API
- docs GPT Researcher
- github GPT Researcher GitHub page

Tokenizers and tokenization

tiktoken - fast BPE tokenizer created by OpenAI.
Byte pair encoding

Embeddings

The Ins and Outs of Working with Embeddings and Embedding Models

Text splitting

tool ChunkViz v0.1

Vector Databases

RAG and giving models a long-term memory

Forecasting and Time Series

Time-LLM: Reprogram an LLM for Time Series Forecasting

LLM Testing and Evaluation

Needle In A Haystack - Pressure Testing LLMs
video Is RAG Really Dead? Testing Multi Fact Retrieval & Reasoning in GPT4-128k
- github Multi Needle In A Haystack Evaluation + LangSmith - rendered IPyNotebook on Github

LLM Security

What Anthropic’s Sleeper Agents study means for LLM apps

Libraries

OpenAI Python API library
OpenAI Node API Library
tiktoken - tiktoken is a fast BPE tokeniser for use with OpenAI's models.

(Academic) Papers

Managing costs and resources

The REAL cost of LLM (And How to reduce 78%+ of Cost)

Articles

article Preference Tuning LLMs with Direct Preference Optimization Methods

Other

Named Entity Recognition (NER)
- @Wikipedia
- What is Named Entity Recognition (NER)? Methods, Use Cases, and Challenges - DataCamp

Companies and their AI solutions/products

Prolego is an elite team of AI engineers and creative technologists that has been transforming the world's largest companies since 2017.
- report LLM Optimization Playbook

Learn - standing on the shoulders of giants

Big text files

shakespeare.txt - Complete Work of William Shakespeare

RAG

article RAG makes LLMs better and equal
Advanced RAG Techniques
article Command R: Retrieval-Augmented Generation at Production Scale - Cohere
article Introducing Command R+: A Scalable LLM Built for Business
docs Command R

Videos

...to be continued.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
img		img
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome LLMs

Models

How GPT works?

Prompt Engineering

OpenAI's resources

Frameworks

Tools

Development, testing and deploying AI systems

for AI Agents

Tokenizers and tokenization

Embeddings

Text splitting

Vector Databases

RAG and giving models a long-term memory

Forecasting and Time Series

LLM Testing and Evaluation

LLM Security

Libraries

(Academic) Papers

Managing costs and resources

Articles

Other

Companies and their AI solutions/products

Learn - standing on the shoulders of giants

Big text files

RAG

Videos

About

Releases

Packages

License

mfaron-CKPL/awesome-llms

Folders and files

Latest commit

History

Repository files navigation

Awesome LLMs

Models

How GPT works?

Prompt Engineering

OpenAI's resources

Frameworks

Tools

Development, testing and deploying AI systems

for AI Agents

Tokenizers and tokenization

Embeddings

Text splitting

Vector Databases

RAG and giving models a long-term memory

Forecasting and Time Series

LLM Testing and Evaluation

LLM Security

Libraries

(Academic) Papers

Managing costs and resources

Articles

Other

Companies and their AI solutions/products

Learn - standing on the shoulders of giants

Big text files

RAG

Videos

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages