SpaCy
NLTK
TextBlob
Flair
Sentence-Transformers
Large Language Models: A Survey
2024.02 - Shervin Minaee
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
2024.05 - Ge Zhang - M-A-P
LLMs From Scratch: Hands-on Building Your Own Large Language Models
2024.01 - Kewei Chen
Contextual Position Encoding: Learning to Count What’s Important
2024.05 - Olga Golovneva - FAIR at Meta
Prompt Engineering: Technique Taxonomy and Prompt Tuning, All in One Article
2024.04 - 山行AI
How I Won the GPT-4 Prompt Engineering Competition
2023.12 - Sheila Teo
12 Prompt Engineering Methods
2023.12 - DC数字人才
Large Language Models for Information Retrieval: A Survey
2023.08 - Yutao Zhu - Renmin University of China
A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models
2024.05 - Yujuan Ding - The Hong Kong Polytechnic University
Embedchain
2023 - An open-source framework for personalizing LLM responses.
Mastering RAG: How To Architect An Enterprise RAG System
2024.01 - Pratik Bhavsar - Galileo Labs
Query Rewriting for Retrieval-Augmented Large Language Models
2023.05 - Xinbei Ma - Shanghai Jiao Tong University
A Comprehensive Survey of Sentence Representations: From the BERT Epoch to the ChatGPT Era and Beyond
2024.02 - Abhinav Ramesh Kashyap - ASUS Intelligent Cloud Services (AICS), Singapore
C-Pack: Packed Resources For General Chinese Embeddings
2023.09 - Shitao Xiao - Beijing Academy of AI
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
2019.08 - Nils Reimers - Technische Universität Darmstadt
SPLADE-v3: New baselines for SPLADE
2024.03 - Carlos Lassance - Cohere
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
2024.01 - Parth Sarthi - Stanford University
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval
2022.03 - Luyu Gao - CMU - [github]
A Detailed Look at Applying Pre-trained Models to First-Stage Information Retrieval
2021.11 - Yixing Fan - Chinese Academy of Sciences
RetroMAE
2022.05 - Shitao Xiao - Beijing University of Posts and Telecommunications
A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE
2024.03 - Hervé Déjean - Naver Labs Europe
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
2021.12 - Keshav Santhanam - Stanford University
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
2020.04 - Omar Khattab - Stanford University
uniem
A repository with code for training a Chinese reranker.
RAG-Retrieval
A repository with unified code for rerankers.
Mastering RAG: How to Select A Reranking Model
2024.03 - Pratik Bhavsar - Galileo Labs
Streamlit
Gradio
Chainlit - Build a ChatGPT-like UI
GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
2024.05 - Costas Mavromatis - University of Minnesota