Skip to content

yaoweihu/NLP-Resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

75 Commits
 
 

Repository files navigation

NLP-Resource

Content

General NLP Packages

SpaCy
NITK
TextBlob
Flair
Sentence-Transformers

LLM

Large Language Models: A Survey
2024.02 - Shervin Minaee
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
2024.05 - Ge Zhang - M-A-P
LLMs From Scratch: Hands-on Building Your Own Large Language Models
2024.01 - Kewei Chen

Components

Contextual Position Encoding: Learning to Count What’s Important
2024.05 - Olga Golovneva - FAIR at Meta

Prompt Engineering

提示工程(prompt engineering):技术分类与提示词调优看这篇就够了
2024.04 - 山行AI
我是如何赢得GPT-4提示工程大赛冠军的
2023.12 - Sheila Teo
12种Prompt Engineering(提示工程)方法
2023.12 - DC数字人才

RAG (Retrieval-Augmented Generation)

Large Language Models for Information Retrieval: A Survey
2023.08 - Zhuyu Tao - Renmin University of China
A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models
2024.05 - Yujuan Ding - The Hong Kong Polytechnic University
Embedchain - 2023
2023 - Embedchain is an Open Source Framework for personalizing LLM responses.
Mastering RAG: How To Architect An Enterprise RAG System
2024.01 - Pratik Bhavsar - Galileo Labs

Preprocessing

Query Rewriting for Retrieval-Augmented Large Language Models
2023.05 - Xinbei Ma - Shanghai Jiao Tong University

Embedder

A Comprehensive Survey of Sentence Representations: From the BERT Epoch to the ChatGPT Era and Beyond
2024.02 - Abhinav Ramesh Kashyap - ASUS Intelligent Cloud Services (AICS), Singapore
C-Pack: Packed Resources For General Chinese Embeddings
2023.09 - Shitao Xiao - Beijing Academy of AI
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
2019.08 - Nils Reimers - Technische Universitat Darmstadt

Retriever

SPLADE-v3: New baselines for SPLADE
2024.03 - Carlos Lassance - Cohere
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
2024.01 - Parth Sarthi - Stanford University
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval
2022.03 - Luyu Gao - CMU - [github]
详解预训练模型在信息检索第一阶段的应用
2021.11 - 范意兴 - 中科院
RetroMAE
2022.05 - Shitao Xiao - Beijing University of Posts and Telecommunications

Reranker

A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE
2024.03 - Hervé Déjean - Naver Labs Europe
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
2021.12 - Keshav Santhanam - Stanford University
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
2020.04 - Omar Khattab - Stanford University

uniem
A repo includes codes of training a chinese reranker.
RAG-Retrieval
A repo includes unified codes for rerankers.

Mastering RAG: How to Select A Reranking Model
2024.03 - Pratik Bhavsar - Galileo Labs

UI Framewrok

Streamlit
Gradio
ChainLit - Build the ChatGPT-like UI

VectorDB

ChromaDB

GNN

GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
2024.05 - Costas Mavromatis - University of Minnesota

Search

Perplexica - An AI-powered search engine

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published