An Open-Source Engineering Guide for Prompt-in-context-learning from EgoAlpha Lab.


📝 Papers | ⚡️ Playground | 🛠 Prompt Engineering | 🌍 ChatGPT Prompt | ⛳ LLMs Usage Guide
⭐️ Shining ⭐️: This is fresh, daily-updated resources for in-context learning and prompt engineering. As Artificial General Intelligence (AGI) is approaching, let’s take action and become a super learner so as to position ourselves at the forefront of this exciting era and strive for personal and professional greatness.
The resources include:
🎉Papers🎉: The latest papers about In-Context Learning, Prompt Engineering, Agent, and Foundation Models.
🎉Playground🎉: Large language models(LLMs)that enable prompt experimentation.
🎉Prompt Engineering🎉: Prompt techniques for leveraging large language models.
🎉ChatGPT Prompt🎉: Prompt examples that can be applied in our work and daily lives.
🎉LLMs Usage Guide🎉: The method for quickly getting started with large language models by using LangChain.
In the future, there will likely be two types of people on Earth (perhaps even on Mars, but that's a question for Musk):
- Those who enhance their abilities through the use of AIGC;
- Those whose jobs are replaced by AI automation.
💎EgoAlpha: Hello! human👤, are you ready?
Deep Learning-based Code Reviews: A Paradigm Shift or a Double-Edged Sword? (New)
Rosalia Tufano,Alberto Martin-Lopez,Ahmad Tayeb,Ozren Dabić,Sonia Haiduc,etc - [arXiv]
Knowledge Graph Guided Evaluation of Abstention Techniques (New)
Kinshuk Vasisht,Navreet Kaur,Danish Pruthi - [arXiv]
Memorization and Knowledge Injection in Gated LLMs (New)
Xu Pan,Ely Hahami,Zechen Zhang,Haim Sompolinsky - [arXiv]
End-to-End Conformal Calibration for Optimization Under Uncertainty
Christopher Yeh,Nicolas Christianson,Alan Wu,Adam Wierman,Yisong Yue - [arXiv]
A Practical Examination of AI-Generated Text Detectors for Large Language Models (New)
Brian Tufts,Xuandong Zhao,Lei Li - [arXiv]
SkyReels-V2: Infinite-length Film Generative Model
Guibin Chen,Dixuan Lin,Jiangping Yang,Chunze Lin,Junchen Zhu,etc - [arXiv]
MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance
Jia Xu,Tianyi Wei,Bojian Hou,Patryk Orzechowski,Shu Yang,etc - [arXiv]
Sleep-time Compute: Beyond Inference Scaling at Test-time
Kevin Lin,Charlie Snell,Yu Wang,Charles Packer,Sarah Wooders,etc - [arXiv]
End-to-End Conformal Calibration for Optimization Under Uncertainty
Christopher Yeh,Nicolas Christianson,Alan Wu,Adam Wierman,Yisong Yue - [arXiv]
MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance
Jia Xu,Tianyi Wei,Bojian Hou,Patryk Orzechowski,Shu Yang,etc - [arXiv]
BitNet b1.58 2B4T Technical Report
Shuming Ma,Hongyu Wang,Shaohan Huang,Xingxing Zhang,Ying Hu,etc - [arXiv]
Sleep-time Compute: Beyond Inference Scaling at Test-time
Kevin Lin,Charlie Snell,Yu Wang,Charles Packer,Sarah Wooders,etc - [arXiv]
End-to-End Conformal Calibration for Optimization Under Uncertainty (New)
Christopher Yeh,Nicolas Christianson,Alan Wu,Adam Wierman,Yisong Yue - [arXiv]
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Gemini Team,Petko Georgiev,Ving Ian Lei,Ryan Burnell,Libin Bai,etc - [arXiv]
SkyReels-V2: Infinite-length Film Generative Model (New)
Guibin Chen,Dixuan Lin,Jiangping Yang,Chunze Lin,Junchen Zhu,etc - [arXiv]
MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance (New)
Jia Xu,Tianyi Wei,Bojian Hou,Patryk Orzechowski,Shu Yang,etc - [arXiv]
Sleep-time Compute: Beyond Inference Scaling at Test-time (New)
Kevin Lin,Charlie Snell,Yu Wang,Charles Packer,Sarah Wooders,etc - [arXiv]
System of Agentic AI for the Discovery of Metal-Organic Frameworks (New)
Theo Jaffrelot Inizan,Sherry Yang,Aaron Kaplan,Yen-hsu Lin,Jian Yin,etc - [arXiv]
One Model to Rig Them All: Diverse Skeleton Rigging with UniRig
Jia-Peng Zhang,Cheng-Feng Pu,Meng-Hao Guo,Yan-Pei Cao,Shi-Min Hu - [arXiv]
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem (New)
Vladimir Malinovskii,Andrei Panferov,Ivan Ilin,Han Guo,Peter Richtárik,etc - [arXiv]
BitNet b1.58 2B4T Technical Report (New)
Shuming Ma,Hongyu Wang,Shaohan Huang,Xingxing Zhang,Ying Hu,etc - [arXiv]
Adaptive AI decision interface for autonomous electronic material discovery (New)
Yahao Dai,Henry Chan,Aikaterini Vriza,Fredrick Kim,Yunfei Wang,etc - [arXiv]
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context (New)
Gemini Team,Petko Georgiev,Ving Ian Lei,Ryan Burnell,Libin Bai,etc - [arXiv]
PooDLe: Pooled and dense self-supervised learning from naturalistic videos (New)
Alex N. Wang,Christopher Hoang,Yuwen Xiong,Yann LeCun,Mengye Ren - [arXiv]
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework (New)
Jiale Tao,Yanbing Zhang,Qixun Wang,Yiji Cheng,Haofan Wang,etc - [arXiv]
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation (New)
Xiangyan Liu,Jinjie Ni,Zijian Wu,Chao Du,Longxu Dou,etc - [arXiv]
One Model to Rig Them All: Diverse Skeleton Rigging with UniRig (New)
Jia-Peng Zhang,Cheng-Feng Pu,Meng-Hao Guo,Yan-Pei Cao,Shi-Min Hu - [arXiv]
Byte Latent Transformer: Patches Scale Better Than Tokens (New)
Artidoro Pagnoni,Ram Pasunuru,Pedro Rodriguez,John Nguyen,Benjamin Muller,etc - [arXiv]
Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations (New)
Yewon Kim,Sung-Ju Lee,Chris Donahue - [arXiv]
Language Model Alignment in Multilingual Trolley Problems (New)
Zhijing Jin,Max Kleiman-Weiner,Giorgio Piatti,Sydney Levine,Jiarui Liu,etc - [arXiv]
Survey on Evaluation of LLM-based Agents (New)
Asaf Yehudai,Lilach Eden,Alan Li,Guy Uziel,Yilun Zhao,etc - [arXiv]
Concise Reasoning via Reinforcement Learning (New)
Mehdi Fatemi,Banafsheh Rafiee,Mingjie Tang,Kartik Talamadupula - [arXiv]
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? (New)
Yoshua Bengio,Michael Cohen,Damiano Fornasiere,Joumana Ghosn,Pietro Greiner,etc - [arXiv]
Optimizing Safe and Aligned Language Generation: A Multi-Objective GRPO Approach (New)
Xuying Li,Zhuo Li,Yuji Kosuga,Victor Bian - [arXiv]
Agent S: An Open Agentic Framework that Uses Computers Like a Human (New)
Saaket Agashe,Jiuzhou Han,Shuyu Gan,Jiachen Yang,Ang Li,etc - [arXiv]
Ctrl-Z: Controlling AI Agents via Resampling (New)
Aryan Bhatt,Cody Rushing,Adam Kaufman,Tyler Tracy,Vasil Georgiev,etc - [arXiv]
You can directly click on the title to jump to the corresponding PDF link location
Motion meets Attention: Video Motion Prompts (2024.07.03)
Towards a Personal Health Large Language Model (2024.06.10)
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning (2024.06.10)
Towards Lifelong Learning of Large Language Models: A Survey (2024.06.10)
Towards Semantic Equivalence of Tokenization in Multimodal LLM (2024.06.07)
LLMs Meet Multimodal Generation and Editing: A Survey (2024.05.29)
Tool Learning with Large Language Models: A Survey (2024.05.28)
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models (2024.05.16)
Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach (2024.04.24)
A Survey on the Memory Mechanism of Large Language Model based Agents (2024.04.21)
👉Complete paper list 🔗 for "Survey"👈
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy (2024.06.28)
Dataset Size Recovery from LoRA Weights (2024.06.27)
Dual-Phase Accelerated Prompt Optimization (2024.06.19)
VoCo-LLaMA: Towards Vision Compression with Large Language Models (2024.06.18)
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation (2024.06.18)
The Impact of Initialization on LoRA Finetuning Dynamics (2024.06.12)
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models (2024.06.07)
Cross-Context Backdoor Attacks against Graph Prompt Learning (2024.05.28)
Yuan 2.0-M32: Mixture of Experts with Attention Router (2024.05.28)
👉Complete paper list 🔗 for "Prompt Design"👈
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models (2024.06.07)
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM (2024.04.24)
Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models (2024.04.04)
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought (2024.04.04)
Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models (2024.03.25)
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science (2024.03.21)
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning (2024.03.12)
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis (2024.03.11)
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought (2024.03.08)
👉Complete paper list 🔗 for "Chain of Thought"👈
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation (2024.06.18)
The Impact of Initialization on LoRA Finetuning Dynamics (2024.06.12)
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models (2024.06.07)
Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning (2024.06.04)
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks (2024.06.04)
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models (2024.05.28)
Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion (2024.05.19)
MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning (2024.05.19)
Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning (2024.04.25)
Stronger Random Baselines for In-Context Learning (2024.04.19)
👉Complete paper list 🔗 for "In-context Learning"👈
Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning (2024.06.24)
Enhancing RAG Systems: A Survey of Optimization Strategies for Performance and Scalability (2024.06.04)
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training (2024.05.31)
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection (2024.05.25)
DocReLM: Mastering Document Retrieval with Language Model (2024.05.19)
UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models (2024.05.16)
ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning (2024.05.07)
REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs (2024.05.03)
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation (2024.04.10)
Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models (2024.04.04)
👉Complete paper list 🔗 for "Retrieval Augmented Generation"👈
CELLO: Causal Evaluation of Large Vision-Language Models (2024.06.27)
PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation (2024.06.26)
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models (2024.06.24)
OR-Bench: An Over-Refusal Benchmark for Large Language Models (2024.05.31)
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models (2024.05.28)
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models (2024.05.16)
Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark (2024.05.10)
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models (2024.05.03)
Causal Evaluation of Language Models (2024.05.01)
👉Complete paper list 🔗 for "Evaluation & Reliability"👈
Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks (2024.07.03)
Symbolic Learning Enables Self-Evolving Agents (2024.06.26)
Adversarial Attacks on Multimodal Agents (2024.06.18)
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning (2024.06.14)
Transforming Wearable Data into Health Insights using Large Language Model Agents (2024.06.10)
Neuromorphic dreaming: A pathway to efficient learning in artificial agents (2024.05.24)
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning (2024.05.16)
Learning Multi-Agent Communication from Graph Modeling Perspective (2024.05.14)
Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning (2024.05.09)
Unveiling Disparities in Web Task Handling Between Human and Web Agent (2024.05.07)
👉Complete paper list 🔗 for "Agent"👈
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output (2024.07.03)
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy (2024.06.28)
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs (2024.06.28)
LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression (2024.06.28)
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs (2024.06.24)
VoCo-LLaMA: Towards Vision Compression with Large Language Models (2024.06.18)
Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models (2024.06.12)
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models (2024.06.07)
Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning (2024.06.04)
DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models (2024.05.31)
👉Complete paper list 🔗 for "Multimodal Prompt"👈
IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization (2024.07.03)
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs (2024.06.28)
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding (2024.06.27)
Adversarial Search Engine Optimization for Large Language Models (2024.06.26)
VideoLLM-online: Online Video Large Language Model for Streaming Video (2024.06.17)
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs (2024.06.14)
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation (2024.06.10)
PaCE: Parsimonious Concept Engineering for Large Language Models (2024.06.06)
Yuan 2.0-M32: Mixture of Experts with Attention Router (2024.05.28)
👉Complete paper list 🔗 for "Prompt Application"👈
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts (2024.07.03)
Pedestrian 3D Shape Understanding for Person Re-Identification via Multi-View Learning (2024.07.01)
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs (2024.06.28)
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding (2024.06.27)
Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? (2024.06.27)
Efficient World Models with Context-Aware Tokenization (2024.06.27)
The Remarkable Robustness of LLMs: Stages of Inference? (2024.06.27)
ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models (2024.06.26)
AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation (2024.06.18)
Unveiling Encoder-Free Vision-Language Models (2024.06.17)
👉Complete paper list 🔗 for "Foundation Models"👈


Large language models (LLMs) are becoming a revolutionary technology that is shaping the development of our era. Developers can create applications that were previously only possible in our imaginations by building LLMs. However, using these LLMs often comes with certain technical barriers, and even at the introductory stage, people may be intimidated by cutting-edge technology: Do you have any questions like the following?
- ❓ How can LLM be built using programming?
- ❓ How can it be used and deployed in your own programs?
💡 If there was a tutorial that could be accessible to all audiences, not just computer science professionals, it would provide detailed and comprehensive guidance to quickly get started and operate in a short amount of time, ultimately achieving the goal of being able to use LLMs flexibly and creatively to build the programs they envision. And now, just for you: the most detailed and comprehensive Langchain beginner's guide, sourced from the official langchain website but with further adjustments to the content, accompanied by the most detailed and annotated code examples, teaching code lines by line and sentence by sentence to all audiences.
Click 👉here👈 to take a quick tour of getting started with LLM.


This repo is maintained by EgoAlpha Lab. Questions and discussions are welcome via helloegoalpha@gmail.com
.
We are willing to engage in discussions with friends from the academic and industrial communities, and explore the latest developments in prompt engineering and in-context learning together.


Thanks to the PhD students from EgoAlpha Lab and other workers who participated in this repo. We will improve the project in the follow-up period and maintain this community well. We also would like to express our sincere gratitude to the authors of the relevant resources. Your efforts have broadened our horizons and enabled us to perceive a more wonderful world.