GitHub - clqing/SurveyX: Academic Survey Paper Generation.

SurveyX: Academic Survey Automation via Large Language Models

✨Welcome to SurveyX! This GitHub repository serves as a channel for users to submit requests for paper generation based on specific topics or domains.📚

🚀 We're actively developing a full-featured product with a sleek graphical interface!

⭐ Star this repo to stay updated and be the first to know about our progress and release announcements!

💡 Your support means everything to us as we work to bring this innovative solution to life. Stay tuned for more updates!

⭐Star History

🤔What is SurveyX?

SurveyX is an advanced academic survey automation system that leverages the power of Large Language Models (LLMs) to generate high-quality, domain-specific academic papers and surveys.🚀

By simply providing a paper title and keywords for literature retrieval, users can request comprehensive academic papers or surveys tailored to specific topics.

If you're curious about how SurveyX works or want to understand the underlying technology and methodology, feel free to check out our 📑website, where we provide an in-depth explanation of the system's architecture, data processing methods, and experimental results.

🤔 What’s This Git For?

This GitHub repository is designed to provide a platform where users can request the generation of high-quality, domain-specific academic surveys by simply submitting an issue. The main purpose of this repository is to allow users to easily create and receive tailored academic surveys or papers, which are generated using SurveyX📄

By submitting an issue with a paper title and keywords for literature search, users can quickly generate a comprehensive review paper or survey on a specific topic. This process streamlines academic research by automating paper creation, saving users time and effort in compiling research content. 📚💡

🖋️How to Request a Custom Paper via Issue

To request a paper, create a new issue with the following details:

Paper Title: Provide the title of the paper you need.
Keywords for Literature Search: Provide keywords separated by commas that will help retrieve relevant literature and guide the content generation (e.g. "AI in healthcare, climate change impact on agriculture, ethical implications of AI"). We recommend that the number of keywords be limited to 4 ~ 5 for optimal results.
Your email(optional): Please provide your email address so that we can notify you promptly once the paper is ready.

💬Example Issue Submission:

Title: Controllable text generation of LLM: A Survey

Keywords: AI, healthcare, ethical implications, technology adoption, AI-driven diagnostics

Email: xxxxxxxx@SurveyX.cn

Once your request is submitted, the generated paper will be placed in the user requests folder. Please allow 1-2 business days for processing and generation. ⏳

📝Generated Topics

Examples Papers

Title	Keywords
From BERT to GPT-4: A Survey of Architectural Innovations in Pre-trained Language Models	Transformer, BERT, GPT-3, self-attention, masked language modeling, cross-lingual transfer, model scaling
Unsupervised Cross-Lingual Word Embedding Alignment: Techniques and Applications	low-resource NLP, few-shot learning, data augmentation, unsupervised alignment, synthetic corpora, NLLB, zero-shot transfer
Vision-Language Pre-training: Architectures, Benchmarks, and Emerging Trends	multimodal learning, CLIP, Whisper, cross-modal retrieval, modality fusion, video-language models, contrastive learning
Efficient NLP at Scale: A Review of Model Compression Techniques	model compression, knowledge distillation, pruning, quantization, TinyBERT, edge computing, latency-accuracy tradeoff
Domain-Specific NLP: Adapting Models for Healthcare, Law, and Finance	domain adaptation, BioBERT, legal NLP, clinical text analysis, privacy-preserving NLP, terminology extraction, few-shot domain transfer
Attention Heads of Large Language Models: A Survey	attention head, attention mechanism, large language model, LLM,transformer architecture, neural networks, natural language processing
Controllable Text Generation for Large Language Models: A Survey	controlled text generation, text generation, large language model, LLM,natural language processing
A survey on evaluation of large language models	evaluation of large language models,large language models assessment, natural language processing, AI model evaluation
Large language models for generative information extraction: a survey	information extraction, large language models, LLM,natural language processing, generative AI, text mining
Internal consistency and self feedback of LLM	Internal consistency, self feedback, large language model, LLM,natural language processing, model evaluation, AI reliability
Review of Multi Agent Offline Reinforcement Learning	multi agent, offline policy, reinforcement learning,decentralized learning, cooperative agents, policy optimization
Reasoning of large language model: A survey	reasoning of large language models, large language models, LLM,natural language processing, AI reasoning, transformer models
Hierarchy Theorems in Computational Complexity: From Time-Space Tradeoffs to Oracle Separations	P vs NP, NP-completeness, polynomial hierarchy, space complexity, oracle separation, Cook-Levin theorem
Classical Simulation of Quantum Circuits: Complexity Barriers and Implications	BQP, quantum supremacy, Shor's algorithm, post-quantum cryptography, QMA, hidden subgroup problem
Kernelization: Theory, Techniques, and Limits	fixed-parameter tractable (FPT), kernelization, treewidth, W-hierarchy, ETH (Exponential Time Hypothesis), parameterized reduction
Optimal Inapproximability Thresholds for Combinatorial Optimization Problems	PCP theorem, approximation ratio, Unique Games Conjecture, APX-hardness, gap-preserving reduction, LP relaxation
Hardness in P: When Polynomial Time is Not Enough	SETH (Strong Exponential Time Hypothesis), 3SUM conjecture, all-pairs shortest paths (APSP), orthogonal vectors problem, fine-grained reduction, dynamic lower bounds
Consistency Models in Distributed Databases: From ACID to NewSQL	CAP theorem, ACID vs BASE, Paxos/Raft, Spanner, NewSQL, sharding, linearizability
Cloud-Native Databases: Architectures, Challenges, and Future Directions	cloud databases, AWS Aurora, Snowflake, storage-compute separation, auto-scaling, pay-per-query, multi-tenancy
Graph Database Systems: Storage Engines and Query Optimization Techniques	graph traversal, Neo4j, SPARQL, property graph, subgraph matching, RDF triplestore, Gremlin
Real-Time Aggregation in TSDBs: Techniques for High-Cardinality Data	time-series data, InfluxDB, Prometheus, downsampling, time windowing, high-cardinality indexing, stream processing
Self-Driving Databases: A Survey of AI-Powered Autonomous Management	autonomous databases, learned indexes, query optimization, Oracle AutoML, workload forecasting, anomaly detection
Multi-Model Databases: Integrating Relational, Document, and Graph Paradigms	multi-model database, MongoDB, ArangoDB, JSONB, unified query language, schema flexibility, polystore
Vector Databases for AI: Efficient Similarity Search and Retrieval-Augmented Generation	vector database, FAISS, Milvus, ANN search, embedding indexing, RAG (Retrieval-Augmented Generation), HNSW
Software-Defined Networking: Evolution, Challenges, and Future Scalability	OpenFlow, control plane/data plane separation, NFV orchestration, network slicing, P4 language, OpenDaylight, scalability bottlenecks
Beyond 5G: Architectural Innovations for Terahertz Communication and Network Slicing	network slicing, MEC (Multi-access Edge Computing), beamforming, mmWave, URLLC (Ultra-Reliable Low-Latency Communication), O-RAN, energy efficiency
IoT Network Protocols: A Comparative Study of LoRaWAN, NB-IoT, and Thread	LPWAN, LoRa, ZigBee 3.0, 6LoWPAN, TDMA scheduling, RPL routing, device density management
Edge Caching in Content Delivery Networks: Algorithms and Economic Incentives	CDN, Akamai, cache replacement policies, DASH (Dynamic Adaptive Streaming), QoE optimization, edge server placement, bandwidth cost reduction
A survey on flow batteries	battery electrolyte formulation
Research on battery electrolyte formulation	flow batteries

User Requested Papers

Title	Keywords
Think and Draw! A survey on Vision-MLLMs that can understand and generate	vision-language models, multimodal learning, generative AI
A Survey of new intent detection and discovery for Conversational Understanding	Out-of-domain Detection， New Intent Discovery， Generalized Category Discovery
A Survey of Segment Anything Model (SAM) in Medical Imaging: Advances in Vision Foundation Models	Segment Anything Model (SAM), Medical Image Segmentation, Vision Foundation Models, Prompt Engineering, Efficient Fine-Tuning
A Survey of joint extraction of medical entities and relations	Medical Entity Recognition, Joint Extraction, Relation Extraction, Biomedical Text Mining, Deep Learning
Reinforcement Learning for Large Language Models: Methods, Challenges, and Applications	Large Language Models, Reinforcement Learning, RLHF, Reward Modeling, AI Alignment, Fine-Tuning, Prompt Optimization, Self-Supervised Learning, Model-based RL, Meta-Reinforcement Learning, AI Agents, Multi-Agent Reinforcement Learning (MARL), Curriculum Learning, Few-Shot Learning, Continual Learning, Adaptive Learning, Human-in-the-Loop Learning
Process Reward Models for LLM Reasoning	Process Reward Model, Reasoning, Large Language Model
Novel Multiferroics coupling ferroelectricity with Skyrmion, altermagnetism or Ferrovalley	Multiferroic, Skyrmion, altermagnetism, Ferrovalley, ferroelectricity
Comparative study of cardiac markers CKMB and LDH in pericardial fluid for postmortem diagnosis	Forensic medicine, Postmortem diagnosis, Sample timing, Cardiac muscle fibers, Myocardial infarction
A Multi-dimensional Perspective on Hybrid Human-Artificial Intelligence: Opportunities and Challenges in the Era of Large Language Models	Hybrid Human-Artificial Intelligence, Large Language Models, Deep Learning, Reinforcement learning
Multimodal fusion with Multimodal Temporal Data for Clinical prediction	Multimodal temporal data, Deep learning in healthcare, Clinical decision support systems (CDSS), Temporal alignment, Uncertainty quantification, Reinforcement learning in medicine, Medical image sequence analysis, Interpretable machine learning
A Survey on Unit Test Case Generation	Unit Test, Unit Testing, Large Language Model
Enhancing Blind Image Deblurring Robustness Against Saturated Images	Blind Deblurring, Impulse Noise Detection, Non-convex Optimization, Saturated Images
A Comprehensive Survey of Token Compression for Vision Transformers, Vision Generation, Large Language Models, Large Multimodal Models	Token Compression, Token Pruning, Token Reduction, Token Merging, Accelerating Diffusion Transformers, Vision-Language Models, Image Generation, Video Generation, Vision Transformers. Large Language Models, Acceleration, Efficient Large Multimodal Models
Automatic Detection of Age-Related Macular Degeneration in Mice Using OCT Images and Deep Neural Networks	Age-Related Macular Degeneration (AMD), Optical Coherence Tomography (OCT), Convolutional Neural Networks (CNN), VGG16, ResNet, Deep Learning,nnUNet, Biomedical Image Analysis, Classification Models, Artificial Intelligence in Ophthalmology
Deep learning for depression recognition with multimodal : A Review	Depression, Multimodal methods, Emotion recognition, Deep learning, Unimodal methods, Psychological abnormalities
Paper on the PET image reconstruction, focused on the methods in the generation AI era, comparing deterministic and generative methods	AI, medical imaging, PET, reconstruction, generative model
LLM-based Autonomous Agents Empowered Social Sciences: From a Social Simulation Perspective	large language models, LLM-based agent, social simulation, social science, social intelligence, world simulation
Multi-modal Knowledge Graph Completion and Its Application: A Comprehensive Survey	Knowledge Graphs, Multi-modal Knowledge Graph Completion, Multi-modal Fusion, Knowledge Graph Application, Link Prediction
A Fingerprint Localization Algorithm Based on Low-Density Tags	Indoor location, Fingerprint location
Document Understanding with Multi-modal Large Language Model : A Survey	Document Understanding, Multi-modal Large Language, Document AI, Document VQA
Towards human-like multimodal perception and cognition: A review of challenges and future prospects of MLLM multimodal alignment.	nan
Temporal Question Answering: A survey	Large Language Models, Temproal Knowledge Graph, Retrieval-Augmented Generation
Automated Material Synthesis Laboratory: A Survey	Robotics, AI, LLM, Autonomous laboratory, Active learning, Machine learning, Deep learning, High-throughput, DFT, Materials science, Chemistry
Emergent Abilities in Large Language Models: a Survey	Large Language Models, Emergent Abilities
A Comprehensive Survey of Text-to-Speech Synthesis: Technologies, Methodologies, and Applications	Text-to-speech (TTS) systems, speech synthesis architectures, parametric TTS, neural TTS, deep learning for speech synthesis, waveform generation, vocoders, prosody modeling, multilingual speech synthesis, real-time TTS, voice conversion, zero-shot TTS, evaluation metrics for TTS
From Signal to Parameter: A Complete Technical Guide to All PPG-Derived Measurements and Their Extraction Methods	Photoplethysmography, Biosensors, Signal processing, Optical monitoring, Vital signs, Biomedical engineering
The Potential and Challenges of Artificial Intelligence in Policing : A survey	AI, Policing, Law enforcement, Crime prevention, Surveillance, Predictive analytics, Ethics, Bias, Decision-making, Legal implications
Application of multi-modal large-scale model in robot orientation	Multi-modal large model, robot, Reinforcement learning, embodied intelligence,Hybrid model of experts
Application Research on fatigue detection using face recognition technology	Fatigue detection, face recognition, deep learning image processing
Deep Learning Applications in Single-cell Omics for Colorectal Cancer Research	Deep Learning, Single-cell Omics, Colorectal Cancer, Bioinformatics
Prediction of shear strength parameters of granite residual soil based on machine learning	granite residual soil, machine learning, shear strength parameters, data processing, model interpretability
Quantifying the Use and Impact of Machine Learning in Lung Sound Classification and Recognition	Machine learning, lung sounds, respiratory sound recognition, Pulmonary sound analysis,Feature extraction
Process Reward Models for Large Language Model Reasoning: A Comprehensive Review	Large Language Models, process supervision, step-level supervision, process reward model (PRM), step-level reward, process verifier, step-by-step verifier, Reinforcement Learning
A Comprehensive Survey of Low-Rank Adaptation (LoRA) in Large Language Models, Vision Generation, Large Multimodal Models and Beyond: Methods, Theories, Applications and Opportunities	LoRA (Low-Rank Adaptation), LoRA Hyperparameters, LoRA with Pruning, Multimodal learning, Vision-Language Models
Symbolic Regression: Architectures, Benchmarks, and Emerging Trends	symbolic regression, sparse regression, equation discovery, genetic algorithms, Reinforcement Learning, large language models
Diffusion Models in object detection: A survey.	Diffusion Models, remote sensing, synthetic aperture radar, feature representation, object recognition
EEG Foundation Model: A Survey	AI, EEG, Foundation model, Deep Learning, Time Series, BCI
PCB defect detection: a survey	PCB defect detection
Diffusion Models in object detection: A survey.	Diffusion Models, object detection, object recognition
The Impact of AI Image Recognition Technology in Sports Fitness Training on Customer Participation and Repurchase Intention: A Survey	Image Recognition, Sports Fitness Training, Customer Participation, Repurchase Intention
A review on AGI-enabled solutions for IoX services exploding with cyber-physical-social-thinking space	services exploding, AI, internet of things, internet of people, Internet of Thinking
A review of deep-learning-based approaches for Positron Emission Tomography Reconstrution	deep learning, Positron Emission Tomography, PET image reconstrution
Finetuning Mixture of Experts LLMs: Methods, Applications, Risks and Opportunities	LLMs, Large Language Models, Mixture of Experts, training, Continued Pretraining, Finetuning, SFT, Supervised Finetuning, MOE, Expert router, LORA, PEFT, Full Finetuning, retraining experts, finetuning experts
A Survey on Large Language Model based Autonomous Agents	Autonomous agent, Large language model, Human-level intelligence, AGI, ASI, coding
Progress and Prospects of Research on the Whole Stage of Coal Spontaneous Combustion	spontaneous combustion, comprehensive prevention and control throughout all stages, machine learning, artificial intelligence, informatization, monitoring and early warning, fire prevention and control, emergency rescue
a survey on large language model for debugging the programs	Autonomous agent, Large language model, Debug, Debug like a Human, Human in the loop, Human-Computer Interaction, Automatic Program Repair, self-debug, testing
Evolving robotic systems with integrated advanced sensors and machine learning	multisensory, multimodal sensor, advanced sensor, design principles of sensors, materials, structure design, fabriaction, sensing mechanisms, cross-modal communication and learning, machine learning hardware and algorithms, Integration, embedded ML systems, Neuromorphic sensing and computing systems, Virtual Reality (VR) and Augmented Reality (AR), Smart manufacturin, Healthcare, Autonomous vehicles, Large scale IoTs, Robotics
Deepfake Detection Guided by data augmentation: A Survey	deepfake, data augmentation, inverse graphics, classifier, computer vision, faceswap
A Comprehensive Survey of Facial Landmark Detection	facial landmark detection, face alignment, 3d facial landmarks, face morphing, facial retouching, virtual makeup
Domain Adaptation in Medical Imaging: A Survey	medical imaging, domain adaptation, machine learning, healthcare, image processing
Speech-based Depression Recognition: A Survey	Depression, Speech, Deep Learning
Energy Communities: Global Trends, Nordic Innovations, and the Swedish Transition	decentralized energy systems, renewable energy transition, citizen participation, energy democracy
A Comprehensive Review of Sepsis: Mechanisms, Clinical Management, and MIMIC Database-Driven Insights	Sepsis, mimic database, data mining, critical care, pathophysiology, clinical management, prognostic indicators, big data analytics, machine learning, deep learning, evidence-based medicine
Tree Crown Delineation in Remote Sensing: A Review	Individual tree crown delineation, Tree crown detection, LiDAR, RGB high-resolution imagery, UAV remote sensing, very high-resolution , multispectral, deep learning, tropical forest, ecology, remote sensing, aerial imagery, automatic tree crown delineation
Ecological Succession and Canopy Gaps: A Survey on Tree Architecture, Species Strategies, and Gap Dynamics	Ecological succession, canopy gaps, tree architecture, species temperament
A Comprehensive Survey on LLM-Post-Training-oriented Data Synthesis	-
Using Large Language Model Agents in psychology	Large Language Model, LLM Agents, psychology

📃Citing SurveyX

Please cite us if you find this project helpful for your project/paper:

@misc{liang2025surveyxacademicsurveyautomation,
      title={SurveyX: Academic Survey Automation via Large Language Models}, 
      author={Xun Liang and Jiawei Yang and Yezhaohui Wang and Chen Tang and Zifan Zheng and Simin Niu and Shichao Song and Hanyu Wang and Bo Tang and Feiyu Xiong and Keming Mao and Zhiyu li},
      year={2025},
      eprint={2502.14776},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2502.14776}, 
}

⚠️ Note

Our retrieval engine may not have access to many papers that require commercial licensing. If your research topic requires papers from sources other than arXiv, the quality and comprehensiveness of the generated papers may be affected due to limitations in our retrieval scope.
We currently only support the generation of English academic survey generation. Support for other languages is not available.

⚠️Disclaimer

SurveyX uses advanced language models to assist with the generation of academic papers. However, it is important to note that the generated content is a tool for research assistance. Users should verify the accuracy of the generated papers, as SurveyX cannot guarantee full compliance with academic standards.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
assets		assets
examples		examples
user_requests		user_requests
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SurveyX: Academic Survey Automation via Large Language Models

⭐Star History

🤔What is SurveyX?

🤔 What’s This Git For?

🖋️How to Request a Custom Paper via Issue

💬Example Issue Submission:

📝Generated Topics

Examples Papers

User Requested Papers

📃Citing SurveyX

⚠️ Note

⚠️Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

SurveyX: Academic Survey Automation via Large Language Models

⭐Star History

🤔What is SurveyX?

🤔 What’s This Git For?

🖋️How to Request a Custom Paper via Issue

💬Example Issue Submission:

📝Generated Topics

Examples Papers

User Requested Papers

📃Citing SurveyX

⚠️ Note

⚠️Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages