✨Welcome to SurveyX! This GitHub repository serves as a channel for users to submit requests for paper generation based on specific topics or domains.📚
🚀 We're actively developing a full-featured product with a sleek graphical interface!
⭐ Star this repo to stay updated and be the first to know about our progress and release announcements!
💡 Your support means everything to us as we work to bring this innovative solution to life. Stay tuned for more updates!
SurveyX is an advanced academic survey automation system that leverages the power of Large Language Models (LLMs) to generate high-quality, domain-specific academic papers and surveys.🚀
By simply providing a paper title and keywords for literature retrieval, users can request comprehensive academic papers or surveys tailored to specific topics.
If you're curious about how SurveyX works or want to understand the underlying technology and methodology, feel free to check out our 📑website, where we provide an in-depth explanation of the system's architecture, data processing methods, and experimental results.
This GitHub repository is designed to provide a platform where users can request the generation of high-quality, domain-specific academic surveys by simply submitting an issue. The main purpose of this repository is to allow users to easily create and receive tailored academic surveys or papers, which are generated using SurveyX📄
By submitting an issue with a paper title and keywords for literature search, users can quickly generate a comprehensive review paper or survey on a specific topic. This process streamlines academic research by automating paper creation, saving users time and effort in compiling research content. 📚💡
To request a paper, create a new issue with the following details:
- Paper Title: Provide the title of the paper you need.
- Keywords for Literature Search: Provide keywords separated by commas that will help retrieve relevant literature and guide the content generation (e.g. "AI in healthcare, climate change impact on agriculture, ethical implications of AI"). We recommend that the number of keywords be limited to 4 ~ 5 for optimal results.
- Your email(optional): Please provide your email address so that we can notify you promptly once the paper is ready.
Title: Controllable text generation of LLM: A Survey
Keywords: AI, healthcare, ethical implications, technology adoption, AI-driven diagnostics
Email: xxxxxxxx@SurveyX.cn
Once your request is submitted, the generated paper will be placed in the user requests folder. Please allow 1-2 business days for processing and generation. ⏳
| Title | Keywords |
|---|---|
| From BERT to GPT-4: A Survey of Architectural Innovations in Pre-trained Language Models | Transformer, BERT, GPT-3, self-attention, masked language modeling, cross-lingual transfer, model scaling |
| Unsupervised Cross-Lingual Word Embedding Alignment: Techniques and Applications | low-resource NLP, few-shot learning, data augmentation, unsupervised alignment, synthetic corpora, NLLB, zero-shot transfer |
| Vision-Language Pre-training: Architectures, Benchmarks, and Emerging Trends | multimodal learning, CLIP, Whisper, cross-modal retrieval, modality fusion, video-language models, contrastive learning |
| Efficient NLP at Scale: A Review of Model Compression Techniques | model compression, knowledge distillation, pruning, quantization, TinyBERT, edge computing, latency-accuracy tradeoff |
| Domain-Specific NLP: Adapting Models for Healthcare, Law, and Finance | domain adaptation, BioBERT, legal NLP, clinical text analysis, privacy-preserving NLP, terminology extraction, few-shot domain transfer |
| Attention Heads of Large Language Models: A Survey | attention head, attention mechanism, large language model, LLM,transformer architecture, neural networks, natural language processing |
| Controllable Text Generation for Large Language Models: A Survey | controlled text generation, text generation, large language model, LLM,natural language processing |
| A survey on evaluation of large language models | evaluation of large language models,large language models assessment, natural language processing, AI model evaluation |
| Large language models for generative information extraction: a survey | information extraction, large language models, LLM,natural language processing, generative AI, text mining |
| Internal consistency and self feedback of LLM | Internal consistency, self feedback, large language model, LLM,natural language processing, model evaluation, AI reliability |
| Review of Multi Agent Offline Reinforcement Learning | multi agent, offline policy, reinforcement learning,decentralized learning, cooperative agents, policy optimization |
| Reasoning of large language model: A survey | reasoning of large language models, large language models, LLM,natural language processing, AI reasoning, transformer models |
| Hierarchy Theorems in Computational Complexity: From Time-Space Tradeoffs to Oracle Separations | P vs NP, NP-completeness, polynomial hierarchy, space complexity, oracle separation, Cook-Levin theorem |
| Classical Simulation of Quantum Circuits: Complexity Barriers and Implications | BQP, quantum supremacy, Shor's algorithm, post-quantum cryptography, QMA, hidden subgroup problem |
| Kernelization: Theory, Techniques, and Limits | fixed-parameter tractable (FPT), kernelization, treewidth, W-hierarchy, ETH (Exponential Time Hypothesis), parameterized reduction |
| Optimal Inapproximability Thresholds for Combinatorial Optimization Problems | PCP theorem, approximation ratio, Unique Games Conjecture, APX-hardness, gap-preserving reduction, LP relaxation |
| Hardness in P: When Polynomial Time is Not Enough | SETH (Strong Exponential Time Hypothesis), 3SUM conjecture, all-pairs shortest paths (APSP), orthogonal vectors problem, fine-grained reduction, dynamic lower bounds |
| Consistency Models in Distributed Databases: From ACID to NewSQL | CAP theorem, ACID vs BASE, Paxos/Raft, Spanner, NewSQL, sharding, linearizability |
| Cloud-Native Databases: Architectures, Challenges, and Future Directions | cloud databases, AWS Aurora, Snowflake, storage-compute separation, auto-scaling, pay-per-query, multi-tenancy |
| Graph Database Systems: Storage Engines and Query Optimization Techniques | graph traversal, Neo4j, SPARQL, property graph, subgraph matching, RDF triplestore, Gremlin |
| Real-Time Aggregation in TSDBs: Techniques for High-Cardinality Data | time-series data, InfluxDB, Prometheus, downsampling, time windowing, high-cardinality indexing, stream processing |
| Self-Driving Databases: A Survey of AI-Powered Autonomous Management | autonomous databases, learned indexes, query optimization, Oracle AutoML, workload forecasting, anomaly detection |
| Multi-Model Databases: Integrating Relational, Document, and Graph Paradigms | multi-model database, MongoDB, ArangoDB, JSONB, unified query language, schema flexibility, polystore |
| Vector Databases for AI: Efficient Similarity Search and Retrieval-Augmented Generation | vector database, FAISS, Milvus, ANN search, embedding indexing, RAG (Retrieval-Augmented Generation), HNSW |
| Software-Defined Networking: Evolution, Challenges, and Future Scalability | OpenFlow, control plane/data plane separation, NFV orchestration, network slicing, P4 language, OpenDaylight, scalability bottlenecks |
| Beyond 5G: Architectural Innovations for Terahertz Communication and Network Slicing | network slicing, MEC (Multi-access Edge Computing), beamforming, mmWave, URLLC (Ultra-Reliable Low-Latency Communication), O-RAN, energy efficiency |
| IoT Network Protocols: A Comparative Study of LoRaWAN, NB-IoT, and Thread | LPWAN, LoRa, ZigBee 3.0, 6LoWPAN, TDMA scheduling, RPL routing, device density management |
| Edge Caching in Content Delivery Networks: Algorithms and Economic Incentives | CDN, Akamai, cache replacement policies, DASH (Dynamic Adaptive Streaming), QoE optimization, edge server placement, bandwidth cost reduction |
| A survey on flow batteries | battery electrolyte formulation |
| Research on battery electrolyte formulation | flow batteries |
| Title | Keywords |
|---|---|
| Think and Draw! A survey on Vision-MLLMs that can understand and generate | vision-language models, multimodal learning, generative AI |
| A Survey of new intent detection and discovery for Conversational Understanding | Out-of-domain Detection, New Intent Discovery, Generalized Category Discovery |
| A Survey of Segment Anything Model (SAM) in Medical Imaging: Advances in Vision Foundation Models | Segment Anything Model (SAM), Medical Image Segmentation, Vision Foundation Models, Prompt Engineering, Efficient Fine-Tuning |
| A Survey of joint extraction of medical entities and relations | Medical Entity Recognition, Joint Extraction, Relation Extraction, Biomedical Text Mining, Deep Learning |
| Reinforcement Learning for Large Language Models: Methods, Challenges, and Applications | Large Language Models, Reinforcement Learning, RLHF, Reward Modeling, AI Alignment, Fine-Tuning, Prompt Optimization, Self-Supervised Learning, Model-based RL, Meta-Reinforcement Learning, AI Agents, Multi-Agent Reinforcement Learning (MARL), Curriculum Learning, Few-Shot Learning, Continual Learning, Adaptive Learning, Human-in-the-Loop Learning |
| Process Reward Models for LLM Reasoning | Process Reward Model, Reasoning, Large Language Model |
| Novel Multiferroics coupling ferroelectricity with Skyrmion, altermagnetism or Ferrovalley | Multiferroic, Skyrmion, altermagnetism, Ferrovalley, ferroelectricity |
| Comparative study of cardiac markers CKMB and LDH in pericardial fluid for postmortem diagnosis | Forensic medicine, Postmortem diagnosis, Sample timing, Cardiac muscle fibers, Myocardial infarction |
| A Multi-dimensional Perspective on Hybrid Human-Artificial Intelligence: Opportunities and Challenges in the Era of Large Language Models | Hybrid Human-Artificial Intelligence, Large Language Models, Deep Learning, Reinforcement learning |
| Multimodal fusion with Multimodal Temporal Data for Clinical prediction | Multimodal temporal data, Deep learning in healthcare, Clinical decision support systems (CDSS), Temporal alignment, Uncertainty quantification, Reinforcement learning in medicine, Medical image sequence analysis, Interpretable machine learning |
| A Survey on Unit Test Case Generation | Unit Test, Unit Testing, Large Language Model |
| Enhancing Blind Image Deblurring Robustness Against Saturated Images | Blind Deblurring, Impulse Noise Detection, Non-convex Optimization, Saturated Images |
| A Comprehensive Survey of Token Compression for Vision Transformers, Vision Generation, Large Language Models, Large Multimodal Models | Token Compression, Token Pruning, Token Reduction, Token Merging, Accelerating Diffusion Transformers, Vision-Language Models, Image Generation, Video Generation, Vision Transformers. Large Language Models, Acceleration, Efficient Large Multimodal Models |
| Automatic Detection of Age-Related Macular Degeneration in Mice Using OCT Images and Deep Neural Networks | Age-Related Macular Degeneration (AMD), Optical Coherence Tomography (OCT), Convolutional Neural Networks (CNN), VGG16, ResNet, Deep Learning,nnUNet, Biomedical Image Analysis, Classification Models, Artificial Intelligence in Ophthalmology |
| Deep learning for depression recognition with multimodal : A Review | Depression, Multimodal methods, Emotion recognition, Deep learning, Unimodal methods, Psychological abnormalities |
| Paper on the PET image reconstruction, focused on the methods in the generation AI era, comparing deterministic and generative methods | AI, medical imaging, PET, reconstruction, generative model |
| LLM-based Autonomous Agents Empowered Social Sciences: From a Social Simulation Perspective | large language models, LLM-based agent, social simulation, social science, social intelligence, world simulation |
| Multi-modal Knowledge Graph Completion and Its Application: A Comprehensive Survey | Knowledge Graphs, Multi-modal Knowledge Graph Completion, Multi-modal Fusion, Knowledge Graph Application, Link Prediction |
| A Fingerprint Localization Algorithm Based on Low-Density Tags | Indoor location, Fingerprint location |
| Document Understanding with Multi-modal Large Language Model : A Survey | Document Understanding, Multi-modal Large Language, Document AI, Document VQA |
| Towards human-like multimodal perception and cognition: A review of challenges and future prospects of MLLM multimodal alignment. | nan |
| Temporal Question Answering: A survey | Large Language Models, Temproal Knowledge Graph, Retrieval-Augmented Generation |
| Automated Material Synthesis Laboratory: A Survey | Robotics, AI, LLM, Autonomous laboratory, Active learning, Machine learning, Deep learning, High-throughput, DFT, Materials science, Chemistry |
| Emergent Abilities in Large Language Models: a Survey | Large Language Models, Emergent Abilities |
| A Comprehensive Survey of Text-to-Speech Synthesis: Technologies, Methodologies, and Applications | Text-to-speech (TTS) systems, speech synthesis architectures, parametric TTS, neural TTS, deep learning for speech synthesis, waveform generation, vocoders, prosody modeling, multilingual speech synthesis, real-time TTS, voice conversion, zero-shot TTS, evaluation metrics for TTS |
| From Signal to Parameter: A Complete Technical Guide to All PPG-Derived Measurements and Their Extraction Methods | Photoplethysmography, Biosensors, Signal processing, Optical monitoring, Vital signs, Biomedical engineering |
| The Potential and Challenges of Artificial Intelligence in Policing : A survey | AI, Policing, Law enforcement, Crime prevention, Surveillance, Predictive analytics, Ethics, Bias, Decision-making, Legal implications |
| Application of multi-modal large-scale model in robot orientation | Multi-modal large model, robot, Reinforcement learning, embodied intelligence,Hybrid model of experts |
| Application Research on fatigue detection using face recognition technology | Fatigue detection, face recognition, deep learning image processing |
| Deep Learning Applications in Single-cell Omics for Colorectal Cancer Research | Deep Learning, Single-cell Omics, Colorectal Cancer, Bioinformatics |
| Prediction of shear strength parameters of granite residual soil based on machine learning | granite residual soil, machine learning, shear strength parameters, data processing, model interpretability |
| Quantifying the Use and Impact of Machine Learning in Lung Sound Classification and Recognition | Machine learning, lung sounds, respiratory sound recognition, Pulmonary sound analysis,Feature extraction |
| Process Reward Models for Large Language Model Reasoning: A Comprehensive Review | Large Language Models, process supervision, step-level supervision, process reward model (PRM), step-level reward, process verifier, step-by-step verifier, Reinforcement Learning |
| A Comprehensive Survey of Low-Rank Adaptation (LoRA) in Large Language Models, Vision Generation, Large Multimodal Models and Beyond: Methods, Theories, Applications and Opportunities | LoRA (Low-Rank Adaptation), LoRA Hyperparameters, LoRA with Pruning, Multimodal learning, Vision-Language Models |
| Symbolic Regression: Architectures, Benchmarks, and Emerging Trends | symbolic regression, sparse regression, equation discovery, genetic algorithms, Reinforcement Learning, large language models |
| Diffusion Models in object detection: A survey. | Diffusion Models, remote sensing, synthetic aperture radar, feature representation, object recognition |
| EEG Foundation Model: A Survey | AI, EEG, Foundation model, Deep Learning, Time Series, BCI |
| PCB defect detection: a survey | PCB defect detection |
| Diffusion Models in object detection: A survey. | Diffusion Models, object detection, object recognition |
| The Impact of AI Image Recognition Technology in Sports Fitness Training on Customer Participation and Repurchase Intention: A Survey | Image Recognition, Sports Fitness Training, Customer Participation, Repurchase Intention |
| A review on AGI-enabled solutions for IoX services exploding with cyber-physical-social-thinking space | services exploding, AI, internet of things, internet of people, Internet of Thinking |
| A review of deep-learning-based approaches for Positron Emission Tomography Reconstrution | deep learning, Positron Emission Tomography, PET image reconstrution |
| Finetuning Mixture of Experts LLMs: Methods, Applications, Risks and Opportunities | LLMs, Large Language Models, Mixture of Experts, training, Continued Pretraining, Finetuning, SFT, Supervised Finetuning, MOE, Expert router, LORA, PEFT, Full Finetuning, retraining experts, finetuning experts |
| A Survey on Large Language Model based Autonomous Agents | Autonomous agent, Large language model, Human-level intelligence, AGI, ASI, coding |
| Progress and Prospects of Research on the Whole Stage of Coal Spontaneous Combustion | spontaneous combustion, comprehensive prevention and control throughout all stages, machine learning, artificial intelligence, informatization, monitoring and early warning, fire prevention and control, emergency rescue |
| a survey on large language model for debugging the programs | Autonomous agent, Large language model, Debug, Debug like a Human, Human in the loop, Human-Computer Interaction, Automatic Program Repair, self-debug, testing |
| Evolving robotic systems with integrated advanced sensors and machine learning | multisensory, multimodal sensor, advanced sensor, design principles of sensors, materials, structure design, fabriaction, sensing mechanisms, cross-modal communication and learning, machine learning hardware and algorithms, Integration, embedded ML systems, Neuromorphic sensing and computing systems, Virtual Reality (VR) and Augmented Reality (AR), Smart manufacturin, Healthcare, Autonomous vehicles, Large scale IoTs, Robotics |
| Deepfake Detection Guided by data augmentation: A Survey | deepfake, data augmentation, inverse graphics, classifier, computer vision, faceswap |
| A Comprehensive Survey of Facial Landmark Detection | facial landmark detection, face alignment, 3d facial landmarks, face morphing, facial retouching, virtual makeup |
| Domain Adaptation in Medical Imaging: A Survey | medical imaging, domain adaptation, machine learning, healthcare, image processing |
| Speech-based Depression Recognition: A Survey | Depression, Speech, Deep Learning |
| Energy Communities: Global Trends, Nordic Innovations, and the Swedish Transition | decentralized energy systems, renewable energy transition, citizen participation, energy democracy |
| A Comprehensive Review of Sepsis: Mechanisms, Clinical Management, and MIMIC Database-Driven Insights | Sepsis, mimic database, data mining, critical care, pathophysiology, clinical management, prognostic indicators, big data analytics, machine learning, deep learning, evidence-based medicine |
| Tree Crown Delineation in Remote Sensing: A Review | Individual tree crown delineation, Tree crown detection, LiDAR, RGB high-resolution imagery, UAV remote sensing, very high-resolution , multispectral, deep learning, tropical forest, ecology, remote sensing, aerial imagery, automatic tree crown delineation |
| Ecological Succession and Canopy Gaps: A Survey on Tree Architecture, Species Strategies, and Gap Dynamics | Ecological succession, canopy gaps, tree architecture, species temperament |
| A Comprehensive Survey on LLM-Post-Training-oriented Data Synthesis | - |
| Using Large Language Model Agents in psychology | Large Language Model, LLM Agents, psychology |
Please cite us if you find this project helpful for your project/paper:
@misc{liang2025surveyxacademicsurveyautomation,
title={SurveyX: Academic Survey Automation via Large Language Models},
author={Xun Liang and Jiawei Yang and Yezhaohui Wang and Chen Tang and Zifan Zheng and Simin Niu and Shichao Song and Hanyu Wang and Bo Tang and Feiyu Xiong and Keming Mao and Zhiyu li},
year={2025},
eprint={2502.14776},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2502.14776},
}
- Our retrieval engine may not have access to many papers that require commercial licensing. If your research topic requires papers from sources other than arXiv, the quality and comprehensiveness of the generated papers may be affected due to limitations in our retrieval scope.
- We currently only support the generation of English academic survey generation. Support for other languages is not available.
SurveyX uses advanced language models to assist with the generation of academic papers. However, it is important to note that the generated content is a tool for research assistance. Users should verify the accuracy of the generated papers, as SurveyX cannot guarantee full compliance with academic standards.

