| 中文版 / English |
GraphScholar is a research assistant for graph learning and GraphRAG papers. It turns local PDF papers into structured metadata, chunk-level evidence, and a graph-based literature store for retrieval and query answering.
- parses PDFs from
data/ - extracts title, abstract, tasks, applications, datasets, method summary, and contribution summary
- builds paper summaries and chunk-level evidence
- constructs a paper-topic-method-task-application-dataset graph
- answers research questions with retrieval and graph querying
- saves each run into
outputs/answer/
outputs/paper_summaries.jsonoutputs/paper_chunks.jsonoutputs/paper_graph.jsonoutputs/paper_metadata_cache.jsonoutputs/answer/*.md
paper_organization.pyreads PDFs and extracts structured metadata.src/build_graph.pybuilds the paper graph from the summaries.src/tools.pyhandles paper search, chunk search, and graph queries.src/agent.pyroutes the question, gathers evidence, and produces the answer.run_agent.pyruns demo or single-question mode and writes a report.
Each paper summary includes:
titleabstracttagscategorypaper_typetasksapplicationsdatasetsmethod_summarycontribution_summaryconfidence
Rebuild the paper store:
python paper_organization.pyRun in local deterministic mode:
python run_agent.py --no-llmRun one question:
python run_agent.py --question "If I am working on GraphRAG, help me organize the most representative papers in recent years by method, evaluation, and survey."Run the preset demo set:
python run_agent.py --demosrc/llm_client.py uses explicit in-code settings:
DEFAULT_BASE_URLDEFAULT_API_KEYDEFAULT_MODEL_ID
Fill those values before using LLM mode.
This is not a plain keyword search tool. It combines:
- structured paper extraction
- graph-aware literature organization
- task/application/dataset-aware retrieval
- chunk evidence with page references
- saved answer reports for review and reuse
The project is intentionally lightweight, but it already behaves like a compact research workflow for graph literature analysis.
