You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GraphRAG automatically detects and maps relationships between document chunks while
44
+
maintaining context of how information connects across different sources.
45
45
46
-
## How GraphRAG works
46
+
-**Dynamic knowledge evolution**:
47
+
The system learns and improves understanding as more documents are added, with
48
+
relationships and connections becoming more sophisticated over time.
47
49
48
-
ArangoDB's GraphRAG solution democratizes the creation and usage of knowledge
49
-
graphs with a unique combination of vector search, graphs, and LLMs (privately or publicly hosted)
50
-
in a single product.
51
50
52
-
The overall workflow involves the following steps:
53
-
1.**Chunking**:
54
-
- Breaking down raw documents into text chunks
55
-
2.**Entity and relation extraction for Knowledge Graph construction**:
56
-
- LLM-assisted description of entities and relations
57
-
- Entities get inserted as nodes with embeddings
58
-
- Relations get inserted as edges, these include: entity-entity, entity-chunk, chunk-document
59
-
3.**Topology-based clustering into mini-topics (called communities)**:
60
-
- Each entity points to its community
61
-
- Each community points to its higher-level community, if available
62
-
(mini-topics point to major topics)
63
-
4.**LLM-assisted community summarization**:
64
-
- Community summarization is based on all information available about each topic
51
+
## What's next
65
52
66
-
### Turn text files into a Knowledge Graph
53
+
-**[GraphRAG Enterprise Use Cases](use-cases.md)**: Understand the business value through real-world scenarios.
54
+
-**[GraphRAG Technical Overview](technical-overview.md)**: Dive into the architecture, services, and implementation details.
55
+
-**[GraphRAG Web Interface](web-interface.md)**: Try GraphRAG using the interactive web interface.
56
+
-**[GraphRAG Tutorial using integrated Notebook servers](tutorial-notebook.md)**: Follow hands-on examples and implementation guidance via Jupyter Notebooks.
67
57
68
-
The Importer service is the entry point of the GraphRAG pipeline. It takes a
69
-
raw text file as input, processes it using an LLM to extract entities and
70
-
relationships, and generates a Knowledge Graph. The Knowledge Graph is then
71
-
stored in an ArangoDB database for further use. The Knowledge Graph represents
72
-
information in a structured graph format, allowing efficient querying and retrieval.
73
-
74
-
1. Pre-process the raw text file to identify entities and their relationships.
75
-
2. Use LLMs to infer connections and context, enriching the Knowledge Graph.
76
-
3. Store the generated Knowledge Graph in the database for retrieval and reasoning.
77
-
78
-
For detailed information about the service, see the
79
-
[Importer](../services/importer.md) service documentation.
80
-
81
-
### Extract information from the Knowledge Graph
82
-
83
-
The Retriever service enables intelligent search and retrieval of information
84
-
from your previously created Knowledge Graph.
85
-
You can extract information from Knowledge Graphs using two distinct methods:
86
-
- Global retrieval
87
-
- Local retrieval
88
-
89
-
For detailed information about the service, see the
90
-
[Retriever](../services/retriever.md) service documentation.
91
-
92
-
#### Global retrieval
93
-
94
-
Global retrieval focuses on:
95
-
- Extracting information from the entire Knowledge Graph, regardless of specific
96
-
contexts or constraints.
97
-
- Provides a comprehensive overview and answers queries that span across multiple
98
-
entities and relationships in the graph.
99
-
100
-
**Use cases:**
101
-
- Answering broad questions that require a holistic understanding of the Knowledge Graph.
102
-
- Aggregating information from diverse parts of the Knowledge Graph for high-level insights.
103
-
104
-
**Example query:**
105
-
106
-
Global retrieval can answer questions like _**What are the main themes or topics covered in the document**_?
107
-
108
-
During import, the entire Knowledge Graph is analyzed to identify and summarize
109
-
the dominant entities, their relationships, and associated themes. Global
110
-
retrieval uses these community summaries to answer questions from different
111
-
perspectives, then the information gets aggregated into the final response.
112
-
113
-
#### Local retrieval
114
-
115
-
Local retrieval is a more focused approach for:
116
-
- Queries that are constrained to specific subgraphs or contextual clusters
117
-
within the Knowledge Graph.
118
-
- Targeted and precise information extraction, often using localized sections
119
-
of the Knowledge Graph.
120
-
121
-
**Use cases:**
122
-
- Answering detailed questions about a specific entity or a related group of entities.
123
-
- Retrieving information relevant to a particular topic or section in the Knowledge Graph.
124
-
125
-
**Example query:**
126
-
127
-
Local retrieval can answer questions like _**What is the relationship between entity X and entity Y**_?
128
-
129
-
Local queries use hybrid search (semantic and lexical) over the Entities
130
-
collection, and then it expands that subgraph over related entities, relations
131
-
(and its LLM-generated verbal descriptions), text chunks, and communities.
132
-
133
-
### Private LLMs
134
-
135
-
If you're working in an air-gapped environment or need to keep your data
0 commit comments