docs_to_knowledge_graph

cleanup: delete unneeded data files for examples (#481 )

May 13, 2025

5b371e9 · May 13, 2025


Build Real-Time Knowledge Graph For Documents with LLM

We will process a list of documents and use an LLM to extract relationships between the concepts in each document. We will generate two kinds of relationships:

Relationships between subjects and objects. E.g., "CocoIndex supports Incremental Processing"
Mentions of entities in a document. E.g., "core/basics.mdx" mentions CocoIndex and Incremental Processing.
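To make the two kinds of relationships concrete, here is a minimal Python sketch of how they could be modeled as plain records. The class, field, and function names (`Relationship`, `Mention`, `mentions_for`, and so on) are assumptions made for illustration; they are not the types defined in this example's main.py.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relationship:
    """A subject-predicate-object triple extracted from a document (hypothetical shape)."""
    subject: str
    predicate: str
    object: str


@dataclass(frozen=True)
class Mention:
    """Records that a document mentions an entity (hypothetical shape)."""
    filename: str
    entity: str


def mentions_for(filename: str, relationships: list[Relationship]) -> list[Mention]:
    """Derive one Mention per distinct entity that appears as a subject or object."""
    entities: list[str] = []
    for rel in relationships:
        for name in (rel.subject, rel.object):
            if name not in entities:  # keep first-seen order, skip duplicates
                entities.append(name)
    return [Mention(filename, name) for name in entities]


# The two examples from the text above, expressed as records.
rels = [Relationship("CocoIndex", "supports", "Incremental Processing")]
mentions = mentions_for("core/basics.mdx", rels)
```

In this sketch each relationship becomes an edge between two entity nodes, and each mention becomes an edge from a document node to an entity node.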

You can find a step-by-step blog post for this project here.

If you like our work, please give CocoIndex a star on GitHub to support us. Thank you so much, with a warm coconut hug 🥥🤗.

Prerequisites

Install Postgres if you don't have it installed.
Install Neo4j if you don't have it installed.
Configure your OpenAI API key.
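Before running the pipeline, it can help to confirm that the API key is actually visible to Python. The sketch below assumes the key is provided through the `OPENAI_API_KEY` environment variable, which is the name the OpenAI client libraries read by default; the helper `missing_settings` is a hypothetical name and is not part of this example.

```python
import os


def missing_settings(
    env: dict[str, str],
    required: tuple[str, ...] = ("OPENAI_API_KEY",),
) -> list[str]:
    """Return the names of required settings that are unset or blank in `env`."""
    return [name for name in required if not env.get(name, "").strip()]


# Check the current process environment and report what is missing, if anything.
missing = missing_settings(dict(os.environ))
if missing:
    print("Missing environment variables: " + ", ".join(missing))
else:
    print("All required settings are present.")
```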

Documentation

You can read the official CocoIndex Documentation for Property Graph Targets here.

Run

Build the index

Install dependencies:

pip install -e .

Setup:

python main.py cocoindex setup

Update index:

python main.py cocoindex update

Browse the knowledge graph

After the knowledge graph is built, you can explore it in Neo4j Browser.

For the dev environment, you can connect to Neo4j Browser using the following credentials, which are pre-configured in our docker compose config.yaml:

username: neo4j
password: cocoindex

You can open it at http://localhost:7474, and run the following Cypher query to get all relationships:

MATCH p=()-->() RETURN p
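The same query can also be run from Python instead of the browser. The following is a minimal sketch, assuming the official `neo4j` Python driver is installed (`pip install neo4j`) and that the server accepts Bolt connections at `bolt://localhost:7687`, which is Neo4j's default Bolt port but is not stated in this README. The function names and the added `LIMIT` clause are illustrative choices, not part of this example.

```python
def relationship_query(limit: int = 25) -> str:
    """Build a Cypher query that returns at most `limit` relationship paths."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    return f"MATCH p=()-->() RETURN p LIMIT {int(limit)}"


def fetch_relationships(
    uri: str = "bolt://localhost:7687",  # assumed default Bolt endpoint
    user: str = "neo4j",
    password: str = "cocoindex",
    limit: int = 25,
):
    """Run the query against a running Neo4j server and return the matched paths."""
    # Imported lazily so relationship_query() can be used without the driver installed.
    from neo4j import GraphDatabase

    with GraphDatabase.driver(uri, auth=(user, password)) as driver:
        with driver.session() as session:
            result = session.run(relationship_query(limit))
            # Materialize the records while the session is still open.
            return [record["p"] for record in result]
```

Calling `fetch_relationships(limit=10)` against the dev environment should return up to ten paths; capping the result keeps a large graph from being loaded all at once.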

CocoInsight

I used CocoInsight (currently in free beta) to troubleshoot the index generation and understand the data lineage of the pipeline. It connects to your local CocoIndex server, with zero pipeline data retention. Run the following command to start CocoInsight:

python main.py cocoindex server -ci

Then open the URL https://cocoindex.io/cocoinsight.
