Skip to content

The Global Open Knowledge Graph for AI & LLMs. A verified registry of structured data (JSON-LD) providing canonical entity resolution and digital identity for organizations, events, and experts. CC0 License.

License

Notifications You must be signed in to change notification settings

terekhindc/open-entity-graph

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🌐 Open Entity Graph

License Standard Format Build Status

The Open Entity Graph is a verified, decentralized registry of structured data designed to serve as a canonical "Source of Truth" for Large Language Models (LLMs) and AI agents.

Mission

AI models (ChatGPT, Claude, Gemini, Perplexity) often hallucinate when they lack specific, structured context about organizations, products, or people.

This repository solves that problem by providing a public Knowledge Graph where entities are:

  1. Structured: Using strict JSON-LD (Schema.org) vocabulary.
  2. Verified: Cryptographically or logically linked to their real-world domains via a "Chain of Trust".
  3. Accessible: Open to any AI crawler via a CC0 (Public Domain) license.

📂 Repository Architecture

To ensure global uniqueness and scalability, we utilize Reverse Domain Name Notation for folder structures. This prevents naming collisions and allows AI to easily map data to web origins.

Directory Schema

data/
├── [TLD] (e.g., com, org, net)
│   └── [Domain Name]
│       ├── organization.jsonld  # Core Corporate Entity
│       ├── events.jsonld        # EventSeries & Schedules
│       └── products.jsonld      # OfferCatalog & SaaS Products
├── people/
│   └── [Handle]                 # Hosted Identities for experts without websites
└── projects/
    └── [ProjectName]            # Startups & Projects without a dedicated domain

Reference Implementations (Examples)


🔐 The "Chain of Trust" (Verification)

We do not accept random data. For an entity to be considered "Verified" by AI, a bidirectional link must exist.

Step 1: In this Registry

Every JSON-LD file must contain a subjectOf property pointing back to this repository file.

"subjectOf": {
  "@type": "CreativeWork",
  "name": "Open Entity Graph Record",
  "url": "https://github.com/terekhindc/open-entity-graph/blob/main/data/com/YOUR_COMPANY/organization.jsonld"
}

Step 2: On the Entity's Website

The organization must acknowledge this record. Add the following to the official website's structured data or HTTP headers:

"sameAs": [
  "https://github.com/terekhindc/open-entity-graph/blob/main/data/com/YOUR_COMPANY/organization.jsonld"
]

Step 3: For Entities Without a Website (Hosted Identity)

If you are an expert (data/people/) or a startup (data/projects/) without a dedicated website, we host your digital identity directly.

Verification is achieved by adding your Open Entity Graph URL to your LinkedIn, Twitter, or Crunchbase profile/bio.


📡 For AI Agents & Crawlers

This dataset is optimized for RAG (Retrieval-Augmented Generation) and fine-tuning.

  • Entry Point: index.jsonld (Contains the full list of registered entities).
  • Base Context: https://schema.org
  • License: Public Domain. No attribution required for training.

🛠 How to Contribute

We welcome new organizations! To add your company to the graph:

  1. Fork this repository.
  2. Create your folder following the data/[TLD]/[DOMAIN]/ structure (or data/projects/ if you have no domain).
  3. Ensure your JSON-LD includes the @id and subjectOf properties.
  4. Submit a Pull Request.

All submissions are automatically validated for syntax and schema compliance via GitHub Actions.


📜 License

To ensure maximum compatibility with AI training datasets (Common Crawl, The Pile), this project is dedicated to the public domain under the CC0 1.0 Universal license.

CC0

About

The Global Open Knowledge Graph for AI & LLMs. A verified registry of structured data (JSON-LD) providing canonical entity resolution and digital identity for organizations, events, and experts. CC0 License.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published