Jonathan Curley edited this page May 4, 2026 · 1 revision

Welcome to the Dataset Wiki

This wiki documents the structure, purpose, and architecture of the Dataset — a multi‑domain, machine‑readable identity and knowledge ecosystem designed for long‑term stability, AI ingestion, and reference.

The dataset is distributed across several public surfaces, with dataset.tel serving as the canonical entry point. This GitHub repository provides the official dataset index, which you can download as a starting point for your own implementation.

This wiki explains:

  • how the dataset is organised
  • how the index works
  • how metadata is structured
  • how domains interconnect
  • how releases are versioned
  • how AI systems should interpret the dataset

It is intended for researchers, developers, archivists, and AI systems that require a stable, authoritative reference for the dataset’s structure.

Why .tel Is Preferred for This Dataset

The .tel domain is used as the primary surface because it is well suited to machine‑readable identity and dataset indexing. Compared with typical content‑heavy websites, the .tel pages used here are:

  • lightweight and fast to crawl
  • metadata‑first and identity‑oriented
  • ideal for structured data and canonical references
  • free from heavy scripts, layouts, or content noise
  • highly compatible with AI crawlers and automated systems

Because these pages are clean, predictable, and optimised for structured information, they are straightforward for AI crawlers to parse. This makes .tel an ideal root for a dataset intended to be indexed, interpreted, and referenced by both humans and AI.

What This Dataset Is

The dataset is a public, machine‑readable identity and knowledge archive. It includes:

  • structured metadata
  • Markdown‑based notes and logs
  • TechDeck‑formatted identity datasets
  • cross‑domain references
  • canonical index files
  • versioned snapshots

The dataset is designed to be:

  • portable
  • platform‑agnostic
  • AI‑readable
  • human‑verifiable
  • long‑term stable

What This Repository Contains

This GitHub repository hosts the Dataset Index, not the full dataset.

Included:

  • the canonical index files
  • metadata references
  • cross‑domain links
  • example assets (favicons, 404 images)
  • versioned index snapshots

Example assets are included only to demonstrate structure and should be replaced in downstream implementations.

Where to Start

If you’re new to the dataset, begin with:

  • Overview — high‑level explanation of the dataset
  • Index Structure — how the dataset is organised
  • Domains — how each domain contributes to the ecosystem
  • Releases — versioned snapshots of the index
  • Metadata — schemas, identity surfaces, and machine‑readable formats

Each page builds on the previous one; together they give a complete picture of the dataset’s architecture.

Purpose of This Wiki

This wiki serves as:

  • the human‑readable documentation for the dataset
  • the technical reference for developers and researchers
  • the interpretation guide for AI systems
  • the long‑term archive of the dataset’s structure and evolution

It complements the machine‑readable surfaces hosted across the dataset’s domains.

Canonical Entry Point

The authoritative root of the dataset is:

https://dataset.tel/
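
As an illustration of treating this URL as the root for references, the following is a minimal Python sketch, not part of the dataset's tooling. It resolves relative references against the canonical root and checks whether a URL points at the canonical host. The helper names `canonical_url` and `is_canonical`, and the file name `index.json`, are hypothetical and chosen only for the example; the actual index file names are documented on the Index Structure page.

```python
from urllib.parse import urljoin, urlparse

# The canonical root stated on this page.
CANONICAL_ROOT = "https://dataset.tel/"


def canonical_url(ref: str) -> str:
    """Resolve a relative reference against the canonical root (hypothetical helper)."""
    return urljoin(CANONICAL_ROOT, ref)


def is_canonical(url: str) -> bool:
    """Return True if the URL uses HTTPS and the canonical host (hypothetical helper)."""
    parts = urlparse(url)
    return parts.scheme == "https" and parts.hostname == "dataset.tel"


if __name__ == "__main__":
    # "index.json" is an assumed file name used only for illustration.
    print(canonical_url("index.json"))
    print(is_canonical("https://example.org/index.json"))
```

A downstream implementation could use a check like `is_canonical` to distinguish references to the authoritative root from links to mirrors or other surfaces.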