tranz

Knowledge graph embedding models.

[dependencies]
tranz = "0.5"

Dual-licensed under MIT or Apache-2.0.

Models

Each model scores a triple (head, relation, tail) differently:

Model	Scoring function	Intuition	Reference
TransE	$\lVert \mathbf{h} + \mathbf{r} - \mathbf{t} \rVert$	Translation: tail = head + relation	Bordes et al., 2013
RotatE	$\lVert \mathbf{h} \circ \mathbf{r} - \mathbf{t} \rVert$	Rotation in complex plane	Sun et al., 2019
ComplEx	$\text{Re}(\langle \mathbf{h}, \mathbf{r}, \bar{\mathbf{t}} \rangle)$	Asymmetric via complex conjugate	Trouillon et al., 2016
DistMult	$\langle \mathbf{h}, \mathbf{r}, \mathbf{t} \rangle$	Element-wise product, symmetric	Yang et al., 2015

$\mathbf{h}, \mathbf{r}, \mathbf{t}$ are learned embedding vectors for head, relation, and tail. $\lVert \cdot \rVert$ is the L2 norm, $\circ$ is element-wise product, $\langle \cdot \rangle$ is the trilinear dot product, $\bar{\mathbf{t}}$ is the complex conjugate.

Quick start

Install with cargo install tranz --features candle.

# Train with 1-N scoring (recommended)
tranz train --data data/WN18RR/ --model complex --dim 200 \
    --1n --label-smoothing 0.1 --reciprocals \
    --epochs 100 --lr 0.001 --output embeddings/ --eval

# Train with negative sampling (classic)
tranz train --triples my_graph.tsv --model transe --dim 200 \
    --epochs 500 --gamma 9.0 --alpha 0.5 --output embeddings/ --eval

# Predict from saved embeddings
tranz predict --embeddings embeddings/ --model distmult \
    --head "aspirin" --relation "treats" --k 10

Benchmark: WN18RR

Model	Config	Dim	Epochs	MRR	H@1	H@10
ComplEx	Adagrad + N3 + reciprocals	100	100	0.438	0.400	0.512
ComplEx	Adam + reciprocals	100	50	0.429	0.407	0.469
DistMult	Adam + 1-N	100	50	0.341	0.329	0.362

Published ComplEx MRR on WN18RR is 0.475 (Lacroix et al. 2018). tranz reaches 92% of published with the same recipe (Adagrad, N3, reciprocals).

Library usage

use tranz::{TransE, DistMult, Scorer};
use tranz::dataset::{load_dataset, FilterIndex, InternedDatasetExt};
use tranz::eval::evaluate_link_prediction;

// Load dataset (types from lattix::kge)
let ds = load_dataset("data/WN18RR".as_ref()).unwrap();
let mut interned = ds.into_interned();
interned.add_reciprocals();

// Create model and query
let model = DistMult::new(interned.num_entities(), interned.num_relations(), 200);
let top10 = model.top_k_tails(0, 0, 10);

// Evaluate (filtered link prediction)
let filter = FilterIndex::from_dataset(&interned);
let metrics = evaluate_link_prediction(&model, &interned.test, &filter, interned.num_entities());

Generic triple loading

use tranz::dataset::{Dataset, DatasetExt};

let ds = Dataset::load_flexible("my_graph.tsv".as_ref()).unwrap();
let ds = ds.split(0.1, 0.1); // 80/10/10
let interned = ds.into_interned();

Embedding export

use tranz::io::{export_embeddings, flatten_matrix};

// Export to w2v TSV
export_embeddings("output/".as_ref(), &names, &vecs, &rel_names, &rel_vecs).unwrap();

// Flat f32 matrix for FAISS/Qdrant
let flat: Vec<f32> = flatten_matrix(&vecs);

Training (requires `candle` feature)

Two training modes:

1-N scoring (recommended): scores all entities per query via matmul + BCE loss. Faster convergence, no negative sampling noise.

Negative sampling (classic): samples k negatives per positive with self-adversarial weighting (SANS).

use tranz::train::{train, TrainConfig, ModelType};

let config = TrainConfig {
    model_type: ModelType::DistMult,
    dim: 200,
    one_to_n: true,
    label_smoothing: 0.1,
    embedding_dropout: 0.1,
    epochs: 100,
    ..TrainConfig::default()
};

let result = train(&triples, num_entities, num_relations, &config, &device).unwrap();

Companion to subsume

subsume embeds entities as geometric regions (boxes, cones) where containment encodes subsumption. tranz embeds entities as points where distance/similarity encodes relational facts.

subsume: ontology completion, taxonomy expansion, logical query answering
tranz: link prediction, relation extraction, knowledge base completion

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
.github		.github
examples		examples
src		src
tests		tests
.gitignore		.gitignore
BENCHMARKS.md		BENCHMARKS.md
CHANGELOG.md		CHANGELOG.md
Cargo.toml		Cargo.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tranz

Models

Quick start

Benchmark: WN18RR

Library usage

Generic triple loading

Embedding export

Training (requires `candle` feature)

Companion to subsume

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tranz

Models

Quick start

Benchmark: WN18RR

Library usage

Generic triple loading

Embedding export

Training (requires candle feature)

Companion to subsume

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Training (requires `candle` feature)

Packages