KG Embeddings

Generate embeddings on the graph, with RDF2Vec - using pyRDF2Vec -, TransE, DistMult - using pyKEEN.

Training

In the config folder, edit appropriately the following files:

object_properties.txt the list of object properties which will be taken into account for the random walks (1 per line);
prefixes.txt to be added to the SPARQL queries
get_entities.rq the query for getting the URIs of all entities of interest.

Run the following commands for downloading the data on your machine:

pip install -r requirements.txt
python preprocessing.py

Finally, generate the embeddings using:

python main.py [entities list] [-a algorith_name]

where entities list is a list of entities uri (1 per line) in a textual file. Following the generated files by the preprocessing, you can run (for example):

python main.py voc
python main.py smells -a TransE

This is producing an [entity].kv file, which is a gensim's KeyedVector file.

Load and use embeddings

Load embeddings in this way:

emb = KeyedVectors.load('emb.kv')

Search the most similar to a term:

emb.most_similar('http://data.odeuropa.eu/vocabulary/olfactory-objects/269', topn=10) # incense

# 0.7755   http://data.odeuropa.eu/vocabulary/olfactory-objects/267   Frankincense

Refer to gensim's documentation for further possibilities.

Clustering and link predicting

We performed experiment for clustering and link predicting using the following code:

Clustering: notebook
Link prediction with TransE and DistMult: notebook

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
config		config
embeddings		embeddings
.gitignore		.gitignore
Clustering.ipynb		Clustering.ipynb
LICENSE		LICENSE
README.md		README.md
Using embeddings.ipynb		Using embeddings.ipynb
Visualizing Word2Vec Word Embeddings using t-SNE.ipynb		Visualizing Word2Vec Word Embeddings using t-SNE.ipynb
edgelists2nt.py		edgelists2nt.py
graph-transe.ipynb		graph-transe.ipynb
link_prediction.py		link_prediction.py
linkpred.ipynb		linkpred.ipynb
main.py		main.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KG Embeddings

Training

Load and use embeddings

Clustering and link predicting

About

Releases

Packages

Languages

License

Odeuropa/kg-embeddings

Folders and files

Latest commit

History

Repository files navigation

KG Embeddings

Training

Load and use embeddings

Clustering and link predicting

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages