OntologyAlignmentWithEmbeddings

This project has been created for an LREC 2018 paper to align labels of two lightweight ontologies (industry classification systems) in four languages (en, de, it, es) using pre-trained embedding libraries. We were interested in seeing whether good results can be achieved with using already existing embeddings. It turned out that for our domain-specific multilingual scenario fastText provided the most successful results. For detailed results please consult our LREC 2018 paper.

Embedding repositories

The pretrained embeddings utilized in this project were retrieved from the following repositories.

Dependencies

gensim version 3.0.0
numpy version 1.13.3
nltk version 3.2.1
pandas version 0.18.1
polyglot version 16.07.04
re version 2.2.1
scipy version 0.19.1

References

If you use any of this code please cite the following paper: Gromann, D. and Declerck, T. "Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task", In: LREC 2018.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
OntAlignEmbeddings		OntAlignEmbeddings
LICENSE.txt		LICENSE.txt
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OntologyAlignmentWithEmbeddings

Embedding repositories

Dependencies

References

About

Releases

Packages

Languages

License

dgromann/OntologyAlignmentWithEmbeddings

Folders and files

Latest commit

History

Repository files navigation

OntologyAlignmentWithEmbeddings

Embedding repositories

Dependencies

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages