lib2vec - A Multi-Faceted Document Embedding Approach

This repository contains code to compute multi-faceted embeddings of lib2vec(often called book2vec within this code basis). Besides, many different experiments, scripts for generating tables as well as plots, and approaches for possibilities not pursued further are contained.

All configurations should be registered in the config.json. Most important are the following three methods:

EvaluationUtils.build_corpora(...): creates python representations for given corpora and applies filters. For self-defined corpora, appropriate parsing methods must be created before.
EvaluationUtils.train_vecs(...): creates vector representations, for example lib2vec embeddings, to given corpora names.
EvaluationUtils.run_evaluation(...): evaluates for the Similarity Tasks.

The d3 directory contains an exploratory visualization of various facets of lib2vec. The necessary file is generated by experiments/embedding_porjection.py.

experiments/book_comparison.py contains experiments for the book comparison task.

experiments/predicting_high_rated_books contains the evaluation for the Scenario predicting high rated books.

The Book Comparison Survey is analyzed by boco_survey/survey_analyses.py and converted to a dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 149 Commits
baselines		baselines
boco_survey		boco_survey
configs		configs
d3		d3
experiments		experiments
extensions		extensions
figures		figures
heideltime_scripts		heideltime_scripts
lib2vec		lib2vec
paper_plots_tables		paper_plots_tables
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

lib2vec - A Multi-Faceted Document Embedding Approach

About

Releases

Packages

Languages

License

LasseKohlmeyer/ma-doc-embeddings

Folders and files

Latest commit

History

Repository files navigation

lib2vec - A Multi-Faceted Document Embedding Approach

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages