Costra

This is a tool for automatic evaluation of Czech sentence embeddings using Costra 1.1 dataset.

More information can be found in the following paper:

Petra Barančíková and Ondřej Bojar: Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces. In: TSD 2020. Lecture Notes in Computer Science, vol 12284. Springer, Cham.

The presentation of the paper with the accompanying video can be found here.

Installation

$ pip install costra

Usage

You can get sentences from Costra using the following command:

from costra import costra
sentences = costra.get_sentences()

Use the sentences to generate your embeddings. The embeddings are evaluating the following way:

costra.evaluate(YOUR_EMBEDDINGS)

Citation

If you use the tool for academic purporses, please consider citing the following paper:

@inproceedings{Costra,
  author    = {Petra Baran{\v{\c}}{\'{\i}}kov{\'{a}} and Ond{\v{\r}}ej Bojar},
  editor    = {Petr Sojka and Ivan Kope{\v{\c}}ek and Karel Pala and Ales Hor{\'{a}}k},
  title     = {Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces},
  booktitle = {Text, Speech, and Dialogue - 23rd International Conference, {TSD}
               2020, Brno, Czech Republic, September 8-11, 2020, Proceedings},
  series    = {Lecture Notes in Computer Science},
  volume    = {12284},
  pages     = {135--143},
  publisher = {Springer},
  year      = {2020},
  url       = {https://doi.org/10.1007/978-3-030-58323-1\_14},
  doi       = {10.1007/978-3-030-58323-1\_14},
}

License

The data is distributed under the Creative Commons 4.0 BY.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
costra		costra
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Costra

Installation

Usage

Citation

License

About

Releases

Packages

Languages

barancik/costra

Folders and files

Latest commit

History

Repository files navigation

Costra

Installation

Usage

Citation

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages