[Lexical/terminological resource] Bilingual medical glossaries for various language pairs.
Clone or download
Montserrat Marimon
Montserrat Marimon new version without datasets
Latest commit 8cba143 Dec 11, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
LICENSE Update LICENSE Dec 11, 2018
README.md Update README.md Dec 11, 2018

README.md

MeSpEn_Glossaries: Medical Glossaries

Digital Object Identifier (DOI) and access to dataset files

https://doi.org/10.5281/zenodo.2205690

Introduction

Hand crafted glossaries are a particularly valuable resource for the medical translator community and have shown to boost performance of MT systems.

This repository contains forty-six bilingual medical glossaries for various language pairs generated from free online medical glossaries and dictionaries made by professional translators.

Language pairs Num. entries Language pairs Num. entries Language pairs Num. entries
Arabic-English 9844 English-Hindi 124 French-German 169
Bengali-Englisha 876 English-Hungarian 3631 French-Italian 118
Bulgarian-English 349 English-Indonesian 494 French-Spanish 181
Chinese-English 93736 English-Italian 24351 German-Italian 982
Croatian-English 339 English-Japanese 27974 German-Portuguese 2056
Croatian-French 235 English-Korean 110382 German-Romanian 109
Croatian-German 117 English-Norwegian 44 German-Russian 233
Danish-English 194 English-Polish 4250 German-Spanish 7029
Danish-Polish 166 English-Portuguese 2623 German-Swedish 2232
Dutch-English 10453 English-Romanian 210 Italian-Spanish 199
Dutch-French 650 English-Russian 4774 Latin-Polish 237
Dutch-Spanish 71 English-Slovenian 947 Latin-Russian 2518
Dutch-Turkish 238 English-Spanish 125645 Polish-Spanish 273
English-French 7718 English-Swedish 1078 Portuguese-Spanish 62
English-German 19304 English-Thai 335 Russian-Spanish 127
English-Greek 2640 English-Turkish 7717

Table 1: Number of entries in MeSpEn_Glossaries.

Format

Glossaries are encoded in standard tab-separated values (tsv) format.

Contact

Montserrat Marimon (montserrat.marimon@bsc.es)

License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2018 Secretaría de Estado para el Avance Digital (SEAD)