Parallel corpus built with SNOMED CT (by using LaBSE for candidate matching).
The current iteration provides parallel corpuses for concepts for which at least one official translation exsits, in any of the following languages:
- Spanish (
es
): - French (
fr
) - Dutch (
nl
) - Danish (
da
) - Swedish (
sv
)
The usage of these files is subject to the SNOMED licensing restrictions, see LICENSE.txt
.
If you use this dataset for a scientific work, please cite the following work
@inproceedings{remy-etal-2022-translating,
title = {{Taming large lexicons : translating clinical text using medical ontologies and sentence templates}},
author = {Remy, François and De Jaeger, Peter and Demuynck, Kris},
booktitle = {{EmP : 1st RADar conference on Engineer meets Physician, Proceedings}},
location = {{Roeselare, Belgium}},
year = {{2022}},
pages = {{5}},
url = {http://hdl.handle.net/1854/LU-01GMN7J3X37CKWCKQV6DWGM4ZE}
}