Automatically generated bilingual dictionaries based on parallel sentence data, provided by Dharmamitra.
These StarDict format dictionaries are created from parallel sentence alignments to assist in research for Sanskrit<>Tibetan. They can be used with GoldenDict and other compatible viewers. A Mac version is under preperation.
The Sanskrit-Tibetan dictionary is here: MITRA sa-bo
The Tibetan-Sanskrit dictionary is here: MITRA bo-sa
Warning: Each dictionary will require more than 10GB of disk space after unzipping! These contain over 4m headword entries based on 600,000 parallel Sanskrit-Tibetan sentence pairs.
- Bilingual translations of terms and phrases automatically extracted from large sets of parallel sentence pairs
- StarDict format compatible with many dictionary applications
- The phrase entries of the dictionaries are automatically generated, they are not manually checked and we don't claim that these are exhaustive resources. The boundary detection is not always reliable. These are not manually curated dictionaries, so use with caution!
- The size is large with millions of entries based on more than 1.7m sentence pairs combined. The rendering can therefore be a bit slow depending on the entry size.
- Download the desired dictionary files from the link above and unzip them to your favorite folder
- Install a StarDict-compatible dictionary viewer (like GoldenDict, StarDict, or SDCV)
- Add your favorite folder with the dictionary files to your viewer
- Start looking up terms!
These dictionaries work with:
- GoldenDict
- StarDict
- SDCV (StarDict Console Version)
- And other StarDict-compatible dictionary programs
These dictionaries are released under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0).
These dictionaries are created by Sebastian Nehrdich and Kurt Keutzer in context of the Dharmamitra.org project with generous support by the Tsadra Foundation. We are dedicated to making work on Sanskrit, Tibetan, and Buddhist Chinese easier.
@inproceedings{nehrdich-2022-sanstib,
title = "{S}ans{T}ib, a {S}anskrit - {T}ibetan Parallel Corpus and Bilingual Sentence Embedding Model",
author = "Nehrdich, Sebastian",
booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
month = jun,
year = "2022",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2022.lrec-1.724",
pages = "6728--6734"
}A project of Dharmamitra.org

