The 3,140,000 Groups - Chinese-Spanish Parallel Corpus Data is a bilingual texts is stored in text format. All of the data are related to science and technology. average sentence length is 37.1 characters. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1187?source=Github
TXT
Chinese-Spanish Parallel Corpus Data
3.14 million pairs of Chinese-Spanish Parallel Corpus Data. The Chinese sentences contain 37.1 characters on average.
Chinese,Spanish
machine translation
90%
Commercial License