3140000-Groups-Chinese-Spanish-Parallel-Corpus-Data

Description

The 3,140,000 Groups - Chinese-Spanish Parallel Corpus Data is a bilingual texts is stored in text format. All of the data are related to science and technology. average sentence length is 37.1 characters. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1187?source=Github

Format

TXT

Data content

Chinese-Spanish Parallel Corpus Data

Data size

3.14 million pairs of Chinese-Spanish Parallel Corpus Data. The Chinese sentences contain 37.1 characters on average.

Language

Chinese,Spanish

Application scenario

machine translation

Accuracy rate

90%

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
zh_es 样例展示.png		zh_es 样例展示.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

3140000-Groups-Chinese-Spanish-Parallel-Corpus-Data

Description

Format

Data content

Data size

Language

Application scenario

Accuracy rate

Licensing Information

About

Releases

Packages

Nexdata-AI/3140000-Groups-Chinese-Spanish-Parallel-Corpus-Data

Folders and files

Latest commit

History

Repository files navigation

3140000-Groups-Chinese-Spanish-Parallel-Corpus-Data

Description

Format

Data content

Data size

Language

Application scenario

Accuracy rate

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages