A collection of Farsi (Persian) datasets
Updated Jul 15, 2021 - Python
The first dataset for Farsi fact extraction and verification
An Image Dataset of Printed Farsi Text for OCR Research
The first intelligent Persian reverse dictionary
CLIPfa: Connecting Farsi Text and Images
This repository applies the WavLM model to high-quality and poor-quality data for the speaker verification task and uses the PyCM library for evaluation.
A simple script to crawl data from Persian news agencies, including Fars and Mehr.
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
Official GitHub repository for Persis: a Persian font recognition pipeline using convolutional neural networks.
Persian/Farsi text-to-speech (TTS) training using Coqui TTS
A Persian-to-Finglish dataset with audio for every sentence, used as a TTS dataset to train Tacotron 2
This repo shows how to fine-tune the wav2vec 2.0 model, including its prerequisites.