Natural language processing resources, including datasets, Farsi NLP, automated essay scoring, automatic speech recognition, and more.
Updated Oct 14, 2020 - Jupyter Notebook
Experiments with word2vec embeddings for synonym detection in the Romanian language.
I wanted to identify positive and negative comments on my YouTube videos, so I used NLP to analyze them. The main language is set to Korean, but you can try setting English as the main language instead.
Hub for Portuguese-language NLP resources
Extract a dataset of abstracts and titles from arXiv articles
4,308 short stories (4 million words) scraped from https://reddit.com/r/WritingPrompts
Repo for Turkish sentiment analysis dataset, "Vitamins and Supplements Customer Reviews"
Classifies text into 10 languages using the stopword lists included in the NLTK library.
Data preprocessing and training on the Drug Review Dataset using the Hugging Face library
Python program for detecting unintentional bilingual and translation instances in NLP datasets.
Implementation of the semi-structured inference model from our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data
Storage for my NLP projects
Questions asked in the Lok Sabha - collection and analysis of trends. Creating the dataset from scratch.
This project analyzes text generation models such as GPT-2 to identify and understand populist behaviors or biases against various nationalities.
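
One of the repositories listed above classifies languages using stopword lists. The idea can be sketched roughly as follows: count how many tokens of a text appear in each language's stopword set and pick the language with the highest count. This is a minimal sketch, not that repository's code. The `STOPWORDS` table and the `guess_language` function are hypothetical names, and the tiny hand-written word sets are an assumption standing in for the full NLTK stopword lists.

```python
# Tiny illustrative stopword sets (assumption: a real setup would load the
# full per-language lists, for example from NLTK's stopwords corpus).
STOPWORDS = {
    "english": {"the", "and", "is", "of", "to", "in", "that", "it"},
    "spanish": {"el", "la", "y", "es", "de", "que", "en", "los"},
    "german": {"der", "die", "und", "ist", "das", "nicht", "ein", "zu"},
}


def guess_language(text, stopwords=STOPWORDS):
    """Return the language whose stopwords match the most tokens, or None."""
    # Naive whitespace tokenization; punctuation is not stripped here.
    tokens = text.lower().split()
    # Score each language by the number of tokens found in its stopword set.
    scores = {
        lang: sum(tok in words for tok in tokens)
        for lang, words in stopwords.items()
    }
    best = max(scores, key=scores.get)
    # If no stopword matched at all, decline to guess.
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    print(guess_language("the cat is in the garden"))
    print(guess_language("el gato es de la casa"))
```

Short texts with few stopwords are hard to classify this way, so the sketch returns `None` when nothing matches rather than guessing.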