nlp-datasets

Here are 146 public repositories matching this topic...

ArmanBehnam / NLP

Natural language processing including Datasets,Farsi NLP, Automated Essay Scoring, Automatic Speech Recognition and etc.

nlp natural-language-processing tutorial language-modeling dataset persian natural-language-generation nlp-resources language-model farsi natural-language-inference nlp-machine-learning nlp-datasets

Updated Oct 14, 2020
Jupyter Notebook

claudiu1989 / Synonyms-detection

Star

Experiments with word2vec embeddings for synonyms detection, for the Romanian language.

nlp embeddings romanian nlp-resources nlp-machine-learning nlp-datasets

Updated Sep 10, 2023
Python

Kevinlee49 / analysis-youtube-comment-krisandme

Star

I tried to figure out positive and negative comments on my Youtube videos. So, I used NLP to analyze comments. I set the main language as Korean, but you can try setting English as the main language.

nlp nlp-machine-learning nlp-datasets

Updated Aug 23, 2021
Jupyter Notebook

LIAAD / PT-Pump-Up

Star

Hub for the Portuguese language NLP Resources

nlp natural-language-processing resources nlp-resources portuguese-language nlp-datasets

Updated Apr 18, 2024
PHP

gcunhase / ArXivAbsTitleDataset

Star

Extract Abstract and Title Dataset from arXiv articles

arxiv nlp-datasets abstract-to-title

Updated Dec 9, 2019
Python

josherich / nlp-dataset-explorer

Star

NLP datasets explorer

nlp datasets nlp-datasets

Updated Dec 11, 2022
Vue

tdude92 / reddit-short-stories

Star

4,308 short stories (4 million words) scraped from https://reddit.com/r/WritingPrompts

nlp dataset machine-learning-dataset nlp-datasets

Updated Apr 28, 2021

turkish-nlp-suite / Vitamins-Supplements-Reviews

Star

Repo for Turkish sentiment analysis dataset, "Vitamins and Supplements Customer Reviews"

nlp nlp-datasets sentiment-analysis-dataset turkish-nlp turkce-veriseti medical-nlp turkish-nlp-dataset turkce-sentiment-analysis-veriseti

Updated Jul 11, 2023

karanlohia98 / Language-Detection-using-stopwords

Star

10 languages are classified using the stopwords included in the nltk library.

nlp language detection nltk stopwords languagedetector nlp-machine-learning nlp-datasets

Updated May 25, 2019
Jupyter Notebook

saakolch / procedure_of_extracting_data

Star

Data preprocessing and training on Drug Review Dataset using Hugging Face library

classifier-model nlp-datasets nlp-deep-learning

Updated May 19, 2024
Jupyter Notebook

DARK-art108 / FinBox-NLP-Exercise

Sponsor

Star

An NLP Exercise

nlp nltk nlp-resources nlp-datasets

Updated Sep 7, 2021
Jupyter Notebook

RaiBP / incidental-bilingualism

Star

Python program for detecting unintentional bilingual and translation instances in NLP datasets.

python nlp machine-learning natural-language-processing deep-learning language-detection nlp-resources nlp-datasets code-switching

Updated Feb 11, 2024
Python

readerbench / ro-offense-sequences

Star

nlp romanian offensive-language nlp-datasets romanian-language hate-speech-detection nlp-data nlp-dataset

Updated Jun 27, 2023

SemiringInc / Mueller-Report-Corpus

Star

The Mueller Report Corpus V 0.1

nlp corpus corpus-linguistics nlp-datasets

Updated May 12, 2020

vgupta123 / infotabs-code

Star

Implementation of the semi-structured inference model in our ACL 2020 paper. INFOTABS: Inference on Tables as Semi-structured Data

nlp wikipedia svm transformers transformer tables snli mnli nli nlp-datasets roberta acl2020

Updated May 7, 2020
Python

BrunoGianetti / MyNLPProjects

Star

My project storage in NLP

Updated Feb 15, 2024
Jupyter Notebook

Robin1999Stark / Recipe_Tagger

Star

NLP Project for Auto Labeling Receipes

python nlp ai pipeline tags tag languages tagger token mit-license ner piplines nlp-datasets alogirthm tokeniz augsburg-university

Updated Feb 28, 2024
Python

Shayokh144 / Bengali-Literature-Data-Collection

Star

nlp-datasets bengali-dataset

Updated Aug 11, 2020

sammitjain / loksabha-questions

Star

Questions asked in the Lok Sabha - collection and analysis of trends. Creating the dataset from scratch.

dataset india government-data public-policy nlp-machine-learning nlp-datasets

Updated Apr 21, 2022
Jupyter Notebook

PranavNV / Nationality-Prejudice-in-Text-Generation

Star

This project focuses on the analysis of text generation models such as GPT-2 to identify and understand populistic behaviors or biases against various nationality.

social-infomatics nlp-datasets ethics-in-ai

Updated Mar 14, 2023

Improve this page

Add a description, image, and links to the nlp-datasets topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nlp-datasets topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nlp-datasets

Here are 146 public repositories matching this topic...

ArmanBehnam / NLP

claudiu1989 / Synonyms-detection

Kevinlee49 / analysis-youtube-comment-krisandme

LIAAD / PT-Pump-Up

gcunhase / ArXivAbsTitleDataset

josherich / nlp-dataset-explorer

tdude92 / reddit-short-stories

turkish-nlp-suite / Vitamins-Supplements-Reviews

karanlohia98 / Language-Detection-using-stopwords

saakolch / procedure_of_extracting_data

DARK-art108 / FinBox-NLP-Exercise

RaiBP / incidental-bilingualism

readerbench / ro-offense-sequences

SemiringInc / Mueller-Report-Corpus

vgupta123 / infotabs-code

BrunoGianetti / MyNLPProjects

Robin1999Stark / Recipe_Tagger

Shayokh144 / Bengali-Literature-Data-Collection

sammitjain / loksabha-questions

PranavNV / Nationality-Prejudice-in-Text-Generation

Improve this page

Add this topic to your repo