Experiments with word2vec embeddings for synonym detection, for the Romanian language. (Python; updated Sep 10, 2023)
I tried to distinguish positive and negative comments on my YouTube videos, so I used NLP to analyze them. The main language is set to Korean, but you can try English instead.
A dataset of 2,095 plain-text articles across 5 categories, with over 805k words in total.
4,308 short stories (4 million words) scraped from https://reddit.com/r/WritingPrompts
Repo for Turkish sentiment analysis dataset, "Vitamins and Supplements Customer Reviews"
Data preprocessing and training on Drug Review Dataset using Hugging Face library
Sentiment Analysis on Product Reviews (project associated with Zummit Infolabs).
My NLP project storage.
Web crawler for Turkish news.
Official repository for "Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning".
Python program for detecting unintentional bilingual and translation instances in NLP datasets.
A novel Romanian-language dataset for offensive message detection, with comments from a local Romanian news website (stiri de cluj) manually annotated into five classes.
A Python library for managing and annotating text corpora in different formats.
➰Loop through a TSV file and pass columns of data to an external program. A Bash script.
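A minimal sketch of the technique that entry describes — looping over a TSV file in Bash and handing a column to an external program. The file names, column layout, and the use of `wc -w` as the external command are assumptions for illustration, not details taken from that repo:

```shell
#!/usr/bin/env bash
# Create a tiny sample TSV (assumed layout: an id column and a text column).
printf 'a\thello world\nb\thi\n' > sample.tsv

# Start with a fresh output file.
: > counts.tsv

# Read the TSV line by line, splitting on tabs only. `read -r` keeps
# backslashes literal; IFS=$'\t' prevents splitting on spaces inside fields.
while IFS=$'\t' read -r id text; do
    # Pass the text column to an external program (`wc -w` here, standing
    # in for any command). Arithmetic expansion strips wc's padding.
    words=$(( $(printf '%s' "$text" | wc -w) ))
    printf '%s\t%s\n' "$id" "$words" >> counts.tsv
done < sample.tsv
```

Keeping the per-line split in `read` (rather than `cut`-ing the file twice) means each row is processed once, and extra columns can be captured by adding a trailing variable to the `read` list.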
Classifying an SMS as spam or non-spam using Natural Language Processing (NLP) and Machine Learning.
EACL 2021 paper (SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification)
Creating an NLP pipeline to clean movie review data and write the cleaned data to an output file.