Skip to content

BoulahiaAhmed/Arabic-Quotes-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Arabic Quotes Dataset

Dataset Size Tags per Quote Language License

Overview

The Arabic Quotes Dataset is an open-source collection of 5900+ quotes in the Arabic language, accompanied by up to three tags for each quote. The dataset is suitable for various Natural Language Processing (NLP) tasks, such as text classification and tagging.

Data Description

  • Contains 5900+ quotes with up to three associated tags per quote.
  • All quotes and tags are in Arabic.

Use Cases

  • Text Classification: Classify quotes into predefined categories.
  • Tagging: Assign relevant labels or themes to quotes.
  • Sentiment Analysis: Analyze sentiment expressed in quotes.
  • Language Modeling: Train models to generate Arabic quotes.
  • Information Retrieval: Retrieve quotes relevant to specific topics.

License

The "Arabic Quotes" dataset is distributed under the Apache License 2.0. Feel free to use it for any purpose, giving appropriate credit to the original source.

Data Format

The dataset is available in CSV format. Each row represents a quote with its associated tags. Example structure:

quote,tags
"أنا لا أبالي برأي الناس، أنا لست عبدًا لتقييماتهم.","[حرية, تحفيز, قوة]"
"الصمت هو أكبر إجابة.", "[سكوت, حكمة]"
...

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published