Skip to content

shucoll/NEHate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

NEHate

This repository contains the data and code for the paper "NEHATE: Large-Scale Annotated Data Shedding Light on Hate Speech in Nepali Local Election Discourse", accepted for publication at the ECAI (European Conference on Artificial Intelligence) 2023.

Abstract

The use of social media during election campaigns has become increasingly popular. However, the unbridled nature of online discourse can lead to the propagation of hate speech, which has far reaching implications for the democratic process. Natural Language Processing (NLP) techniques are being used to counteract the spread of hate speech and promote healthy online discourse. Despite the increasing need for NLP techniques to combat hate speech, research on low-resource languages such as Nepali is limited, posing a challenge to the realization of the United Nations’ Leave No One Behind principle, which calls for inclusive development that benefits all individuals and communities, regardless of their backgrounds or circumstances. To bridge this gap, we introduce NEHATE, a large-scale manually annotated dataset of hate speech and its targets in Nepali local election discourse. The dataset comprises 13,505 tweets, annotated for hate speech with further sub-categorization of hate speech into targets such as community, individual, and organization. Benchmarking of the dataset with various algorithms has shown potential for performance improvement. We have made the dataset publicly available at https://github.com/shucoll/NEHate to promote further research and development, while also contributing to the UN SDGs aimed at fostering peaceful, inclusive societies, and justice and strong institutions.

Accessing the Tweets

Please refer to this link in the official twitter API guide to get the tweet contents from the tweet ids https://developer.twitter.com/en/docs/twitter-api/tweets/lookup/api-reference/get-tweets

Citation

If you decide to use this dataset please cite the following paper.

NEHATE: Large-Scale Annotated Data Shedding Light on Hate Speech in Nepali Local Election Discourse

@incollection{thapa2023nehate,
  title={Nehate: Large-scale annotated data shedding light on hate speech in nepali local election discourse},
  author={Thapa, Surendrabikram and Rauniyar, Kritesh and Shiwakoti, Shuvam and Poudel, Sweta and Naseem, Usman and Nasim, Mehwish},
  booktitle={ECAI 2023},
  pages={2346--2353},
  year={2023},
  publisher={IOS Press}
}

About

Nepali hate speech twitter dataset on election discourse

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors