Centre-for-Information-Resilience/ethiopia-hate-speech-lexicon

Inflammatory Keywords and Phrases

CIR developed a lexicon of inflammatory keywords in four languages (Amharic, Afaan Oromo, Tigrigna, and English) that may be indicative of hate speech along gendered, ethnic, and religious lines. CIR believes that this is currently the most comprehensive lexicon for the Ethiopian context.

For more information on the lexicon's development, see CIR's report "Normalised and Invisible: An Analysis of gendered hate-speech on social media in Ethiopia" and the conference paper "Resources for Annotating Hate Speech in Social Media Platforms Used in Ethiopia: A Novel Lexicon and Labelling Scheme".

It is important to note that the terms, on their own, may not constitute hate speech. The keywords were used to obtain social media content that could contain hate speech; human annotators then assessed whether each item was or was not hate speech, following the detailed annotation protocol.
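
The keyword step described above can be sketched as a simple candidate filter. This is a minimal illustration, not CIR's actual pipeline: the function names, the placeholder terms, and the choice of plain substring matching after Unicode normalisation are all assumptions made for the example. A match only flags a post for human annotation; it does not label the post as hate speech.

```python
import unicodedata


def normalise(text: str) -> str:
    """Apply Unicode NFC normalisation and case folding so matching is consistent."""
    return unicodedata.normalize("NFC", text).casefold()


def find_candidates(posts, keywords):
    """Return (post, matched_keywords) pairs for posts containing any keyword.

    A match only marks a post for human review; it is not a hate speech label.
    """
    normalised = [(kw, normalise(kw)) for kw in keywords]
    candidates = []
    for post in posts:
        haystack = normalise(post)
        # Plain substring matching (an assumption); it can over-match.
        matched = [kw for kw, norm in normalised if norm in haystack]
        if matched:
            candidates.append((post, matched))
    return candidates


# Hypothetical placeholder terms, not entries from the lexicon.
keywords = ["placeholder_term", "ምሳሌ"]
posts = [
    "A post containing PLACEHOLDER_TERM in capitals.",
    "An unrelated post.",
    "ይህ ምሳሌ ነው",
]
candidates = find_candidates(posts, keywords)
```

Substring matching is deliberately permissive here, since the keywords serve to retrieve content for annotators rather than to classify it.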

Contact us: CIR Website

About

CIR: An analysis of gendered hate speech on social media in Ethiopia
