
#  SOCIAL MEDIA CYBERBULLYING DETECTION
 

Presented by:
1. Stella Kiarie

2. Kumati Dapash

3. Morvine Otieno

4. Doris Mutie

# 1. BUSINESS UNDERSTANDING 

## 1.1 Overview   
The Kenyan digital landscape is one of the most vibrant in Africa, driven by a large population of young, tech-savvy users and a high rate of mobile internet usage. The major platforms for social engagement, entertainment and business are Facebook, YouTube, TikTok and X (formerly Twitter), where content creators, artists, vloggers and commentators attract millions of comments daily.   
Due to the anonymity and viral nature of these platforms, a culture of cyberbullying and online hostility is developing. Unlike Western contexts, where comments are mostly written in widely supported languages, Kenyan comment sections are characterized by code-switching between Kiswahili, English and Sheng. Sheng in particular is often used to deliver insults or to "roast" individuals online.   

Current global cyberbullying moderation tools are built primarily for widely spoken languages such as English and French, with only limited support for Swahili. Given Kenya's linguistic complexity, these tools fail to capture harmful Sheng phrases, leaving Kenyan content creators, public figures and online users vulnerable to online harassment, which in turn can lead to severe mental health issues, reputational damage, suicidal ideation and digital exclusion.    

Social media connects us and entertains us, but sometimes it destroys us. Remember: behind every trending post or hashtag, there is a real person with real feelings. 

## 1.2  Problem Statement 

There is a significant gap in automated systems that can detect and moderate cyberbullying within the Kenyan linguistic context.  
Comments on Kenyan social media posts are often written in Sheng or in code-switched language, which bypasses existing moderation tools. Because these tools miss the linguistic nuances, toxic content remains online, fueling a hostile digital environment. Content creators and brands are unable to moderate thousands of multilingual comments in real time, resulting in a toxic feedback loop that damages engagement and mental well-being. 

## 1.3 Challenges 

1. Linguistic Fluidity: Sheng is not standardized, and a word may carry different meanings in different locations.
2. Code-mixed Content: Mixing English, Swahili and Sheng in one comment often confuses existing NLP models.
3. High Volume: Kenyan social media generates thousands of comments within hours, which limits human moderation.
4. Contextual Sarcasm: When Sheng is used, deep contextual understanding is required to distinguish comments that are "Bullying" or "Hate Speech" from those that are "Humor/Irony/Sarcasm".
5. Data Scarcity: There are no publicly available, high-quality Sheng datasets labelled specifically for toxicity.

## 1.4  Proposed Solution

We propose building an advanced multilingual NLP classification system specifically tuned for the Kenyan linguistic landscape. Human annotation is carried out through a structured workflow in Label Studio to ensure consistent, high-quality annotation of Kenyan social media comments.  
Each comment presented to the annotator will be labelled in four sequential steps:
1. Language Identification
2. Sentiment Classification
3. Major Category Classification
4. Subcategory Classification  

This hierarchical labelling helps analyze both the general and the detailed information about the social media text.

1. Language Identification: identify the primary language used in the comment 
The major language options include:   
  
        1. English
        2. Swahili
        3. Sheng
        4. Mixed (Code-switching)
        5. Plain Emoji

2. Sentiment Annotation: After identifying the language, the annotators label the sentiment expressed in the comment.  
The sentiment categories include: 

            1. Negative
            2. Positive
            3. Neutral   

This helps capture the emotional tone of the message and distinguish harmful from constructive communication. 

3. Major Category Classification : classify the comment into a main category that represents the overall nature of the message.   
The major categories include:

        1. Constructive Criticism
        2. Offensive Content
        3. Cyberbullying
        4. Hate Speech
        5. Irony/Sarcasm
        6. Misinformation
        7. Other

4. Subcategory Classification: each major category is linked to specific subcategories. 
### Subcategories Used in Data Annotation

### Subcategories by Major Category

| Constructive Criticism | Offensive Content | Cyberbullying | Hate Speech | Irony/Sarcasm | Misinformation | Other |
|---|---|---|---|---|---|---|
| [CC] Performance-based | [Offensive] Sex-based | [Bullying] Body shaming | [Hate Speech] Gender based | [Irony] Mock compliment | [Misinfo] Fabricated Content (Fake News) | [Other] Spam |
| [CC] Corrective clarification | [Offensive] Vulgar language (Matusi) | [Bullying] Direct Insult | [Hate Speech] Political | [Irony] Mock Laughter | [Misinfo] Misleading/Contextual Deception | [Other] General Others |
| [CC] Decision-criticism | [Offensive] Non-personal | [Bullying] Threats | [Hate Speech] Ethnic/racial group (Tribalism) | [Irony] Rhetorical Question | [Misinfo] Parody | [Other] General Inquiry |
| [CC] Genuine Praise |  | [Bullying] Humiliation | [Hate Speech] Disability | [Irony] Hyperbolic understatement |  |  |
| [CC] Supportive Speech |  | [Bullying] Exclusion tactics |  |  |  |  |
| [CC] Genuine feedback |  | [Bullying] Sexual harassment |  |  |  |  |
|  |  | [Bullying] Trolling |  |  |  |  |
|  |  | [Bullying] Defamation |  |  |  |  |
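
The four-step hierarchy can be checked programmatically once annotations are exported. The block below is a minimal sketch, not the project's actual pipeline: the names `TAXONOMY`, `LANGUAGES`, `SENTIMENTS` and `validate_annotation` are hypothetical, and the label strings are transcribed from the lists and table above, so they would need to match the labels configured in Label Studio exactly.

```python
from typing import Dict, List

# Option lists transcribed from the annotation steps above (assumed spellings).
LANGUAGES = ["English", "Swahili", "Sheng", "Mixed (Code-switching)", "Plain Emoji"]
SENTIMENTS = ["Negative", "Positive", "Neutral"]

# Major category -> allowed subcategories, transcribed from the table above.
TAXONOMY: Dict[str, List[str]] = {
    "Constructive Criticism": [
        "[CC] Performance-based", "[CC] Corrective clarification",
        "[CC] Decision-criticism", "[CC] Genuine Praise",
        "[CC] Supportive Speech", "[CC] Genuine feedback",
    ],
    "Offensive Content": [
        "[Offensive] Sex-based", "[Offensive] Vulgar language (Matusi)",
        "[Offensive] Non-personal",
    ],
    "Cyberbullying": [
        "[Bullying] Body shaming", "[Bullying] Direct Insult",
        "[Bullying] Threats", "[Bullying] Humiliation",
        "[Bullying] Exclusion tactics", "[Bullying] Sexual harassment",
        "[Bullying] Trolling", "[Bullying] Defamation",
    ],
    "Hate Speech": [
        "[Hate Speech] Gender based", "[Hate Speech] Political",
        "[Hate Speech] Ethnic/racial group (Tribalism)",
        "[Hate Speech] Disability",
    ],
    "Irony/Sarcasm": [
        "[Irony] Mock compliment", "[Irony] Mock Laughter",
        "[Irony] Rhetorical Question", "[Irony] Hyperbolic understatement",
    ],
    "Misinformation": [
        "[Misinfo] Fabricated Content (Fake News)",
        "[Misinfo] Misleading/Contextual Deception", "[Misinfo] Parody",
    ],
    "Other": ["[Other] Spam", "[Other] General Others", "[Other] General Inquiry"],
}


def validate_annotation(language: str, sentiment: str, major: str, sub: str) -> List[str]:
    """Return a list of problems; an empty list means the labels are consistent."""
    problems: List[str] = []
    if language not in LANGUAGES:
        problems.append(f"unknown language: {language!r}")
    if sentiment not in SENTIMENTS:
        problems.append(f"unknown sentiment: {sentiment!r}")
    if major not in TAXONOMY:
        problems.append(f"unknown major category: {major!r}")
    elif sub not in TAXONOMY[major]:
        problems.append(f"subcategory {sub!r} does not belong to {major!r}")
    return problems
```

A check like this catches a subcategory attached to the wrong major category (for example a `[Bullying]` label under Hate Speech) before the data reaches model training.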

## 1.5 Objectives 

The goal is to develop a robust ML model that identifies and flags cyberbullying in Sheng, Swahili, and English with high accuracy. 

•	Scrape and curate specialized datasets of more than 10,000 comments from Kenyan YouTube vlogs.  

•	Use human annotators (fluent Sheng speakers) to create high-quality labelled datasets.

•	Evaluate model performance using the F1-score, ensuring the model handles imbalanced data where bullying is less frequent than neutral comments. 

•	Build a prototype API or Dashboard that content creators can use to clean their comment sections automatically. 
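
One common way to handle the class imbalance mentioned in the objectives is to weight each class inversely to its frequency during training. The sketch below uses the widely used "balanced" heuristic, `n_samples / (n_classes * class_count)`; the function name `inverse_frequency_weights` is hypothetical, and whether the final model uses class weights at all is an assumption rather than something stated in this document.

```python
from collections import Counter
from typing import Dict, Hashable, Iterable


def inverse_frequency_weights(labels: Iterable[Hashable]) -> Dict[Hashable, float]:
    """Weight each class by n_samples / (n_classes * class_count).

    Rare classes get weights above 1 and frequent classes get weights below 1.
    """
    counts = Counter(labels)
    n_samples = sum(counts.values())
    n_classes = len(counts)
    return {label: n_samples / (n_classes * count) for label, count in counts.items()}


# Toy example with made-up label counts: 8 "Other" comments and 2 "Cyberbullying".
weights = inverse_frequency_weights(["Other"] * 8 + ["Cyberbullying"] * 2)
# weights == {"Other": 0.625, "Cyberbullying": 2.5}
```

The resulting dictionary can typically be passed to a classifier or loss function that accepts per-class weights.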



## 1.6 Success Criteria 
This project will be considered successful if:  

•	The model achieves an F1-score of 0.75 (75%) or higher on the Cyberbullying and Hate Speech categories. 

•	The system successfully identifies Sheng-specific insults that standard English filters miss. 
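
The F1 criterion can be expressed as a small check. This is a sketch in plain Python under one assumption: "across the categories" is read here as each target class reaching the threshold on its own. The function names and the toy labels are hypothetical.

```python
from typing import Dict, Sequence, Tuple


def f1_for_class(y_true: Sequence[str], y_pred: Sequence[str], label: str) -> float:
    """Per-class F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def meets_success_criterion(
    y_true: Sequence[str],
    y_pred: Sequence[str],
    target_labels: Sequence[str] = ("Cyberbullying", "Hate Speech"),
    threshold: float = 0.75,
) -> Tuple[Dict[str, float], bool]:
    """Return per-class F1 scores and whether every target class meets the threshold."""
    scores = {label: f1_for_class(y_true, y_pred, label) for label in target_labels}
    return scores, all(score >= threshold for score in scores.values())


# Toy example with made-up predictions.
y_true = ["Cyberbullying", "Cyberbullying", "Hate Speech", "Other", "Other"]
y_pred = ["Cyberbullying", "Other", "Hate Speech", "Other", "Cyberbullying"]
scores, passed = meets_success_criterion(y_true, y_pred)
# scores["Cyberbullying"] is 0.5 and scores["Hate Speech"] is 1.0, so passed is False
```

Reporting per-class scores rather than overall accuracy keeps the minority classes visible, since a model that predicts only the majority class can still score high accuracy.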


# 2. DATA UNDERSTANDING 

## 2.1 Overview 

To solve the problem identified in this project, we used YouTube comments from various Kenyan videos. The data was scraped using the YouTube Data API v3, an official and legal way of accessing public YouTube data for analysis. The project uses the google-api-python-client library with the commentThreads resource.   
The script maps the raw JSON responses from YouTube into a pandas DataFrame.  
The data contains the following columns:  

•	Author: the public name of the commenter  
•	Text: raw comment content in plain text  
•	Likes: engagement metric  
•	Published at: Timestamp of the post.   



Through this strategy, we managed to scrape over 10,000 comments from Kenyan YouTube content. 
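
The collection step described above can be sketched as follows. This is an illustrative outline rather than the project's actual script: `parse_comment_threads` and `fetch_comments` are hypothetical names, and the `youtube` argument is assumed to be a client created with `googleapiclient.discovery.build("youtube", "v3", developerKey=...)`. The field names follow the `commentThreads` response format.

```python
from typing import Any, Dict, List


def parse_comment_threads(response: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Flatten one commentThreads response into rows with the four columns above."""
    rows = []
    for item in response.get("items", []):
        snippet = item["snippet"]["topLevelComment"]["snippet"]
        rows.append({
            "Author": snippet.get("authorDisplayName"),
            "Text": snippet.get("textDisplay"),
            "Likes": snippet.get("likeCount", 0),
            "Published at": snippet.get("publishedAt"),
        })
    return rows


def fetch_comments(youtube: Any, video_id: str, max_comments: int = 500) -> List[Dict[str, Any]]:
    """Page through the top-level comments of one video, up to max_comments rows."""
    rows: List[Dict[str, Any]] = []
    params: Dict[str, Any] = {
        "part": "snippet",
        "videoId": video_id,
        "maxResults": 100,          # largest page size the API allows
        "textFormat": "plainText",  # return comment text without HTML markup
    }
    while len(rows) < max_comments:
        response = youtube.commentThreads().list(**params).execute()
        rows.extend(parse_comment_threads(response))
        next_token = response.get("nextPageToken")
        if not next_token:
            break  # no further pages
        params["pageToken"] = next_token
    return rows[:max_comments]


# Example usage (requires an API key and network access, so it is left commented out):
# from googleapiclient.discovery import build
# import pandas as pd
# youtube = build("youtube", "v3", developerKey="YOUR_API_KEY")
# df = pd.DataFrame(fetch_comments(youtube, "VIDEO_ID"))
```

Keeping the JSON parsing separate from the network call makes it possible to test the column mapping on a saved response without spending API quota.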

## 2.2 Labelling 

### HUMAN DATA LABELLING AND ANNOTATION (Label Studio)

Compared to automated labelling methods, human annotators are better at interpreting the nuances in the comments. 

Human annotation ensures accurate detection of harmful content in categories such as cyberbullying, hate speech, offensive language and sarcasm. These categories require human judgement because the meaning often depends on context and intention.   

Human labelling helps ensure:  
* Correct classification of comments  
* Reliable sentiment detection  
* Consistent labelling across categories  
* High-quality training data for the NLP model.
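
Labelling consistency can be quantified when two annotators label the same comments. The sketch below computes Cohen's kappa in plain Python; it assumes a double-annotated subset exists, which this document does not state, and the function name `cohen_kappa` and the toy labels are hypothetical.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohen_kappa(labels_a: Sequence[Hashable], labels_b: Sequence[Hashable]) -> float:
    """Cohen's kappa: agreement between two annotators corrected for chance."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty set of items")
    n = len(labels_a)
    # Observed agreement: share of items where both annotators chose the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement if each annotator labelled at random with their own label rates.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Toy example with made-up sentiment labels from two annotators.
annotator_1 = ["Negative", "Negative", "Positive", "Positive"]
annotator_2 = ["Negative", "Positive", "Positive", "Positive"]
kappa = cohen_kappa(annotator_1, annotator_2)  # 0.5
```

A value of 1 means perfect agreement and a value near 0 means agreement no better than chance, so low scores on a category would suggest the labelling guidelines for it need tightening.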


## 2.3 Data Quality issues 

Major data quality issues encountered during the annotation process include:
1. Non-standard English: A significant share of the data uses non-standard English, so annotators need an understanding of local Kenyan slang to label the comments accurately.
2. Class imbalance: Categories such as Constructive Criticism and Irony appear far more often than the others.
3. Name-based bias: Certain names appear repeatedly in toxic comments, highlighting the need for anonymization (masking) during data preparation to prevent the model from learning name-based bias.
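
The masking step could look like the sketch below. It is a simple illustration under stated assumptions: the list of names to mask is supplied by hand, `@` mentions are treated as names, and the function names and the `[NAME]` placeholder are hypothetical. A production pipeline might rely on a named-entity recognizer instead.

```python
import re
from typing import Iterable, Pattern


def build_masker(names: Iterable[str]) -> Pattern[str]:
    """Compile one pattern matching @mentions and any of the given names."""
    # Longest names first so "Jane Doe" is matched before "Jane".
    escaped = [re.escape(n) for n in sorted(set(names), key=len, reverse=True) if n]
    parts = [r"@[\w.-]+"]  # @handles made of letters, digits, underscores, dots, hyphens
    if escaped:
        parts.append(r"\b(?:" + "|".join(escaped) + r")\b")
    return re.compile("|".join(parts), flags=re.IGNORECASE)


def mask_names(text: str, names: Iterable[str], placeholder: str = "[NAME]") -> str:
    """Replace mentions and listed names with a placeholder in a single pass."""
    return build_masker(names).sub(placeholder, text)


# Toy example with made-up names.
masked = mask_names("@user_254 and Jane Doe were mentioned, jane!", ["Jane", "Jane Doe"])
# masked == "[NAME] and [NAME] were mentioned, [NAME]!"
```

Doing the replacement in one pass avoids accidentally re-matching text inside a placeholder that was inserted by an earlier substitution.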