GitHub - skyradez/Audio-based-Indonesia-Toxic-Language-Classification-using-RNN-Speech-Recognition-and-NLP: Audio-based Indonesia Toxic Language Classification using Recurrent Neural Network, Speech Recognition and Natural Language Processing.

Audio-based Indonesia Toxic Language Classification using Recurrent Neural Network, Speech Recognition and Natural Language Processing.

Created by: Marvin Luckianto

Run on Colab

Abstract

This research paper introduces a novel approach for identifying toxic language in Indonesian audio content using a Recurrent Neural Network (RNN), Bidirectional Long Short-Term Memory (BiLSTM), speech recognition, and natural language processing (NLP) techniques. The proposed methodology first transcribes Indonesian audio into text, then applies NLP methods to extract lexical, syntactic, and semantic features that are used to identify and categorize toxic language. The RNN and BiLSTM model architecture captures the temporal dependencies of the spoken content and gathers the information relevant for toxicity classification. The paper reports 95.2% accuracy, 96.4% precision, and 93.2% recall in identifying toxic speech recordings in the Indonesian language, and states that the approach outperforms existing methods. Because classification operates on the transcribed text, the speech recognition component plays a crucial role in the overall result. The technique can be applied in real-world scenarios such as content moderation on Indonesian social media platforms and detection of toxic language in customer service interactions, addressing the growing issue of toxic language in Indonesian online communities.
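For readers less familiar with the three metrics quoted in the abstract, the following is a minimal sketch of how accuracy, precision, and recall are conventionally computed for a binary toxic / non-toxic classifier. The names `ConfusionCounts` and `count_outcomes`, the label encoding (1 = toxic, 0 = non-toxic), and the toy labels are assumptions made for illustration; they are not taken from the notebook, and the numbers are not the paper's results.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class ConfusionCounts:
    """Outcome counts for a binary classifier (hypothetical helper type)."""
    tp: int  # toxic recordings correctly flagged as toxic
    fp: int  # non-toxic recordings wrongly flagged as toxic
    tn: int  # non-toxic recordings correctly left unflagged
    fn: int  # toxic recordings that were missed


def count_outcomes(y_true: Sequence[int], y_pred: Sequence[int]) -> ConfusionCounts:
    """Tally outcomes, assuming the encoding 1 = toxic and 0 = non-toxic."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return ConfusionCounts(tp=tp, fp=fp, tn=tn, fn=fn)


def accuracy(c: ConfusionCounts) -> float:
    """Share of all recordings that were classified correctly."""
    total = c.tp + c.fp + c.tn + c.fn
    return (c.tp + c.tn) / total if total else 0.0


def precision(c: ConfusionCounts) -> float:
    """Share of recordings flagged as toxic that really are toxic."""
    flagged = c.tp + c.fp
    return c.tp / flagged if flagged else 0.0


def recall(c: ConfusionCounts) -> float:
    """Share of truly toxic recordings that were flagged."""
    actual = c.tp + c.fn
    return c.tp / actual if actual else 0.0


# Toy labels for illustration only, not data from the repository.
toy_true = [1, 1, 1, 0, 0, 0, 1, 0]
toy_pred = [1, 1, 0, 0, 0, 1, 1, 0]
toy_counts = count_outcomes(toy_true, toy_pred)
```

Read this way, a precision higher than recall, as reported in the abstract, would mean that recordings flagged as toxic are usually toxic, while a somewhat larger share of toxic recordings goes unflagged.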

Repository files (4 commits):

LICENSE
README.md
RESEARCH_Audio_based_Indonesia_Toxic_Language_Classification_using_Recurrent_Neural_Network,_Speech_Recognition_and_Natural_Language_Processing_).ipynb
toxic(1).csv
