A Machine Learning-powered text filtering system that provides user-controlled profanity censorship. Instead of rigid blacklists, this model scores words based on severity, allowing dynamic filtering based on user-defined thresholds.
- ML-based Text Classification using Naïve Bayes + TF-IDF
- Weighted Censorship (Profanity is filtered based on severity scores)
- User-defined Censorship Thresholds (0.1 - 0.9)
- Dynamic Profanity Detection with an adjustable filtering model
- Text is analyzed using a trained Naïve Bayes model to detect vulgarity.
- If vulgar, words are filtered based on their severity weight (from
Version_2.py
). - Users can set the censorship level (e.g., mild filtering at 0.9 vs. strict filtering at 0.1).
- Final output replaces words above the threshold with
[CENSORED]
.
- Clone the repository:
git clone https://github.com/yourusername/UCC-User-Controlled-Censorship.git cd UCC-User-Controlled-Censorship
- Install dependencies:
pip install -r requirements.txt
Run the censorship script interactively:
python Version_2.py
Then enter the text and censorship threshold when prompted.
- FastAPI-based API for real-time censorship.
- Integration with chat applications to test live moderation.
- Fine-tuning with deep learning models like BERT for improved accuracy.
This project is licensed under the MIT License - see the LICENSE file for details.
I am taking a break from this project for now EXAMS!
Developed by Kinjal Choudhary 🎯