This project is a Python-based automated security log parser and analysis tool designed to inspect server logs, identify potential anomalies, and perform risk analysis using data science techniques. It simulates various cyber attack scenarios and provides comprehensive visual reports.
- Data Simulation: Generates realistic log data for 6 different attack types, including DDoS, Phishing, and Malware.
- Smart Data Cleaning: Automatically detects missing or corrupted data (NaN) and handles anomalies using statistical imputation techniques.
- Visual Reporting & Analytics:
- 📊 Distribution of attack types and firewall mitigation/block rates.
- 📈 Histogram analysis of incident risk scores.
- 📦 Risk range assessments grouped by attack vectors (Boxplot analysis).
- Python 3.x
- Pandas: For advanced data manipulation and security log cleaning.
- Seaborn & Matplotlib: For generating comprehensive data visualizations.
- Random: For stochastic security log simulation.
- Install the required dependencies:
pip install pandas seaborn matplotlib
- Execute the analysis tool:
python security_log_analyzer.py
Upon execution, the script automatically processes the data, exports a cleaned dataset named temiz_guvenlik_loglari.csv, and displays or saves the corresponding security analysis charts as .png files.
The generated analytical charts from the project output are shown below:
Developed as a core technical project by Muhammed Emir Tohumcu.


