This project builds a machine learning system that predicts whether a website is a phishing site.
The dataset for this project can be obtained from the UCI Machine Learning Repository:
- Dataset Name: Phishing Websites Dataset
- Link: UCI ML Repository - Phishing Websites Dataset
- Alternative Link: Kaggle - Phishing Website Dataset
The dataset contains features extracted from phishing and legitimate websites, including:
- URL-based features
- Domain-based features
- HTML and JavaScript-based features
- External services features
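
To give a feel for what URL-based features might look like, here is a minimal sketch using only the Python standard library. The function name `url_features` and the feature names are hypothetical and chosen for illustration; they are not the dataset's actual columns or extraction rules.

```python
import ipaddress
from urllib.parse import urlparse


def url_features(url: str) -> dict:
    """Compute a few illustrative URL-based features (hypothetical names)."""
    host = urlparse(url).hostname or ""

    # Check whether the host part is a raw IP address instead of a domain name.
    try:
        ipaddress.ip_address(host)
        host_is_ip = True
    except ValueError:
        host_is_ip = False

    return {
        "url_length": len(url),
        "has_at_symbol": "@" in url,
        "host_is_ip": host_is_ip,
        # Rough count of labels beyond a two-part domain (ignores multi-part TLDs).
        "subdomain_count": 0 if host_is_ip else max(host.count(".") - 1, 0),
    }


if __name__ == "__main__":
    print(url_features("https://www.example.com/"))
```

In practice the dataset ships with its features already extracted, so a function like this would only be needed when scoring new, unseen URLs.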
The target variable indicates whether a website is phishing (-1), suspicious (0), or legitimate (1).
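
The target encoding above can be made explicit in code. The following is a small sketch, assuming the target values arrive as integers; the names `LABEL_NAMES`, `decode_label`, and `class_counts` are illustrative and not part of the dataset or any library.

```python
from collections import Counter

# Mapping taken from the target description: -1, 0, 1.
LABEL_NAMES = {-1: "phishing", 0: "suspicious", 1: "legitimate"}


def decode_label(value: int) -> str:
    """Translate a numeric target value into its class name."""
    try:
        return LABEL_NAMES[int(value)]
    except KeyError:
        raise ValueError(f"unexpected target value: {value!r}") from None


def class_counts(targets) -> Counter:
    """Count how many samples fall into each class."""
    return Counter(decode_label(t) for t in targets)


if __name__ == "__main__":
    # Placeholder values for demonstration only, not real dataset rows.
    print(class_counts([-1, 1, 1, 0]))
```

Checking the class counts before training is a quick way to spot class imbalance, which affects the choice of evaluation metric.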