Data Preprocessing for Weather Forecasting

This repository demonstrates how to preprocess weather forecast data before using it to train machine learning models. The code provided here focuses on data preprocessing steps such as handling missing values, outlier removal, and feature engineering.

Dataset

The weather forecast data is loaded from a CSV file named "kecamatanforecast-jawa.csv". The dataset includes information about various weather parameters, such as temperature, relative humidity, wind speed, and weather code.

Data Preprocessing Steps

Loading and Filtering Data: The dataset is loaded using pandas, and specific columns related to temperature, relative humidity, wind speed, and weather code are selected for further analysis.
Handling Missing Values: Any rows with missing values are dropped from the dataset to ensure data quality.
Outlier Removal: Outliers are identified using the mean and standard deviation of each feature. Rows containing outliers are removed from the dataset.
Feature Engineering: The "Kode cuaca" column is transformed into a new "Cuaca" column that represents three classes: "Cerah," "Hujan," and "Berawan."
Train-Test Split: The preprocessed dataset is split into training and testing sets for model evaluation.

Model Evaluation

Several machine learning models are evaluated on the preprocessed data:

Linear Regression
Support Vector Machine (SVM)
Decision Tree
Gradient Boosting
Random Forest

The accuracy of each model is measured and compared using bar plots.

Getting Started

Download the "kecamatanforecast-jawa.csv" dataset and save it to the appropriate location.
Open the provided Jupyter Notebook.
Run the notebook cells to perform data preprocessing and model evaluation.
Observe the accuracy comparison among different models using the generated bar plots.

Acknowledgements

The weather forecast data is used for demonstration purposes only. The preprocessing steps and models evaluated can be customized based on specific requirements.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Dataset		Dataset
coba		coba
content		content
datadumy		datadumy
.DS_Store		.DS_Store
MicroML_RandomForestClassifier (1).ipynb		MicroML_RandomForestClassifier (1).ipynb
README.md		README.md
code untuk abstrak.ipynb		code untuk abstrak.ipynb
cuaca.h		cuaca.h
fix.h		fix.h
mantap.ipynb		mantap.ipynb
model.h		model.h
risanti.h		risanti.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Preprocessing for Weather Forecasting

Dataset

Data Preprocessing Steps

Model Evaluation

Getting Started

Acknowledgements

License

About

Releases

Packages

Languages

Amario1306619051/Machine-learning-arduino

Folders and files

Latest commit

History

Repository files navigation

Data Preprocessing for Weather Forecasting

Dataset

Data Preprocessing Steps

Model Evaluation

Getting Started

Acknowledgements

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages