GitHub - vr-jayashree5443/Sales-Prediction-using-Python

Let's break down the workflow of the code into a step-by-step process:

Data Loading:
- The code starts by loading the advertising dataset from a specified file location using pandas. This dataset contains information about advertising expenditures on TV, radio, newspaper, and corresponding sales figures.
Data Preprocessing:
- The relevant columns are selected for analysis. The 'TV,' 'Radio,' and 'Newspaper' columns are considered as the features, and the 'Sales' column is designated as the target variable.
Data Splitting:
- The data is divided into two subsets: training data and testing data. This is crucial for training the model on one subset and evaluating its performance on the other. An 80-20 split ratio is used, with 80% of the data allocated for training and 20% for testing.
Model Selection and Training:
- A Linear Regression model is chosen for this task. Linear Regression is a simple and effective algorithm for predicting a target variable based on one or more input features.
- The model is trained using the training data, with the 'TV,' 'Radio,' and 'Newspaper' advertising expenditures as input features and 'Sales' as the target.
Model Prediction:
- The trained model is used to make predictions on the testing data. This step is where the model estimates sales based on the advertising expenses in the test dataset.
Model Evaluation:
- Two important metrics are calculated to assess the model's performance:
  - Mean Squared Error (MSE): It quantifies the average squared difference between the predicted sales and the actual sales in the test data. Lower MSE values indicate better model accuracy.
  - R-squared (R2): It measures how well the model fits the data. R2 values range from 0 to 1, where a higher R2 indicates a better fit.
Printing Predicted Sales:
- The code prints the predicted sales values for the test data. These are the estimated sales values based on the model's predictions.
Data Visualization:
- To provide a visual representation of the model's performance, a scatter plot is created. It compares the actual sales (from the test data) with the predicted sales. This allows you to see how closely the model's predictions align with the actual data points.

In summary, the code follows a standard workflow for performing a simple linear regression analysis on an advertising dataset. It loads, preprocesses, splits, trains, predicts, evaluates, and visualizes the data and model performance, enabling you to understand how well advertising expenses predict sales.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Activate.ps1		Activate.ps1
README.md		README.md
activate		activate
activate.bat		activate.bat
dataset.zip		dataset.zip
deactivate.bat		deactivate.bat
f2py.exe		f2py.exe
fonttools.exe		fonttools.exe
numpy-1.26.1-cp312-cp312-win_amd64.whl		numpy-1.26.1-cp312-cp312-win_amd64.whl
output_video_demonstration.zip		output_video_demonstration.zip
pip.exe		pip.exe
pip3.12.exe		pip3.12.exe
pip3.exe		pip3.exe
pyftmerge.exe		pyftmerge.exe
pyftsubset.exe		pyftsubset.exe
pylab.py		pylab.py
python.exe		python.exe
pythonw.exe		pythonw.exe
pyvenv.cfg		pyvenv.cfg
run.py		run.py
scipy-1.11.3-cp312-cp312-win_amd64.whl		scipy-1.11.3-cp312-cp312-win_amd64.whl
six.py		six.py
threadpoolctl.py		threadpoolctl.py
ttx.exe		ttx.exe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

vr-jayashree5443/Sales-Prediction-using-Python

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages