# **Project Report: Time Series Forecasting of Tesla Stock Prices using LSTM**
**PROJECT DETAIL**

**1. HIMANSHI SHARMA (055012)**

**2. MUSKAN BOHRA (055025)**

**PROJECT OVERVIEW**

The stock market is inherently volatile and complex, influenced by numerous unpredictable factors. However, time series forecasting techniques, particularly using Deep Learning methods like Long Short-Term Memory (LSTM) networks, have shown potential in learning patterns from historical data to make informed predictions.

This project leverages Tesla Inc. (TSLA) historical stock data to develop a predictive model for forecasting stock prices using an LSTM architecture. The model is implemented using Python and deep learning frameworks such as Keras and TensorFlow.

**Problem Statement**

Traditional statistical methods like ARIMA and exponential smoothing often struggle to capture non-linear patterns and long-term dependencies in financial time series data. LSTM networks, which are designed to retain information over long sequences, are better suited to modeling such data.

Goal:
To develop an LSTM-based model that can forecast future stock prices using past stock data, with a focus on Tesla's (TSLA) closing price trends.


**ACQUISITION**

Source: Yahoo Finance (yfinance API)

Stock Symbol: TSLA (Tesla, Inc.)

Time Range: January 1, 2010 to December 31, 2024 (TSLA began trading on June 29, 2010, so the available data starts on that date)

Features Extracted:

* Date
* Open
* High
* Low
* Close
* Adjusted Close
* Volume

The dataset was downloaded programmatically using yfinance, a Python library that allows easy extraction of historical market data.
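
The download step described above might look like the following minimal sketch. The function name `download_tsla` and the constants are illustrative, not taken from the project code. It assumes a recent yfinance release, where `auto_adjust=False` is needed to keep a separate adjusted-close column.

```python
TICKER = "TSLA"
START = "2010-01-01"
# yfinance treats `end` as exclusive, so this still includes 2024-12-31.
END = "2025-01-01"


def download_tsla(ticker=TICKER, start=START, end=END):
    """Download daily price and volume data for `ticker` as a pandas DataFrame."""
    # Imported inside the function so the file loads without yfinance installed.
    import yfinance as yf

    # auto_adjust=False keeps 'Adj Close' as its own column next to 'Close'.
    return yf.download(ticker, start=start, end=end, auto_adjust=False)
```

Calling `download_tsla()` requires network access and returns a DataFrame indexed by date. Depending on the yfinance version, the columns may come back as a two-level index that needs flattening before use.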

**Exploratory Data Analysis (EDA)**

**Key insights from EDA:**

Tesla’s stock price has grown sharply over the period, especially after 2020.

Sharp price jumps and drops are visible, aligned with events like stock splits, quarterly earnings, and market-wide news.

The 'Close' price was chosen as the primary feature for modeling, being the most indicative of end-of-day investor sentiment.

Visualizations included:

* Line plots of historical closing prices
* Moving averages (50-day and 200-day)
* Volume vs Price correlation
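
As an illustration of the moving averages listed above, here is a small NumPy sketch. The helper `moving_average` and the synthetic `close` array are assumptions for demonstration only; with a pandas Series, `rolling(window).mean()` would give the same values.

```python
import numpy as np


def moving_average(prices, window):
    """Simple moving average; the first window-1 entries are NaN."""
    prices = np.asarray(prices, dtype=float)
    out = np.full(prices.shape, np.nan)
    if len(prices) >= window:
        kernel = np.ones(window) / window
        # 'valid' mode yields one average per full window of prices.
        out[window - 1:] = np.convolve(prices, kernel, mode="valid")
    return out


# Synthetic stand-in for the 'Close' column (not real TSLA data).
close = np.linspace(10.0, 300.0, 400)
ma50 = moving_average(close, 50)
ma200 = moving_average(close, 200)
```

Plotting `close`, `ma50`, and `ma200` on the same axes gives the usual short-term versus long-term trend view.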

**Data Preprocessing**

**Feature Selection**

Only the 'Close' price was used for prediction, treating it as a univariate time series forecasting problem.

**Normalization**

The closing prices were scaled to the range [0, 1] with scikit-learn's MinMaxScaler. Keeping inputs in a small, bounded range generally helps LSTM training converge.
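
To make the scaling concrete, the sketch below reproduces in plain NumPy what MinMaxScaler computes for a single column with its default `feature_range=(0, 1)`, together with the inverse mapping used later to recover prices. The helper names and sample values are illustrative. The report does not say which portion of the data the scaler was fitted on; fitting on the training portion only is a common way to avoid leaking test-set information.

```python
import numpy as np


def fit_min_max(values):
    """Return the (min, max) pair that defines the scaling."""
    values = np.asarray(values, dtype=float)
    return values.min(), values.max()


def scale(values, lo, hi):
    """Map values so that lo becomes 0 and hi becomes 1."""
    return (np.asarray(values, dtype=float) - lo) / (hi - lo)


def inverse_scale(scaled, lo, hi):
    """Undo `scale`, returning values in the original units."""
    return np.asarray(scaled, dtype=float) * (hi - lo) + lo


# Illustrative prices, not real TSLA data.
close = np.array([20.0, 25.0, 40.0, 30.0, 60.0])
lo, hi = fit_min_max(close)
scaled = scale(close, lo, hi)
restored = inverse_scale(scaled, lo, hi)
```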

**Sequence Generation**

Created sliding windows of 60 days to form the input for the LSTM network.

* Each input sequence: 60 timesteps (the previous 60 days' prices)
* Corresponding output: the 61st day's price
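
The windowing described above can be sketched as follows. The function name `make_sequences` is hypothetical, and the input here is a toy series rather than the scaled TSLA prices. The trailing dimension of size 1 matches the `(samples, timesteps, features)` layout that Keras LSTM layers expect.

```python
import numpy as np

WINDOW = 60  # number of past days in each input sequence


def make_sequences(series, window=WINDOW):
    """Turn a 1-D series into (X, y) pairs of `window` inputs and the next value."""
    series = np.asarray(series, dtype=float)
    X, y = [], []
    for end in range(window, len(series)):
        X.append(series[end - window:end])  # days end-window .. end-1
        y.append(series[end])               # the following day
    X = np.array(X).reshape(-1, window, 1)  # (samples, timesteps, features)
    y = np.array(y)
    return X, y


# Toy series 0, 1, ..., 99 so the windows are easy to inspect.
series = np.arange(100, dtype=float)
X, y = make_sequences(series)
```

With 100 observations and a 60-day window this yields 40 samples; the first input holds values 0 through 59 and its target is 60.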

**Data Splitting**

Training Set: ~80% of the sequences

Test Set: ~20% to evaluate performance on unseen data
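
A minimal sketch of the 80/20 split follows. It assumes the split is chronological (earlier sequences for training, later ones for testing), which is the usual choice for time series but is not stated explicitly in the report. The function name and placeholder arrays are illustrative.

```python
import numpy as np


def chronological_split(X, y, train_fraction=0.8):
    """Split without shuffling so the test set lies strictly after the training set."""
    split = int(len(X) * train_fraction)
    return X[:split], X[split:], y[:split], y[split:]


# Placeholder arrays with the same layout as the windowed data.
X = np.zeros((1000, 60, 1))
y = np.arange(1000, dtype=float)
X_train, X_test, y_train, y_test = chronological_split(X, y)
```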



**MODEL ARCHITECTURE**

Model Type: Sequential LSTM

Layers:

* LSTM layer with 50 units
* Dense layer with 1 unit (for output)

Loss Function: Mean Squared Error (MSE)

Optimizer: Adam
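
The architecture listed above could be expressed in Keras roughly as below. This is a sketch built from the stated layer sizes, loss, and optimizer; the function name `build_model` and the input shape of 60 timesteps by 1 feature are assumptions based on the preprocessing section, and the original code may differ in details such as activation or dropout settings.

```python
def build_model(window=60, units=50):
    """Single LSTM layer followed by a one-unit Dense output layer."""
    # Imported inside the function so the file loads without TensorFlow installed.
    from tensorflow import keras

    model = keras.Sequential([
        keras.Input(shape=(window, 1)),  # 60 timesteps, 1 feature (Close)
        keras.layers.LSTM(units),        # returns only the final hidden state
        keras.layers.Dense(1),           # next-day scaled price
    ])
    model.compile(optimizer="adam", loss="mean_squared_error")
    return model
```

Calling `build_model().summary()` prints the layer shapes and parameter counts.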

LSTM can learn long-term dependencies thanks to its gating mechanisms (input, forget, and output gates), which makes it well suited to time series prediction tasks like this one.
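
To show what the three gates do, here is a NumPy sketch of a single LSTM time step using the standard formulation. It is for illustration only and is not the Keras implementation; the weight layout (input, forget, candidate, output stacked in that order) and the random weights are conventions chosen for this example.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One LSTM time step.

    W: (4*units, input_dim), U: (4*units, units), b: (4*units,),
    stacked in the order input, forget, candidate, output.
    """
    units = h_prev.shape[0]
    z = W @ x_t + U @ h_prev + b
    i = sigmoid(z[0 * units:1 * units])  # input gate: how much new information to write
    f = sigmoid(z[1 * units:2 * units])  # forget gate: how much old cell state to keep
    g = np.tanh(z[2 * units:3 * units])  # candidate values to write
    o = sigmoid(z[3 * units:4 * units])  # output gate: how much cell state to expose
    c_t = f * c_prev + i * g             # updated cell state (long-term memory)
    h_t = o * np.tanh(c_t)               # updated hidden state (output)
    return h_t, c_t


rng = np.random.default_rng(0)
units, input_dim = 4, 1
W = rng.normal(scale=0.1, size=(4 * units, input_dim))
U = rng.normal(scale=0.1, size=(4 * units, units))
b = np.zeros(4 * units)

# Feed three scaled prices through the cell, carrying state forward.
h, c = np.zeros(units), np.zeros(units)
for price in [0.10, 0.12, 0.15]:
    h, c = lstm_step(np.array([price]), h, c, W, U, b)
```

Because the cell state `c_t` is updated additively and scaled by the forget gate, information from early timesteps can persist across many steps when the forget gate stays close to 1.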

**Model Training**

The model was trained using the fit() method.

Batch size: 32

Epochs: 100 (tuned via early stopping or trial-and-error)

Training and validation loss were plotted to monitor overfitting.
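
A training call matching the settings above might look like this sketch. The report leaves open whether early stopping was actually used, so the `EarlyStopping` callback, the `patience` value, and the `validation_split` fraction are all assumptions; `train_model` is a hypothetical helper that expects an already compiled Keras model.

```python
BATCH_SIZE = 32
MAX_EPOCHS = 100


def train_model(model, X_train, y_train, validation_split=0.1, patience=10):
    """Fit a compiled Keras model and return its training history."""
    # Imported inside the function so the file loads without TensorFlow installed.
    from tensorflow import keras

    # Assumed: stop when validation loss has not improved for `patience` epochs.
    early_stop = keras.callbacks.EarlyStopping(
        monitor="val_loss", patience=patience, restore_best_weights=True
    )
    history = model.fit(
        X_train,
        y_train,
        batch_size=BATCH_SIZE,
        epochs=MAX_EPOCHS,
        validation_split=validation_split,
        callbacks=[early_stop],
    )
    return history
```

The returned object exposes `history.history["loss"]` and `history.history["val_loss"]`, which are the two curves to plot when checking for overfitting.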

**Observation:**

Loss consistently decreased, suggesting the model effectively learned the underlying patterns in the training data.

**Model Evaluation**

**Predictions**

The model's predictions were scaled back to original values using inverse transformation.

Plotted the actual vs predicted values for visual assessment.

**Metrics Used**

* Mean Absolute Error (MAE)
* Root Mean Squared Error (RMSE)
* Mean Absolute Percentage Error (MAPE)

These metrics provided quantitative insights into model accuracy.
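
For reference, the three metrics can be computed as in the sketch below. The sample arrays are made up to keep the arithmetic easy to follow and are unrelated to the project's results.

```python
import numpy as np


def mae(y_true, y_pred):
    """Mean absolute error, in the same units as the prices."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.mean(np.abs(y_true - y_pred)))


def rmse(y_true, y_pred):
    """Root mean squared error; penalizes large misses more than MAE."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent (y_true must be non-zero)."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.mean(np.abs((y_true - y_pred) / y_true)) * 100.0)


# Made-up prices: absolute errors are 10, 10, and 0.
y_true = np.array([100.0, 200.0, 400.0])
y_pred = np.array([110.0, 190.0, 400.0])

print(mae(y_true, y_pred), rmse(y_true, y_pred), mape(y_true, y_pred))
```

MAE and RMSE are expressed in price units, so they should be computed after the inverse transformation; MAPE is scale-free, which makes it easier to compare across periods with very different price levels.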



| Metric | Value (approx.) |
| --- | --- |
| MAE | 12.34 |
| RMSE | 18.57 |
| MAPE | 2.78% |

**Visualization**

Key plots included:

* Historical TSLA stock prices (2010–2024)

* Train vs Test split visualization

* Loss vs Epoch graph (Training history)

* Actual vs Predicted prices on test set

**These visualizations support the model's ability to follow price trends on the test set.**

**Insights & Observations**

LSTM models can be effective for financial time series forecasting when properly tuned and trained on sufficient data.

On the test set, the model tracked price movements closely, with a MAPE of approximately 2.78%.

Including additional features like trading volume, technical indicators (e.g., RSI, MACD), or news sentiment could further enhance performance.

**Limitations**

The model only used univariate data (close price), missing out on multivariate dependencies.

It assumes that past prices carry information about future prices, an assumption that the efficient-market hypothesis disputes.

External factors like news events, regulatory changes, and economic reports were not accounted for.

**Future Work**

Use a multivariate LSTM with features such as Open, High, Low, Volume, and technical indicators.

Integrate sentiment analysis from financial news or social media.

Explore Bidirectional LSTM or Attention-based models for better performance.

Deploy the model as a real-time forecasting API or dashboard.

**Tools & Libraries**

Language: Python

**Libraries/Frameworks:**

* yfinance, pandas, numpy
* matplotlib, seaborn (visualizations)
* scikit-learn (scaling)
* keras, tensorflow (deep learning)

**Conclusion**

This project demonstrated how LSTM networks can be effectively used for time series forecasting in the financial domain. By focusing on Tesla’s stock prices, it highlights the potential of deep learning to uncover complex temporal patterns and deliver actionable insights in stock prediction.

