# Mutual Fund & ETFs Analysis
---

# Part 7 - Evaluation & Conclusion

## Content

1. [Evaluation](#Evaluation)
2. [Conclusion](#Conclusion)
3. [Limitations](#Limitations)
4. [Recommendations](#Recommendations)

## Links to Other Notebooks

- [Part 1: Data Cleaning](Part_1_Cleaning.ipynb)
- [Part 2: Exploratory Data Analysis](Part_2_EDA.ipynb)
- [Part 3: Modeling: Mutual Fund v1](Part_3_Modeling_MF_v1.ipynb) - Mutual Fund Prediction of fund returns
- [Part 4: Modeling: Mutual Fund v2](Part_4_Modeling_MF_v2.ipynb) - Mutual Fund Prediction of alpha
- [Part 5: Modeling: ETFs v1](Part_5_Modeling_ETF_v1.ipynb) - ETF Prediction of fund returns
- [Part 6: Modeling: ETFs v2](Part_6_Modeling_ETF_v2.ipynb) - ETF Prediction of alpha

## Evaluation

|                                           	| **Best score** 	| **R2  Train** 	| **R2 Test** 	| **RMSE Train** 	| **RMSE Test** 	|
|-------------------------------------------	|:--------------:	|:-------------:	|:-----------:	|:--------------:	|:-------------:	|
| **Mutual fund prediction of YTD returns** 	|     0.9246     	|     0.9919    	|    0.9410   	|     0.0072     	|     0.0193    	|
| **Mutual fund prediction of alpha**       	|     0.9290     	|     0.9921    	|    0.9294   	|     0.4438     	|     1.3637    	|
| **ETF prediction of YTD returns**         	|     0.2315     	|     0.5287    	|    0.1998   	|     0.0814     	|     0.1341    	|
| **ETF prediction of alpha**               	|     0.2685     	|     0.5661    	|    0.1871   	|     3.6966     	|     6.029     	|

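Overall, the alpha models achieved the highest best scores for both fund types (0.9290 vs 0.9246 for mutual funds, 0.2685 vs 0.2315 for ETFs), although their test R2 was slightly lower than that of the YTD-return models (0.9294 vs 0.9410 and 0.1871 vs 0.1998). The RMSE values are not directly comparable across the two targets because RMSE is expressed in the units of each target. On balance, we would recommend alpha as the target variable to predict.

However, the ETF predictions fall significantly short of the mutual fund predictions (test R2 of roughly 0.19 to 0.20 versus 0.93 to 0.94), which we attribute to the lack of a quality ETF dataset.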
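To make the table's columns concrete, the sketch below computes R2 and RMSE from their definitions and then derives the train-test R2 gap from the figures reported in the table. The helper names (`r_squared`, `rmse`, `r2_gap`) and the toy `y_true`/`y_pred` values are illustrative assumptions, not code from the earlier notebooks.

```python
import math

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the target."""
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)

# (R2 train, R2 test) copied from the evaluation table.
table_r2 = {
    "MF YTD returns": (0.9919, 0.9410),
    "MF alpha": (0.9921, 0.9294),
    "ETF YTD returns": (0.5287, 0.1998),
    "ETF alpha": (0.5661, 0.1871),
}

# Train-test gap: a larger gap points to more overfitting.
r2_gap = {name: round(train - test, 4) for name, (train, test) in table_r2.items()}

# Made-up toy values, used only to exercise the two functions.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.1, 1.9, 3.2, 3.8]
print(r_squared(y_true, y_pred), rmse(y_true, y_pred))
print(r2_gap)
```

Read this way, the mutual fund models lose about 0.05 to 0.06 of R2 between train and test, while the ETF models lose about 0.33 to 0.38, which suggests the ETF models generalize far less well.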

The best parameters were:
- Mutual fund prediction of alpha:
    - `n_estimators`: 200
    - `max_depth`: None
- ETF prediction of alpha:
    - `n_estimators`: 150
    - `max_depth`: 4
    
`n_estimators` is the number of trees in the Random Forest model. The mutual fund model selected the higher value (200 vs 150), which is likely due to the larger volume of mutual fund data.

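`max_depth` is the maximum depth of each tree in the forest; deeper trees can capture more detail about the data. For mutual funds, the best setting was `None`, meaning the trees were grown without a depth limit, which suggests the volume of data was sufficient to learn from without this constraint. For ETFs, the best setting capped the depth at 4. Shallower trees limit model complexity, which likely helps when learning from the more limited ETF dataset.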
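As a rough illustration of how such parameters can be selected, here is a minimal sketch that assumes the tuning was done with something like scikit-learn's `GridSearchCV` over a `RandomForestRegressor`. The synthetic data, the grid values, and the fold count are assumptions made for this example; the actual tuning code is in Parts 4 and 6.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in data (assumption): the real features come from the earlier notebooks.
X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

# Hypothetical grid that includes the values reported above.
param_grid = {
    "n_estimators": [150, 200],
    "max_depth": [4, None],  # None = no depth limit
}

search = GridSearchCV(
    RandomForestRegressor(random_state=42),
    param_grid,
    cv=3,          # assumed number of folds
    scoring="r2",
)
search.fit(X_train, y_train)

print("Best params:", search.best_params_)
print("Best CV score:", round(search.best_score_, 4))
print("R2 train:", round(search.score(X_train, y_train), 4))
print("R2 test:", round(search.score(X_test, y_test), 4))
```

With `max_depth=None`, scikit-learn expands each tree until its leaves are pure or hold fewer than `min_samples_split` samples, so a selected value of 4 is a restriction on tree size rather than an increase in depth.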

## Conclusion

Overall, we were able to create a model that generates near-accurate predictions for mutual funds, making them the better investment option based solely on prediction accuracy.

The 10 most important features for the model's predictions were:
1. Investment type (Growth)
2. Morningstar overall rating
3. Stocks asset breakdown
4. ESG score
5. Fund sector (Energy)
6. Fund sector (Industrials)
7. Size fund (Large)
8. Size fund (Small)
9. Morningstar risk rating
10. Fund sector (Technology)
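A ranking like this is commonly read from a fitted forest's `feature_importances_` attribute. Whether the list above was produced exactly this way is an assumption; the sketch below only shows the general technique on synthetic data, with hypothetical feature names that carry no real fund information.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Hypothetical column names; the data below is synthetic.
feature_names = ["feature_a", "feature_b", "feature_c", "feature_d", "feature_e"]
X = rng.normal(size=(300, len(feature_names)))
# The target is driven mostly by the first column, then the second.
y = 3.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=300)

model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

# feature_importances_ is impurity-based and sums to 1 across all features.
importances = model.feature_importances_
order = np.argsort(importances)[::-1]  # indices from most to least important

top_n = 3
ranking = [(feature_names[i], float(importances[i])) for i in order[:top_n]]
for rank, (name, score) in enumerate(ranking, start=1):
    print(f"{rank}. {name}: {score:.3f}")
```

Impurity-based importances describe how much each feature contributed to the splits of this particular fitted model, so they are best read as a ranking for the model rather than as causal drivers of fund performance.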

However, this does not mean that ETFs are not a worthy investment alternative. As we saw in the EDA section, ETFs have been able to keep up with the market while costing investors a lot less, enabling them to earn more for less. Furthermore, ETFs have been gaining traction over the years and are here to stay.

## Limitations

1. The lack of a quality ETF dataset has certainly impacted our EDA and modeling sections. Fortunately, as ETFs gain more traction, more effort is likely to go into recording the data, which should eventually allow for a better prediction model.
2. Prediction models in practice: unfortunately, the market changes too fast for prediction models to be put to work effectively. Many studies and fund managers have attempted to predict performance, but actual market performance often varies greatly from the predictions, and this remains an ongoing quest.
3. Since the success of mutual funds and ETFs depends on how the portfolio changes, we should ideally compare performance before and after each change within the same fund. This way, we would be able to tell which changes had the bigger impact on performance.

## Recommendations

Using our findings and model, fund managers and investors can look for ways to maximize their returns.
- For fund managers:
    - Review both active and passive funds currently on offer and check that the shortlisted features are optimized
    - For active mutual funds that have been doing well consistently with minimal effort, consider converting them to index funds
    - Create new active mutual funds using this model and focus on adding value for investors, for example by sharing information on coveted stocks that are not widely available to the public
    - Collect more data on mutual funds and ETFs to improve prediction accuracy
    - Wear different hats, from market expert to financial advisor, and keep the investor's priorities in mind by tailoring advice on the best investment option to each client's needs
- For investors:
    - Using the shortlisted features, review the current portfolio to check that these features are optimized to meet investment objectives
    - Prioritize investment goals and diversify accordingly, e.g. engage a fund manager if the goal is to outperform the market and you are willing to take risks
    - Keep up with new products and regulatory changes in this space to make informed investment decisions