# <div align="center">Machine Learning Approaches in Natural Resources</div>

***
### Concepts, Techniques, and Prediction Models  
* ML concepts and commonly used techniques in natural resources 
* Descriptive, Predictive, and Prescriptive analytics
* Types of Machine Learning models and approaches
    * Supervised vs. unsupervised learning
    * Classification and regression tasks
    
### References
* Géron, A. (2022). Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow. O’Reilly Media, Sebastopol, CA. https://go.exlibris.link/x2Wfv2hq
    * Chapter "1. The Machine Learning Landscape"
    * "Performance Measures" from Chapter "3. Classification"
* James, G., Witten, D., Hastie, T., Tibshirani, R. (2021). Statistical Learning. In: An Introduction to Statistical Learning. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-0716-1418-1_2
    * Introduction (pp. 1-6) including "An Overview of Statistical Learning" and "A Brief History of Statistical Learning".
    * "2.1. What is Statistical Learning?" pp. 15-29
    * "3.1 Simple Linear Regression" (skip if familiar with this topic)
    * "4.1. An Overview of Classification" and "4.2. Why Not Linear Regression?" from "4. Classification." 

### Machine Learning (ML)
* ML is the field of study that gives computers the ability to learn without being explicitly programmed (Samuel, 1959).
* Machine learning is a method of data analysis that automates analytical model building.
* It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns, and make decisions with minimal human intervention (SAS, 2022).
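
To make "learning without being explicitly programmed" concrete, the sketch below contrasts a rule whose threshold is written by hand with one whose threshold is chosen from labeled examples. The soil-moisture values, the labels, and the function names are made up for illustration and are not taken from the references above.

```python
def rule_based(moisture):
    """Explicitly programmed: a person picked the 30.0 threshold."""
    return "dry" if moisture < 30.0 else "wet"


def learn_threshold(values, labels):
    """Learned: pick the cut point that misclassifies the fewest examples."""
    ordered = sorted(set(values))
    # Candidate cut points are the midpoints between neighboring values.
    candidates = [(a + b) / 2 for a, b in zip(ordered, ordered[1:])]

    def errors(cut):
        # Count examples where "below the cut" disagrees with the "dry" label.
        return sum((v < cut) != (lab == "dry") for v, lab in zip(values, labels))

    return min(candidates, key=errors)


# Hypothetical soil-moisture readings (%) with hand-assigned labels.
moisture = [12.0, 18.0, 22.0, 35.0, 41.0, 50.0]
labels = ["dry", "dry", "dry", "wet", "wet", "wet"]

cut = learn_threshold(moisture, labels)
print("learned threshold:", cut)
print("prediction for 25.0:", "dry" if 25.0 < cut else "wet")
```

If the labeled examples change, the learned threshold changes with them, while the hand-written rule stays fixed until someone edits the code.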

### Statistical Learning (SL)
* SL often tries to capture the relationship between predictor and response variables in mathematical equations.
* The focus is more on inference and hypothesis testing, so model assumptions such as normality of errors, homoscedasticity, and the absence of multicollinearity are typically checked in statistical learning.
* The terms SL and ML are often used interchangeably, although some argue that they are two different fields.
* SL and ML overlap heavily, as both deal with data analysis. Both fields are concerned with making predictions as well as with formalizing relationships between variables (inference).
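
As a minimal sketch of capturing a predictor-response relationship in an equation, the code below fits a straight line, height = b0 + b1 × diameter, by ordinary least squares. The tree measurements and the `fit_line` helper are illustrative assumptions, not data or code from the references.

```python
def fit_line(x, y):
    """Ordinary least squares for y = b0 + b1 * x with a single predictor."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    b1 = sxy / sxx              # slope
    b0 = y_mean - b1 * x_mean   # intercept
    return b0, b1


# Made-up measurements: stem diameter (cm) and tree height (m).
diameter_cm = [10.0, 15.0, 20.0, 25.0, 30.0]
height_m = [8.1, 11.0, 13.8, 17.2, 19.9]

b0, b1 = fit_line(diameter_cm, height_m)

# Inference: b1 estimates how much height changes per extra cm of diameter.
print(f"height = {b0:.3f} + {b1:.3f} * diameter")

# Prediction: plug a new diameter into the fitted equation.
print(f"predicted height at 22 cm: {b0 + b1 * 22.0:.2f} m")
```

The same fitted equation serves both purposes named above: the slope is read for inference, and the whole equation is evaluated for prediction.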

### Big Data
* Big data and machine learning are closely linked.
* Massive data sets are now available because data are gathered and stored continuously, made possible by advances in sensor and communication technologies.
* Such data sets are too large or complex to be handled by traditional data-processing applications or analyzed with simple SL models.
* Big data often needs to be processed in parallel on multiple CPU cores, either on a single machine or across several machines.

### References
* Jacobs, A. 2009. The pathologies of big data. Communications of the ACM, 52(8), pp. 36-44.
* James, G., Witten, D., Hastie, T., Tibshirani, R. (2021). Statistical Learning. In: An Introduction to Statistical Learning. Springer Texts in Statistics.
* Samuel, A. 1959. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 3(3), pp. 210-229.
* SAS 2022. https://www.sas.com/en_ca/insights/analytics/machine-learning.html
* Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.

### Descriptive, Predictive, and Prescriptive Analytics

<br>
<div style="text-align: center;">
<img src="https://i.imgur.com/IOhrwzb.png" alt="image" width="400"/>
</div>
<div style="text-align: center;">
     <strong>Figure 1: Selecting the Right Analysis: Exploring Descriptive, Predictive, and Prescriptive Approaches</strong>
    </div>
    <br>
    <br>
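
The three kinds of analytics can be sketched on one small data set: descriptive analytics summarizes what happened, predictive analytics estimates what is likely to happen, and prescriptive analytics recommends an action. Everything below is a hypothetical example: the planting densities, the survival rates, the linear trend, and the 85% survival target are assumptions chosen only to illustrate the distinction.

```python
from statistics import mean

# Made-up records: seedlings planted per hectare and first-year survival rate.
density = [800, 1000, 1200, 1400, 1600]
survival = [0.92, 0.90, 0.86, 0.80, 0.72]

# Descriptive: summarize what has already been observed.
avg_survival = mean(survival)
print(f"mean survival: {avg_survival:.2f}, range: {min(survival)}-{max(survival)}")

# Predictive: fit a linear trend and estimate survival at an untried density.
d_mean = mean(density)
slope = (sum((d - d_mean) * (s - avg_survival) for d, s in zip(density, survival))
         / sum((d - d_mean) ** 2 for d in density))
intercept = avg_survival - slope * d_mean


def predict_survival(d):
    """Predicted survival rate at planting density d (linear trend)."""
    return intercept + slope * d


print(f"predicted survival at 1300/ha: {predict_survival(1300):.3f}")

# Prescriptive: recommend the densest planting that still meets a target.
target = 0.85  # hypothetical management goal
feasible = [d for d in density if predict_survival(d) >= target]
recommended = max(feasible)
print(f"recommended density: {recommended}/ha")
```

Each step builds on the previous one: the recommendation depends on the prediction, which in turn is fitted to the observed records.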


### Machine Learning Models

#### Supervised vs. Unsupervised Learning

<br>

<div style="text-align: center;">
<img src="https://i.imgur.com/j3outR9.jpeg" alt="image" width="400"/>
</div>
<div style="text-align: center;">
     <strong> Figure 2: Common Supervised and Unsupervised Machine Learning Algorithms</strong>
    </div>
    <br>
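
A minimal sketch of the difference, under made-up data: the supervised function receives labels and learns one mean per class, while the unsupervised function receives only the values and has to find the groups itself. Nearest-centroid classification and k-means are used here only as convenient examples; the vegetation heights and function names are assumptions for illustration.

```python
def nearest_centroid_fit(values, labels):
    """Supervised: use the labels to compute one mean (centroid) per class."""
    groups = {}
    for v, lab in zip(values, labels):
        groups.setdefault(lab, []).append(v)
    return {lab: sum(vs) / len(vs) for lab, vs in groups.items()}


def nearest_centroid_predict(centroids, v):
    """Assign v to the class whose centroid is closest."""
    return min(centroids, key=lambda lab: abs(v - centroids[lab]))


def kmeans_1d(values, k=2, iterations=10):
    """Unsupervised: group 1-D values into k clusters without labels (k >= 2)."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centers across the sorted values.
    centers = [ordered[i * (len(ordered) - 1) // (k - 1)] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centers[j]))
            clusters[nearest].append(v)
        # Move each center to the mean of its cluster (keep it if empty).
        centers = [sum(c) / len(c) if c else centers[j]
                   for j, c in enumerate(clusters)]
    return centers


# Made-up vegetation heights (m).
heights = [2.0, 2.5, 3.0, 18.0, 20.0, 22.0]
labels = ["shrub", "shrub", "shrub", "tree", "tree", "tree"]

centroids = nearest_centroid_fit(heights, labels)       # needs labels
print("supervised centroids:", centroids)
print("5.0 m is classified as:", nearest_centroid_predict(centroids, 5.0))

print("unsupervised cluster centers:", kmeans_1d(heights))  # no labels used
```

On data this well separated both approaches find the same two group centers, but only the supervised model can name them, because only it was given the labels.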
 


#### Classification and Regression Tasks

<div style="text-align: center;">
<img src="https://i.imgur.com/3lR3YAz.jpg" alt="image" width="400"/>
</div>
<div style="text-align: center;">
     <strong> Figure 3: Common Machine Learning Algorithms for Classification and Regression Tasks</strong>
    </div>
    <br>
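
The two tasks differ in what is predicted: regression returns a continuous number, classification returns a category. The sketch below uses k-nearest neighbors for both, averaging the neighbors' values for regression and taking a majority vote for classification. The algorithm choice, the elevation data, and the function names are assumptions made for illustration.

```python
from collections import Counter


def k_nearest(train_x, x, k):
    """Indices of the k training points closest to x (1-D distance)."""
    return sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]


def knn_regress(train_x, train_y, x, k=3):
    """Regression: average the numeric targets of the k nearest points."""
    idx = k_nearest(train_x, x, k)
    return sum(train_y[i] for i in idx) / k


def knn_classify(train_x, train_labels, x, k=3):
    """Classification: majority vote among the labels of the k nearest points."""
    idx = k_nearest(train_x, x, k)
    return Counter(train_labels[i] for i in idx).most_common(1)[0][0]


# Made-up plots along an elevation gradient (m).
elevation = [200, 400, 600, 800, 1000, 1200]
snow_depth_cm = [5.0, 12.0, 20.0, 31.0, 45.0, 60.0]          # continuous target
forest_type = ["deciduous", "deciduous", "mixed",
               "mixed", "conifer", "conifer"]                 # categorical target

print("regression:", knn_regress(elevation, snow_depth_cm, 850), "cm")
print("classification:", knn_classify(elevation, forest_type, 850))
```

The predictor and the neighbor search are identical in both calls; only the type of the target and the way neighbors are combined change.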
                                                              

### Generalization, Overfitting, and Underfitting

* **Generalization:** The ability of a model trained on one dataset to make accurate predictions on new, "unseen" data.
* **Underfitting:** When a model is too simple to capture the underlying structure and variability in the dataset.
* **Overfitting:** When a model is too complex and fits the noise in the training dataset rather than the underlying pattern.
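
These three ideas can be seen by comparing training error with error on held-out points. The sketch below is a toy setup under stated assumptions: the data follow a made-up linear trend with alternating +1 / -1 "noise", and k-nearest-neighbors regression stands in for models of different complexity (k = 1 is the most flexible, k = 10 simply predicts the overall mean).

```python
def knn_predict(train_x, train_y, x, k):
    """Average the targets of the k training points nearest to x."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / k


def mse(train_x, train_y, xs, ys, k):
    """Mean squared error of the k-NN predictions at points xs."""
    errors = [(knn_predict(train_x, train_y, x, k) - y) ** 2
              for x, y in zip(xs, ys)]
    return sum(errors) / len(errors)


# Made-up training data: trend y = 2x plus alternating +1 / -1 "noise".
train_x = [float(i) for i in range(10)]
train_y = [2 * x + (1 if i % 2 == 0 else -1) for i, x in enumerate(train_x)]

# Held-out points between the training points, following the noise-free trend.
test_x = [i + 0.4 for i in range(9)]
test_y = [2 * x for x in test_x]

results = {}
for k in (1, 2, 10):
    train_error = mse(train_x, train_y, train_x, train_y, k)
    test_error = mse(train_x, train_y, test_x, test_y, k)
    results[k] = (train_error, test_error)
    print(f"k={k:2d}  train MSE={train_error:.3f}  test MSE={test_error:.3f}")
```

With k = 1 the model reproduces the training data exactly (zero training error) but carries the noise over to the held-out points, which is overfitting. With k = 10 every prediction is the overall mean, so both errors are large, which is underfitting. In this toy setup the intermediate k = 2 has the lowest held-out error, so it generalizes best.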

<div style="text-align: center;">
<img src="https://i.imgur.com/2tNYGsN.jpg" alt="image" width="650"/>
</div>
<div style="text-align: center;">
     <strong> Figure 4: Underfitting, Balanced Fitting, and Overfitting in Machine Learning Algorithms</strong>
    </div>
    <br>
    <br>
    


**Questions to Consider:**
> Which model would predict new, “unseen” data more successfully? <br>
