## Table of Contents
* [Module 1: Data](#Module-1:-Data)
    * [Concepts](#Concepts:)
    * [User Stories](#User-Stories:)
        * [Polymer 3D Printer](3.%20Advanced%20Topics/3.1.%20Aerotech%20Data%20Collection%20Platform/Aerotech%20Data%20Collection.ipynb)
    * [Summary Data](#Summary-Data:)
    * [Assessment](#Assessment:)
        * [Take a quiz](http://example.com/quiz)
#### [🏠 Home](../../welcomePage.ipynb)

# Yellow Belt: Foundational Concepts

## Objective:
Introduce beginners to data concepts in digital engineering with a theoretical foundation and illustrative examples. No hands-on coding.

---

## Topics to Discuss

### 1. Introduction to Data in Digital Engineering
- Importance of data in digital engineering.
- Data as the foundation for decision-making, product innovation, and process optimization.
- Examples of digital engineering data:
  - IoT sensor data
  - CAD metadata
  - Machine telemetry data

---

### 2. Understanding Data Types and Sources
- Types of data: numerical, categorical, textual, time-series, image.
- Data sources:
  - Structured: Databases, tables, CSV files.
  - Semi-structured: JSON logs, XML files.
  - Unstructured: Images, videos.
- Example workflows:
  - Conceptual overview of collecting data from IoT devices and CAD systems.

---

### 3. Data Quality and Challenges
- Key challenges in digital engineering data:
  - Missing values.
  - Noise and outliers.
  - Inconsistent formats and duplicates.
- Why data cleaning is critical for engineering systems.
- Conceptual examples of data quality issues (no coding).

---

### 4. Introduction to Tools for Data Handling
- Overview of Python and libraries for data handling:
  - `pandas`, `numpy`, `matplotlib`.
- Conceptual explanation of Jupyter Notebooks for data workflows.

---

### 5. Key Preprocessing Concepts
- **Missing Data**:
  - What it is and why it occurs.
  - Conceptual strategies: dropping, imputing.
- **Outliers**:
  - How outliers impact engineering analyses.
  - Visual identification methods (boxplots, histograms).
- **Transformations**:
  - Concept of scaling and encoding.
  - Examples of Min-Max scaling and one-hot encoding (theory only).
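
As a theory-only illustration of the two transformations above, the worked example below uses invented values: three hypothetical nozzle temperatures and three hypothetical material labels. None of these numbers come from the course data. Min-Max scaling maps each value $x$ of a feature onto the range $[0, 1]$ using the feature's minimum and maximum:

$$
x' = \frac{x - x_{\min}}{x_{\max} - x_{\min}}
$$

For assumed readings of 180, 200, and 220 °C, $x_{\min} = 180$ and $x_{\max} = 220$, so:

$$
\frac{180 - 180}{220 - 180} = 0, \qquad \frac{200 - 180}{220 - 180} = 0.5, \qquad \frac{220 - 180}{220 - 180} = 1
$$

One-hot encoding replaces a categorical value with a binary vector that contains a single 1. Assuming the categories are ordered as (PLA, ABS, PETG):

$$
\text{PLA} \mapsto (1, 0, 0), \qquad \text{ABS} \mapsto (0, 1, 0), \qquad \text{PETG} \mapsto (0, 0, 1)
$$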


### Module 1: Data
To begin a successful digital engineering journey, it is important to recognize that data is the linchpin driving this transformation. The true power of digital engineering lies not in adopting the latest digital technologies but in making data-driven decisions. Data can be collected from anywhere and everywhere, but digital engineering is only as good as the quantity and quality of the data behind it. Data serves as the compass that guides the transformation, enabling informed decision-making, product innovation, service enhancement, and efficient process automation.

<center>
    <img src="../../img/23.jpg" alt="Alt text" width="300">
</center>


#### Concepts:
- **Data sources:** Data sources refer to the origins from which data is obtained, including databases,
sensors, and external services, providing raw information for analysis and processing. They can
be structured, semi-structured, or unstructured, depending on the format and organization of the
data.

   - **IoT:** The Internet of Things (IoT) is a network of interconnected devices embedded with
sensors, software, and other technologies, enabling them to collect, exchange, and act on data
over the internet. These devices range from household items to industrial machines, enhancing
automation and data-driven decision-making.

- **Data cleansing:** Data cleansing can involve filtering out noise and removing outliers from
collected data to ensure its quality and reliability. This process helps make the data used for
analysis accurate, complete, and relevant.
    
- **Data storage, security, and accessibility:** Data storage involves saving data in physical or
cloud-based systems, while data security focuses on protecting this data from unauthorized
access and breaches. Data accessibility ensures that authorized users can efficiently retrieve and
use the stored data whenever needed.
    
- **Data usage:** Data usage encompasses leveraging data to build models, perform simulations, and
apply machine learning techniques to optimize designs and processes. This includes analyzing
sensor data, predicting system behaviors, and improving operational efficiency through
data-driven decision-making and automation.
    
<center>
    <img src="../../img/conceptsFigure.svg" alt="Alt text" width="900">
</center>
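
To make the data cleansing concept above more concrete, here is an optional, minimal sketch in Python using `pandas`. It is not hands-on work for this module. The readings, the column name `temperature_c`, the helper `cleanse`, and the 1.5 × IQR outlier rule are all assumptions chosen for illustration, not the method used in the course notebooks.

```python
import pandas as pd

# Hypothetical nozzle-temperature readings in degrees C (invented values).
# None marks a missing reading; 350.0 plays the role of an outlier.
readings = pd.DataFrame({
    "temperature_c": [200.1, 199.8, None, 200.4, 350.0, 200.0, 199.9, 200.0],
})


def cleanse(df, column):
    """Drop rows with a missing value in `column`, then drop IQR outliers.

    Assumption: a value is an outlier if it lies more than 1.5 * IQR
    below the first quartile or above the third quartile.
    """
    present = df.dropna(subset=[column])
    q1 = present[column].quantile(0.25)
    q3 = present[column].quantile(0.75)
    iqr = q3 - q1
    in_range = present[column].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
    return present[in_range].reset_index(drop=True)


cleaned = cleanse(readings, "temperature_c")
print(cleaned)
```

With these invented readings, the missing value and the 350.0 reading are removed and the remaining values are kept. In practice, the right threshold depends on the sensor and the process, so a rule like this is a starting point rather than a fixed recipe.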



#### User Stories:
- [Data Processing (Level 1)]()
- [Data Processing (Level 2)]()
- [Polymer 3D Printer (Level 3)](3.%20Advanced%20Topics/3.1.%20Aerotech%20Data%20Collection%20Platform/Aerotech%20Data%20Collection.ipynb) - This module preprocesses sensor data from a custom-built Fused Filament Fabrication (FFF) 3D printer for modeling the extrusion process, incorporating feedback on motion, temperature, pressure, force, and cooling dynamics. 

#### Summary Data:
- Millions of data points are created at every stage of a product’s lifecycle.
- What matters is making sense of that data and using it when needed and for the right purpose.
- Deciding what kind of data to collect, and how and where to collect it, is essential.

#### Assessment:
- [Take a quiz](http://example.com/quiz)


### <center>[◀︎ Introduction](../introToDE.ipynb)     [🏠 Home](../../welcomePage.ipynb)     [Module 2 ▶︎](Module2.ipynb)</center>
