# Data Science Tools and Ecosystem

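This notebook summarizes the languages, libraries, and tools commonly used in data science, then works through a few short arithmetic examples in Python.

### Popular Data Science Languages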

- **Python** – Widely used for data analysis, machine learning, and visualization.
- **R** – Strong in statistical analysis and data visualization.
- **SQL** – Essential for querying and managing structured data.
- **Julia** – High-performance language for numerical and scientific computing.
- **Scala** – Commonly used with Apache Spark for big data processing.
- **Java** – Utilized in large-scale data systems and enterprise applications.
- **MATLAB** – Often used in academic and engineering contexts for data modeling.

### Popular Data Science Libraries

#### **Python**
- **NumPy** – Numerical computing and array manipulation  
- **Pandas** – Data manipulation and analysis  
- **Matplotlib** – Data visualization and plotting  
- **Seaborn** – Statistical data visualization  
- **Scikit-learn** – Machine learning algorithms and tools  
- **TensorFlow** – Deep learning framework  
- **PyTorch** – Deep learning and neural networks  
- **Statsmodels** – Statistical modeling and hypothesis testing  

#### **R**
- **ggplot2** – Data visualization  
- **dplyr** – Data manipulation  
- **tidyr** – Data cleaning and tidying  
- **caret** – Machine learning and model training  
- **shiny** – Building interactive web apps for data analysis  

#### **Julia**
- **DataFrames.jl** – Data manipulation  
- **Flux.jl** – Machine learning  
- **Plots.jl** – Visualization  

#### **Scala**
- **Spark MLlib** – Machine learning on big data  
- **Breeze** – Numerical processing and linear algebra  

### Data Science Tools

| **Category**              | **Tool**                  | **Description** |
|----------------------------|---------------------------|-----------------|
| **Data Analysis**          | Excel, Pandas, RStudio    | Tools for exploring, cleaning, and analyzing data. |
| **Data Visualization**     | Tableau, Power BI, Matplotlib, Seaborn | Tools for creating visual representations of data. |
| **Machine Learning**       | Scikit-learn, TensorFlow, PyTorch, Keras | Frameworks and libraries for building ML models. |
| **Big Data Processing**    | Apache Spark, Hadoop, Databricks | Platforms for handling and analyzing large datasets. |
| **Databases**              | MySQL, PostgreSQL, MongoDB, Snowflake | Tools for storing and querying structured or unstructured data. |
| **Data Collection**        | APIs, Web Scraping (BeautifulSoup, Scrapy) | Methods and tools for gathering data from various sources. |
| **Development Environments** | Jupyter Notebook, Google Colab, VS Code | IDEs and environments for coding and experimentation. |
| **Version Control**        | Git, GitHub, GitLab       | Tools for managing code versions and collaboration. |
| **Cloud Platforms**        | AWS, Google Cloud, Azure  | Cloud services for scalable storage, computation, and deployment. |

### Arithmetic Expression Examples

Arithmetic expressions are used to perform basic mathematical operations such as addition, subtraction, multiplication, and division.  
These operations are fundamental in data analysis, programming, and mathematical modeling.

Here are some common arithmetic expressions:

| **Expression** | **Description**         | **Example Result** |
|----------------|--------------------------|--------------------|
| `a + b`        | Addition                 | `5 + 3 = 8`        |
| `a - b`        | Subtraction              | `10 - 4 = 6`       |
| `a * b`        | Multiplication           | `7 * 2 = 14`       |
| `a / b`        | Division (float result)  | `8 / 2 = 4.0`      |
| `a // b`       | Floor Division           | `9 // 4 = 2`       |
| `a % b`        | Modulus (remainder)      | `10 % 3 = 1`       |
| `a ** b`       | Exponentiation (power)   | `2 ** 3 = 8`       |

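The operators in the table use **Python** syntax. **R** and **Julia** support the same operations but spell some of them differently: exponentiation is `^` in both, floor division is `%/%` in R and `÷` in Julia, and the remainder operator is `%%` in R.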
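As a quick check of the table, the following minimal Python sketch evaluates each operator with the same operands shown in the example column. The dictionary name `results` is only illustrative and does not appear elsewhere in this notebook.

```python
# Evaluate each arithmetic operator from the table with the same operands.
results = {
    "5 + 3": 5 + 3,      # addition
    "10 - 4": 10 - 4,    # subtraction
    "7 * 2": 7 * 2,      # multiplication
    "8 / 2": 8 / 2,      # true division always returns a float
    "9 // 4": 9 // 4,    # floor division rounds down to an integer
    "10 % 3": 10 % 3,    # remainder after division
    "2 ** 3": 2 ** 3,    # exponentiation
}

for expression, value in results.items():
    print(f"{expression} = {value}")
```

Note that `8 / 2` prints `4.0` rather than `4`, because `/` returns a float in Python 3 even when the operands divide evenly.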

```python
# Multiply and add numbers

# Define numbers
a = 5
b = 3
c = 2

# Perform arithmetic operations
multiplication = a * b
addition = a + b + c

# Display results
print("Multiplication result:", multiplication)
print("Addition result:", addition)
```

Output:

```text
Multiplication result: 15
Addition result: 10
```
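
The cell above computes the product and the sum separately. When both operations appear in a single expression, Python evaluates multiplication before addition, and parentheses change that order. A small sketch reusing the same values (the names `without_parens` and `with_parens` are illustrative):

```python
a, b, c = 5, 3, 2

# Multiplication binds tighter than addition, so a * b is evaluated first.
without_parens = a * b + c      # (5 * 3) + 2
# Parentheses force the addition to happen before the multiplication.
with_parens = a * (b + c)       # 5 * (3 + 2)

print("a * b + c   =", without_parens)
print("a * (b + c) =", with_parens)
```

The two results differ (17 versus 25), so it is worth adding parentheses whenever the intended order is not obvious.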


```python
# Convert minutes to hours

# Define the number of minutes
minutes = 150

# Convert minutes to hours
hours = minutes / 60

# Display result
print(f"{minutes} minutes is equal to {hours:.2f} hours")
```

Output:

```text
150 minutes is equal to 2.50 hours
```
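
The decimal result is convenient for further calculation, but a duration is often easier to read as whole hours plus leftover minutes. One possible way to do that is sketched below with the built-in `divmod`; the helper name `split_minutes` is hypothetical and not part of the original notebook.

```python
def split_minutes(total_minutes):
    """Return (hours, minutes) for a non-negative whole number of minutes."""
    # divmod returns the quotient and the remainder in a single call,
    # equivalent to (total_minutes // 60, total_minutes % 60).
    return divmod(total_minutes, 60)


whole_hours, leftover_minutes = split_minutes(150)
print(f"150 minutes is {whole_hours} h {leftover_minutes} min")
```

This combines the floor division and modulus operators from the table above into one step.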


### Objectives

- Understand the basics of data science workflows  
- Learn key programming languages and tools used in data science  
- Perform arithmetic operations and basic data manipulations  
- Practice converting units and handling numerical data in code  
- Build a foundation for more advanced topics such as data visualization and machine learning 

## Author: Benjamín Negrete Contreras