#### 🧠 What is pandas?

**pandas** is a Python library used to:

* Load, inspect, and clean data
* Analyze and manipulate data tables
* Handle missing values
* Perform group operations and filtering

It is widely used in data science, especially for **exploratory data analysis (EDA)**.

---

#### 📘 What You’ll Learn in the Kaggle Pandas Course

#### 1. **Creating, Reading, and Writing Data**

* Load data from CSV files using `pd.read_csv()`
* View the data with `.head()` and `.info()`
* Save data using `.to_csv()`

```python
import pandas as pd

data = pd.read_csv("file.csv")  # load a CSV file into a DataFrame
data.to_csv("new_file.csv", index=False)  # index=False leaves out the row index
```
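
As a rough, self-contained sketch of the same load, inspect, and save cycle, the example below uses an in-memory `io.StringIO` buffer in place of a real file so it runs anywhere. The CSV content and the column names `name` and `age` are made up for illustration.

```python
import io

import pandas as pd

# Hypothetical CSV content; the StringIO buffer stands in for "file.csv".
csv_text = "name,age\nAna,34\nBen,28\n"
data = pd.read_csv(io.StringIO(csv_text))

print(data.head())  # first rows of the table (up to 5 by default)
data.info()         # prints column names, dtypes, and non-null counts

# Write the table back out; index=False omits the row-index column.
out = io.StringIO()
data.to_csv(out, index=False)
round_trip = out.getvalue()
```

With a real file, the buffers would simply be replaced by paths such as `"file.csv"`.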

---

#### 2. **DataFrame Basics**

* What is a **DataFrame**? (It’s a two-dimensional labeled table, much like a spreadsheet in Excel)
* Access columns: `data['column_name']`
* Summarize columns: `.describe()`, `.mean()`, `.unique()`

```python
data['age'].mean()
data['category'].unique()
```
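
To try these calls without a data file, a DataFrame can be built directly from a dictionary. This is only a small sketch; the `age` and `category` values are invented for the example.

```python
import pandas as pd

# Hypothetical data: each dictionary key becomes a column.
data = pd.DataFrame({
    "age": [25, 35, 45],
    "category": ["a", "b", "a"],
})

ages = data["age"]                      # selecting one column returns a Series
mean_age = ages.mean()                  # arithmetic mean of the column
categories = data["category"].unique()  # distinct values, in order of appearance
summary = data["age"].describe()        # count, mean, std, min, quartiles, max
```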

---

#### 3. **Indexing, Selecting, and Assigning**

* Select rows by label with `.loc[]` and by integer position with `.iloc[]`
* Filter data with conditions
* Add new columns

```python
# Select the row whose index label is 0
data.loc[0]

# Select the fourth row by integer position (positions start at 0)
data.iloc[3]

# Filter rows
data[data['age'] > 30]

# Add a new column
data['age_in_10_years'] = data['age'] + 10
```
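
The difference between `.loc[]` and `.iloc[]` is easier to see when the index is not the default 0, 1, 2. The sketch below assumes a small made-up table with string labels `x`, `y`, `z` as its index.

```python
import pandas as pd

# Hypothetical data with a non-numeric index to separate labels from positions.
data = pd.DataFrame({"age": [22, 41, 35]}, index=["x", "y", "z"])

by_label = data.loc["y"]     # the row labelled "y"
by_position = data.iloc[0]   # the first row, whatever its label is

# A boolean condition keeps only the rows where it is True.
older = data[data["age"] > 30]

# Assigning to a new column name adds that column to the DataFrame.
data["age_in_10_years"] = data["age"] + 10
```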

---

#### 4. **Summary Functions and Maps**

* Count how often each value occurs with `.value_counts()` and transform values one by one with `.map()`
* Apply functions to columns: `.apply()`

```python
data['income_bracket'] = data['income'].map(lambda x: 'High' if x > 50000 else 'Low')
```
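
Here is one possible way to combine `.map()`, `.value_counts()`, and `.apply()` on a single column. The income figures and the derived column name `income_k` are assumptions made for the example.

```python
import pandas as pd

# Hypothetical incomes.
data = pd.DataFrame({"income": [30000, 72000, 51000, 30000]})

# map() transforms each value of the column independently.
data["income_bracket"] = data["income"].map(
    lambda x: "High" if x > 50000 else "Low"
)

# value_counts() tallies how many rows fall into each bracket.
counts = data["income_bracket"].value_counts()

# apply() on a Series also calls the function once per value.
data["income_k"] = data["income"].apply(lambda x: x / 1000)
```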

---

#### 5. **Grouping and Sorting**

* Use `.groupby()` to group data and compute summaries
* Sort with `.sort_values()`

```python
data.groupby('city')['price'].mean()
data.sort_values('price', ascending=False)
```
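
A short sketch of grouping and sorting on a made-up table follows; the cities and prices are invented. It also shows `.agg()`, which computes several summaries per group in one call.

```python
import pandas as pd

# Hypothetical listings.
data = pd.DataFrame({
    "city": ["Oslo", "Lima", "Oslo", "Lima"],
    "price": [100, 40, 300, 60],
})

# One mean price per city; the result is a Series indexed by city.
mean_price = data.groupby("city")["price"].mean()

# sort_values() returns a new, sorted DataFrame and leaves `data` unchanged.
by_price = data.sort_values("price", ascending=False)

# Several summaries per group at once.
stats = data.groupby("city")["price"].agg(["min", "max", "count"])
```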

---

#### 6. **Data Types and Missing Values**

* Check data types with `.dtypes`
* Convert types using `.astype()`
* Handle missing values with `.isnull()`, `.fillna()`, `.dropna()`

```python
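# Assign the result back: fillna(inplace=True) on a selected column
# may not update `data` in recent pandas versions
data['age'] = data['age'].fillna(data['age'].mean())
data.dropna(subset=['income'], inplace=True)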
```
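
The sketch below walks through counting, filling, and dropping missing values, then converting a type, on a tiny invented table. It assigns results back to the column or variable rather than relying on `inplace=True`; that is a style choice for this example, not the only valid approach.

```python
import numpy as np
import pandas as pd

# Hypothetical data with one missing value in each column.
data = pd.DataFrame({
    "age": [20.0, np.nan, 40.0],
    "income": [1000.0, 2000.0, np.nan],
})

missing = data.isnull().sum()   # number of missing values per column

# Replace the missing age with the mean of the known ages (30.0 here).
data["age"] = data["age"].fillna(data["age"].mean())

# With no NaN left in the column, it can be converted to integers.
data["age"] = data["age"].astype(int)

# Drop the rows that have no income recorded.
data = data.dropna(subset=["income"])
```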

---

#### 7. **Renaming and Combining**

* Rename columns: `.rename()`
* Combine DataFrames: `pd.concat()`, `.merge()`

```python
data.rename(columns={'old_name': 'new_name'}, inplace=True)
# df1 and df2 stand for any two DataFrames with the same columns
combined = pd.concat([df1, df2])
```
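
To make the combining step concrete, the sketch below defines two small placeholder frames for `df1` and `df2`, stacks them with `pd.concat()`, renames a column, and then joins in a third table with `.merge()`. The `id`, `score`, and `scores` names are hypothetical.

```python
import pandas as pd

# Hypothetical frames sharing the same columns.
df1 = pd.DataFrame({"id": [1, 2], "old_name": ["a", "b"]})
df2 = pd.DataFrame({"id": [3], "old_name": ["c"]})

# Stack the rows; ignore_index=True renumbers the index 0..n-1.
combined = pd.concat([df1, df2], ignore_index=True)

# rename() returns a new DataFrame when inplace is not set.
combined = combined.rename(columns={"old_name": "new_name"})

# merge() joins on a shared key column; a left join keeps every row
# of `combined` and leaves NaN where `scores` has no matching id.
scores = pd.DataFrame({"id": [1, 3], "score": [0.5, 0.9]})
merged = combined.merge(scores, on="id", how="left")
```

`pd.concat()` stacks tables that share columns, while `.merge()` matches rows across tables by key, much like a SQL join.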
