## Reading CSV Files

Pandas provides the `read_csv()` function to read CSV files into a DataFrame. The cells below build a small DataFrame, save it with `to_csv()`, and read it back.

In [2]:
# Import necessary libraries
import pandas as pd
import numpy as np

print(f"Pandas version: {pd.__version__}")
print(f"NumPy version: {np.__version__}")

Pandas version: 2.2.3
NumPy version: 2.2.4


In [3]:
# Create sample data for demonstration
data = {
    'Name': ['Alice', 'Bob', 'Charlie', 'Diana'],
    'Age': [25, 30, 35, 28],
    'City': ['New York', 'London', 'Paris', 'Tokyo'],
    'Salary': [50000, 60000, 70000, 55000]
}

df = pd.DataFrame(data)
print("Sample DataFrame:")
print(df)

# Save to CSV
df.to_csv('sample_data.csv', index=False)
print("\nData saved to 'sample_data.csv'")

Sample DataFrame:
      Name  Age      City  Salary
0    Alice   25  New York   50000
1      Bob   30    London   60000
2  Charlie   35     Paris   70000
3    Diana   28     Tokyo   55000

Data saved to 'sample_data.csv'
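The `index=False` argument used above controls whether the row index is written as an extra column. The sketch below illustrates the difference, assuming a smaller two-row DataFrame invented for this example; calling `to_csv()` without a path returns the CSV text as a string instead of writing a file.

```python
import pandas as pd

# Hypothetical two-row frame, used only to keep the output short.
small = pd.DataFrame({"Name": ["Alice", "Bob"], "Age": [25, 30]})

# With no path argument, to_csv() returns the CSV text as a string.
with_index = small.to_csv()
without_index = small.to_csv(index=False)

print("With index:")
print(with_index)
print("Without index:")
print(without_index)
```

When a file written with the index is read back with the default `read_csv()` settings, that index usually appears as an extra unnamed column, which is why `index=False` is a common choice for plain data exports.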


In [4]:
# Read CSV file
df_read = pd.read_csv('sample_data.csv')
print("Data read from CSV:")
print(df_read)

# Check data types
print("\nData types:")
print(df_read.dtypes)

Data read from CSV:
      Name  Age      City  Salary
0    Alice   25  New York   50000
1      Bob   30    London   60000
2  Charlie   35     Paris   70000
3    Diana   28     Tokyo   55000

Data types:
Name      object
Age        int64
City      object
Salary     int64
dtype: object
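`read_csv()` also accepts parameters that control which columns and rows are loaded and how they are typed. The following is a minimal sketch, assuming the same rows as `sample_data.csv` held in an in-memory string (via `io.StringIO`) so that it runs without the file; the choice of `usecols`, `dtype`, and `nrows` is illustrative, not exhaustive.

```python
import io

import pandas as pd

# Same rows as sample_data.csv, held in memory so the snippet runs on its own.
csv_text = """Name,Age,City,Salary
Alice,25,New York,50000
Bob,30,London,60000
Charlie,35,Paris,70000
Diana,28,Tokyo,55000
"""

# usecols keeps only the listed columns, dtype overrides type inference,
# and nrows stops after the given number of data rows.
subset = pd.read_csv(
    io.StringIO(csv_text),
    usecols=["Name", "Salary"],
    dtype={"Salary": "float64"},
    nrows=3,
)

print(subset)
print(subset.dtypes)
```

Restricting columns and rows this way can reduce memory use on large files, since the skipped data is never loaded into the DataFrame.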


## Reading Excel Files

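Pandas can read Excel files using the `read_excel()` function. This depends on an optional engine library: `openpyxl` for `.xlsx` files, or `xlrd` for legacy `.xls` files. Writing `.xlsx` files with `to_excel()` likewise needs `openpyxl` (or `xlsxwriter`).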

In [5]:
# Save to Excel
df.to_excel('sample_data.xlsx', index=False, sheet_name='Employees')
print("Data saved to 'sample_data.xlsx'")

# Read Excel file
df_excel = pd.read_excel('sample_data.xlsx', sheet_name='Employees')
print("Data read from Excel:")
print(df_excel)

Data saved to 'sample_data.xlsx'
Data read from Excel:
      Name  Age      City  Salary
0    Alice   25  New York   50000
1      Bob   30    London   60000
2  Charlie   35     Paris   70000
3    Diana   28     Tokyo   55000
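A workbook can hold several sheets, and `read_excel()` can load them all at once. The sketch below is one way to do this, assuming `openpyxl` is available (it skips the round trip otherwise); the second sheet name `Over30` and the in-memory `io.BytesIO` buffer are choices made for this example. Passing `sheet_name=None` returns a dictionary that maps each sheet name to its DataFrame.

```python
import importlib.util
import io

import pandas as pd

df = pd.DataFrame({
    "Name": ["Alice", "Bob", "Charlie", "Diana"],
    "Age": [25, 30, 35, 28],
})

if importlib.util.find_spec("openpyxl") is None:
    # Without the engine library, Excel I/O for .xlsx is unavailable.
    sheets = {}
    print("openpyxl is not installed; skipping the Excel round trip")
else:
    buffer = io.BytesIO()
    # ExcelWriter lets several DataFrames share one workbook.
    with pd.ExcelWriter(buffer, engine="openpyxl") as writer:
        df.to_excel(writer, index=False, sheet_name="Employees")
        df[df["Age"] >= 30].to_excel(writer, index=False, sheet_name="Over30")

    buffer.seek(0)
    # sheet_name=None loads every sheet into a dict keyed by sheet name.
    sheets = pd.read_excel(buffer, sheet_name=None, engine="openpyxl")
    for name, frame in sheets.items():
        print(name, frame.shape)
```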


## Reading and Writing JSON Files

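Pandas can handle JSON data using the `pd.read_json()` function and the `DataFrame.to_json()` method. The `orient` parameter selects the JSON layout; `orient='records'` writes a list with one object per row.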

In [6]:
# Save to JSON
df.to_json('sample_data.json', orient='records', indent=2)
print("Data saved to 'sample_data.json'")

# Read JSON file
df_json = pd.read_json('sample_data.json', orient='records')
print("Data read from JSON:")
print(df_json)

Data saved to 'sample_data.json'
Data read from JSON:
      Name  Age      City  Salary
0    Alice   25  New York   50000
1      Bob   30    London   60000
2  Charlie   35     Paris   70000
3    Diana   28     Tokyo   55000
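The `orient` argument determines the shape of the JSON text. The sketch below compares two layouts, assuming a small two-row DataFrame invented for this example. It wraps the JSON text in `io.StringIO` before reading, because recent pandas versions deprecate passing a literal JSON string directly to `read_json()`.

```python
import io

import pandas as pd

# Hypothetical two-row frame, used only to keep the JSON short.
small = pd.DataFrame({"Name": ["Alice", "Bob"], "Age": [25, 30]})

# With no path argument, to_json() returns the JSON text as a string.
records_json = small.to_json(orient="records")  # list of row objects
split_json = small.to_json(orient="split")      # columns, index, and data stored separately

print(records_json)
print(split_json)

# Read back with the same orient that was used for writing.
restored = pd.read_json(io.StringIO(split_json), orient="split")
print(restored)
```

The `records` layout is convenient for exchanging data with web APIs, while `split` also preserves the index and column order explicitly.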


## Other File Formats

Pandas supports many other formats:
- `read_sql()` / `to_sql()` for SQL databases
- `read_html()` for HTML tables
- `read_pickle()` / `to_pickle()` for pickled pandas objects (only load pickle files from trusted sources)
- `read_parquet()` / `to_parquet()` for Parquet files
- `read_feather()` / `to_feather()` for Feather format
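As an illustration of one format from the list above, the following sketch round-trips a DataFrame through SQL. It assumes an in-memory SQLite database from Python's standard `sqlite3` module, and the table name `employees` is made up for this example; other databases generally require a SQLAlchemy connection instead.

```python
import sqlite3

import pandas as pd

df = pd.DataFrame({
    "Name": ["Alice", "Bob", "Charlie", "Diana"],
    "Salary": [50000, 60000, 70000, 55000],
})

# An in-memory database avoids leaving files behind.
conn = sqlite3.connect(":memory:")
try:
    # Write the DataFrame to a table (hypothetical name: employees).
    df.to_sql("employees", conn, index=False)

    # Let the database filter and sort before loading into pandas.
    high_earners = pd.read_sql(
        "SELECT Name, Salary FROM employees WHERE Salary > 55000 ORDER BY Salary",
        conn,
    )
finally:
    conn.close()

print(high_earners)
```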

## Summary

You have learned how to:
- Read and write CSV files using `read_csv()` and `to_csv()`
- Read and write Excel files using `read_excel()` and `to_excel()`
- Read and write JSON files using `read_json()` and `to_json()`
- Find the reader and writer functions for other formats such as SQL, HTML, pickle, Parquet, and Feather

These functions accept parameters that customize how data is read or written, such as `index=False`, `sheet_name`, and `orient` in the examples above.

# Reading and Writing Data in Pandas

This notebook shows how to read data from, and write data to, several file formats with Pandas, including CSV, Excel, and JSON.