# Python Dataset Exploration Notebook
This notebook demonstrates how to install dependencies, import common data science libraries, and access the mounted dataset via the DATA_DIR environment variable or /data.

## Install Dependencies
Installs packages listed in requirements.txt. Add any additional libraries your analysis needs there.

In [None]:
# Install dependencies from requirements.txt into the kernel's environment (quiet)
%pip install -q -r requirements.txt
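
Because the install output is suppressed, a failed or partial install can go unnoticed. The following is a minimal sketch of a follow-up check, assuming requirements.txt contains simple lines such as `name` or `name==version`; the helper names `requirement_names` and `missing_packages` are hypothetical and not part of this notebook's template.

```python
import re
from importlib import metadata
from pathlib import Path


def requirement_names(lines):
    """Extract distribution names from simple requirements.txt lines."""
    names = []
    for line in lines:
        line = line.split('#', 1)[0].strip()  # drop trailing comments
        if not line or line.startswith('-'):
            continue  # skip blanks, comment-only lines, and options like -r / -e
        match = re.match(r'[A-Za-z0-9][A-Za-z0-9._-]*', line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names):
    """Return the names that have no installed distribution metadata."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


# Assumption: requirements.txt sits in the current working directory.
req_file = Path('requirements.txt')
if req_file.exists():
    names = requirement_names(req_file.read_text().splitlines())
    print('Missing packages:', missing_packages(names) or 'none')
```

This only checks that each distribution is installed, not that the installed version satisfies the pinned version.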

## Import Common Libraries
Example imports of widely used data science libraries.

In [None]:
import os
from pathlib import Path
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
print('pandas version:', pd.__version__)
print('seaborn version:', sns.__version__)

## Inspect Dataset Directory
The DATA_DIR environment variable points to the mounted, read-only dataset directory. The same directory is also reachable through the /data symlink, but prefer DATA_DIR so the notebook does not depend on a fixed path.

In [None]:
# Fall back to the /data symlink if DATA_DIR is not set
data_dir = Path(os.environ.get('DATA_DIR', '/data'))
print('DATA_DIR =', data_dir)
print('\nListing via DATA_DIR:')
for p in sorted(data_dir.iterdir()):
    print(' -', p.name)
print('\nListing via /data:')
for p in sorted(Path('/data').iterdir()):
    print(' -', p.name)

# Example: load the first CSV (in alphabetical order, if any) into a DataFrame
csvs = sorted(data_dir.glob('*.csv'))
if csvs:
    df = pd.read_csv(csvs[0])
    print('\nLoaded', csvs[0].name, 'shape =', df.shape)
    display(df.head())
else:
    print('\nNo CSV files found in dataset root.')
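
The cell above only looks at the dataset root, so files stored in subdirectories are not reported. As a rough sketch, a recursive inventory by file extension might look like the following; the helper name `inventory` is hypothetical, and the fallback to /data when DATA_DIR is unset is an assumption based on the description above.

```python
import os
from collections import Counter
from pathlib import Path


def inventory(root):
    """Count files under root by lowercase suffix, searching recursively."""
    counts = Counter()
    for p in Path(root).rglob('*'):
        if p.is_file():
            counts[p.suffix.lower() or '(no extension)'] += 1
    return counts


# Assumption: fall back to /data when DATA_DIR is not set.
root = Path(os.environ.get('DATA_DIR', '/data'))
if root.is_dir():
    for suffix, n in inventory(root).most_common():
        print(f'{suffix:>16}  {n}')
```

On large datasets a full recursive walk can be slow, so it may be worth restricting the pattern, for example to `rglob('*.csv')`.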