# Geopandas Shapefile Extraction

Utility notebook that loads a shapefile with **geopandas**, inspects its first rows and coordinate reference system (CRS), plots the geometries, and exports the attribute table (without the geometry column) to CSV. It is meant as a starting point for geospatial data preprocessing within the ETL pipeline.

In [None]:
# Install geopandas in the current environment if needed
# !pixi add --feature gis --pypi geopandas
# (Uncomment the line above and run the cell in the appropriate environment)

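A shapefile is a set of sibling files rather than a single `.shp`: the format expects a `.shx` index and a `.dbf` attribute table alongside it, and the optional `.prj` file carries the CRS. A pre-flight check can make a failed load easier to diagnose. The sketch below uses only the standard library; the helper name `missing_sidecars` and the two extension tuples are illustrative, not part of geopandas.

```python
from pathlib import Path

# Sidecar extensions the shapefile format expects next to the .shp file.
REQUIRED_SIDECARS = (".shx", ".dbf")
# Optional, but without it the CRS is usually undefined after loading.
OPTIONAL_SIDECARS = (".prj",)


def missing_sidecars(shp_path, extensions=REQUIRED_SIDECARS):
    """Return the sidecar paths (hypothetical helper) that do not exist next to shp_path."""
    shp = Path(shp_path)
    candidates = [shp.with_suffix(ext) for ext in extensions]
    return [path for path in candidates if not path.exists()]


# Same placeholder path as used elsewhere in this notebook.
shapefile_path = "data/example_shapefile.shp"
for path in missing_sidecars(shapefile_path):
    print(f"Missing required file: {path}")
for path in missing_sidecars(shapefile_path, OPTIONAL_SIDECARS):
    print(f"Missing optional file (CRS may be undefined): {path}")
```

The check is case-sensitive, so files delivered with upper-case extensions (for example `.DBF`) would be reported as missing on case-sensitive file systems.
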

In [None]:
import geopandas as gpd
import matplotlib.pyplot as plt

# Path to your shapefile (update this path to point at the desired .shp file)
shapefile_path = 'data/example_shapefile.shp'

# Load the shapefile
gdf = gpd.read_file(shapefile_path)

# Quick inspection
print(gdf.head())
print(f'CRS: {gdf.crs}')

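The CRS printed above can be `None` when the shapefile ships without a `.prj` file, and layers from different sources often use different CRSs. A common follow-up is to normalize everything to one CRS before further processing. The sketch below is one way to do that; the helper name `ensure_crs` and both default CRS values are assumptions for illustration, and the assumed CRS in particular has to come from knowledge of the data source.

```python
import geopandas as gpd
from shapely.geometry import Point


def ensure_crs(gdf, assumed_crs="EPSG:4326", target_crs="EPSG:4326"):
    """Return gdf in target_crs, labelling it with assumed_crs first if it has no CRS.

    Hypothetical helper: assumed_crs is a guess the caller must be able to justify.
    """
    if gdf.crs is None:
        # set_crs only attaches a CRS label; it does not move any coordinates.
        gdf = gdf.set_crs(assumed_crs)
    # to_crs transforms the coordinates into the target CRS.
    return gdf.to_crs(target_crs)


# Tiny in-memory frame without a CRS, standing in for a loaded shapefile.
demo = gpd.GeoDataFrame({"name": ["a"]}, geometry=[Point(13.4, 52.5)])
print(ensure_crs(demo).crs)
```

Labelling a layer with the wrong CRS silently misplaces every feature, so it is usually safer to stop and investigate a missing CRS than to rely on a default.
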

In [None]:
# Plot the geometries for a quick visual check
ax = gdf.plot(figsize=(10, 6))
ax.set_title('Shapefile Overview')
plt.show()


In [None]:
# Export the attribute table to CSV for downstream processing (optional).
# The geometry column is dropped, so the CSV contains attributes only.
output_csv = 'data/shapefile_attributes.csv'
gdf.drop(columns=gdf.geometry.name).to_csv(output_csv, index=False)
print(f'Attributes exported to {output_csv}')

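The CSV export above discards the geometry. If downstream steps still need the shapes, one option is to serialize each geometry as WKT text in an extra column. The sketch below shows that variant on a tiny in-memory frame; the helper name `attributes_with_wkt` and the column name `geometry_wkt` are illustrative choices, not an existing convention of the pipeline.

```python
import geopandas as gpd
import pandas as pd
from shapely.geometry import Point


def attributes_with_wkt(gdf, wkt_column="geometry_wkt"):
    """Return a plain DataFrame of attributes plus a WKT text column (hypothetical helper)."""
    geom_name = gdf.geometry.name  # the active geometry column, whatever it is called
    df = pd.DataFrame(gdf.drop(columns=geom_name))
    # GeoSeries.to_wkt() returns one WKT string per geometry.
    df[wkt_column] = gdf.geometry.to_wkt()
    return df


# Tiny in-memory frame standing in for a loaded shapefile.
demo = gpd.GeoDataFrame({"name": ["a", "b"]}, geometry=[Point(1, 2), Point(3, 4)])
table = attributes_with_wkt(demo)
print(table)
# table.to_csv('data/shapefile_with_wkt.csv', index=False)  # hypothetical output path
```

WKT keeps the CSV readable by non-spatial tools, but it does not record the CRS, so that has to be tracked separately.
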