# Visual Features CSV Combiner

## Overview
This Jupyter notebook combines the CSV files of visual features produced by batched video analysis into a single consolidated dataset.

### Purpose
- Combines the separate CSV files produced by batch video processing
- Aligns columns by name when the batches are concatenated
- Creates a unified dataset for further analysis
- Keeps the `video_id` column so each row stays traceable to its video

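### Prerequisites
The notebook needs `pandas`; `glob` ships with the Python standard library. The batch CSV files must be in the notebook's current working directory.
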
In [None]:
import pandas as pd
import glob



### Process
1. Locates all CSV files matching the pattern "visual_features_*.csv" and sorts them by name
2. Reads each file and concatenates them into one DataFrame with a fresh row index
3. Counts the unique video IDs as a sanity check
4. Exports the combined dataset to a single CSV file

### Usage
Run cells sequentially to:
1. Locate and combine CSV files
2. Check the number of unique video IDs
3. Export consolidated dataset

The final output is saved as "combined_visual_features.csv" in the current directory.

In [2]:

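# Get the list of all CSV files matching the pattern (sorted lexicographically)
csv_files = sorted(glob.glob("visual_features_*.csv"))

# pd.concat raises ValueError on an empty sequence, so fail early with a clearer message
if not csv_files:
    raise FileNotFoundError("No files match 'visual_features_*.csv' in the current directory")

# Read and concatenate all CSV files, renumbering the rows from 0
df = pd.concat((pd.read_csv(file) for file in csv_files), ignore_index=True)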
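
`pd.concat` aligns columns by name and fills any column missing from a batch with NaN instead of raising, so files with differing headers would merge silently. The following is a minimal sketch of a header check that could run before the concatenation, assuming every batch file is expected to have identical columns in the same order. The helper name `find_column_mismatches` is hypothetical and not part of the original notebook.

```python
import pandas as pd


def find_column_mismatches(paths):
    """Return {path: columns} for files whose header differs from the first file.

    Hypothetical helper: assumes all batch files should share one header.
    """
    if not paths:
        return {}
    # nrows=0 parses only the header row, so the check stays cheap for large files
    reference = list(pd.read_csv(paths[0], nrows=0).columns)
    mismatches = {}
    for path in paths[1:]:
        columns = list(pd.read_csv(path, nrows=0).columns)
        if columns != reference:
            mismatches[path] = columns
    return mismatches
```

In the notebook this could be called as `find_column_mismatches(csv_files)`; an empty dictionary means all headers agree with the first file.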


In [5]:
len(df['video_id'].unique())

20180
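
The unique count shows how many distinct IDs exist, but not whether any ID appears in more than one row (for example, if a video was processed in two batches). A minimal sketch of such a check follows, assuming one row per video is the intended layout. The helper name `duplicated_video_ids` and the `brightness` column in the toy data are illustrative assumptions, not taken from the original notebook.

```python
import pandas as pd


def duplicated_video_ids(df, id_column="video_id"):
    """Return all rows whose ID occurs more than once, sorted by ID."""
    # keep=False marks every occurrence of a repeated ID, not only the later ones
    mask = df.duplicated(subset=id_column, keep=False)
    return df[mask].sort_values(id_column)


# Toy data standing in for the combined DataFrame (values are made up)
example = pd.DataFrame(
    {"video_id": ["a", "b", "a", "c"], "brightness": [0.1, 0.2, 0.3, 0.4]}
)
dupes = duplicated_video_ids(example)
print(dupes)
```

If the returned frame is empty, the row count of the combined data equals the number of unique video IDs.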

In [4]:

# Save the combined DataFrame to a new CSV file
df.to_csv("combined_visual_features.csv", index=False)
