Simple CLI tool that summarizes a CSV.
- Loads a CSV
- Prints the number of rows and columns, the column names, the dtypes, and the missing-value count per column
- Saves the summary as JSON

Setup and usage:
- Ensure Python 3.8+ is installed.
- (Optional) Create and activate a virtual environment:
    python -m venv .venv
    source .venv/bin/activate    # macOS/Linux
    .venv\Scripts\activate       # Windows
- Install pandas: pip install pandas
- Run: python main.py path/to/file.csv --out report.json
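
The Run command takes a CSV path and an `--out` option. A minimal sketch of how that command line could be parsed with `argparse` follows; the names `build_parser` and `csv_path`, and the choice to make `--out` optional, are assumptions for illustration and are not taken from the actual main.py.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for: python main.py path/to/file.csv --out report.json"""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Summarize a CSV file.",
    )
    # Positional argument: the CSV to summarize (the name csv_path is hypothetical).
    parser.add_argument("csv_path", help="path to the input CSV file")
    # Assumption: --out is optional; without it no JSON file would be written.
    parser.add_argument("--out", default=None, help="path for the JSON summary")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(["path/to/file.csv", "--out", "report.json"])
print(args.csv_path, args.out)
```

In a real script, calling `parse_args()` with no arguments would read from `sys.argv` instead.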
Example:
$ python main.py data/titanic.csv --out summary.json
Produces summary.json like:
{
  "n_rows": 891,
  "n_cols": 12,
  "columns": ["PassengerId", "Survived", ...],
  "dtypes": {"PassengerId": "int64", "Survived": "int64", ...},
  "missing_counts": {"Age": 177, "Cabin": 687, ...}
}
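
A summary with this shape could be computed with pandas roughly as follows. This is a sketch under stated assumptions, not the tool's actual code: the function name `summarize` is hypothetical, the sample data is made up for the demo, and it is assumed that `missing_counts` lists every column (including those with zero missing values).

```python
import io
import json

import pandas as pd


def summarize(df: pd.DataFrame) -> dict:
    """Return a JSON-serializable summary of a DataFrame (hypothetical helper)."""
    return {
        "n_rows": int(df.shape[0]),
        "n_cols": int(df.shape[1]),
        "columns": [str(col) for col in df.columns],
        # dtypes are converted to strings such as "int64" so json can serialize them.
        "dtypes": {str(col): str(dtype) for col, dtype in df.dtypes.items()},
        # Assumption: every column is reported, even when nothing is missing.
        "missing_counts": {str(col): int(n) for col, n in df.isna().sum().items()},
    }


# Tiny in-memory CSV standing in for a real file; the second row has no Age.
sample = io.StringIO("PassengerId,Age\n1,22\n2,\n3,35\n")
summary = summarize(pd.read_csv(sample))
print(json.dumps(summary, indent=2))
```

Writing the result to the path given by `--out` would then be a matter of passing an open file to `json.dump`.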