
docs: update notebook csv inputs #96

Merged
lipikaramaswamy merged 3 commits into main from lipikaramaswamy/docs/release-update
Apr 7, 2026

@lipikaramaswamy (Collaborator) commented Apr 7, 2026

Summary

Updates the docs notebooks to load sample datasets from GitHub raw URLs instead of repo-relative paths, so they work outside a local checkout. Also updates input validation and reader logic to accept HTTP(S) CSV and Parquet sources, with focused tests covering the new behavior.
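As a rough illustration of what accepting HTTP(S) CSV and Parquet sources can involve, the sketch below classifies an input string as remote or local and picks a format from its extension. The name `detect_source_format`, the suffix mapping, and the error message are hypothetical and not taken from this PR; the actual validation and reader code may be organized differently.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Hypothetical mapping; the PR only states that CSV and Parquet are accepted.
SUPPORTED_SUFFIXES = {".csv": "csv", ".parquet": "parquet"}


def detect_source_format(source: str) -> tuple[bool, str]:
    """Return (is_remote, format) for a local path or an HTTP(S) URL."""
    parsed = urlparse(source)
    is_remote = parsed.scheme in ("http", "https")
    # For URLs, read the suffix from the path component so query strings are ignored.
    path = parsed.path if is_remote else source
    suffix = PurePosixPath(path).suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        raise ValueError(f"Unsupported input extension: {suffix!r}")
    return is_remote, SUPPORTED_SUFFIXES[suffix]
```

Taking the suffix from the parsed URL path rather than the raw string keeps a URL such as `.../data.csv?raw=true` recognizable as CSV.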

Type of Change

  • Bug fix
  • New feature
  • Breaking change
  • Documentation update
  • Refactoring

Testing

  • make test passes locally
  • make check passes locally (format + lint + typecheck + lock-check) - except ty
  • Added/updated tests for changes

Documentation

  • If docs changed: make docs-build passes locally

…ative paths, update io / config to accept http(s) input csv or parquet
@lipikaramaswamy lipikaramaswamy requested a review from a team as a code owner April 7, 2026 18:00
Comment thread tests/config/test_anonymizer_config.py
@andreatgretel (Collaborator) left a comment

LGTM!

couple small things:

  • ftp://example.com/data.csv falls through to local path validation and gives "Input path does not exist: ftp:/example.com/data.csv" - a guard for unsupported URL schemes in validate_source_path would give a clearer error

  • the remote URL error paths (unsupported extension, network failure) don't have test coverage - the existing error-path tests only exercise local files

nothing blocking
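
The first point in the review above could be addressed along these lines. This is a minimal sketch that assumes `validate_source_path` takes a string and returns it on success; the real signature, return type, and error class in the project are not shown in this PR, so everything below is a hypothetical stand-in.

```python
from pathlib import Path
from urllib.parse import urlparse

# Assumed set of remote schemes, based on the PR summary (HTTP and HTTPS only).
REMOTE_SCHEMES = {"http", "https"}


def validate_source_path(source: str) -> str:
    """Hypothetical stand-in for the project's validator."""
    parsed = urlparse(source)
    # Treat the input as a URL only when it has the "scheme://" form,
    # so Windows drive letters such as "C:\data.csv" still count as local paths.
    if "://" in source and parsed.scheme:
        if parsed.scheme not in REMOTE_SCHEMES:
            raise ValueError(
                f"Unsupported URL scheme '{parsed.scheme}' in input: {source}"
            )
        return source
    # Anything else falls through to the local existence check.
    if not Path(source).exists():
        raise ValueError(f"Input path does not exist: {source}")
    return source
```

With a guard like this, `ftp://example.com/data.csv` fails with a message that names the scheme instead of a mangled local path. The same function also gives the remote error paths something concrete to assert on in tests, without needing network access.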

@lipikaramaswamy lipikaramaswamy merged commit 7add170 into main Apr 7, 2026
11 checks passed
@lipikaramaswamy lipikaramaswamy deleted the lipikaramaswamy/docs/release-update branch April 7, 2026 19:00

3 participants