fix: add_folder(depth="dataset") logs each dataset like other depth modes
fix: time series detection now recognizes compact YYYYMMDD dates, handles 4-digit years that share a constant 2-digit token, and splits mixed yearly/quarterly/monthly/daily granularities
fix: include/exclude as a bare string no longer iterates character-by-character
perf: depth: value materializes the source once (≤1M rows) and shares it across autotag, frequency, and pattern passes
perf: frequency value counts via PyArrow (~27× on wide datasets); cleaner float/timestamp formatting
perf: pattern frequency fully vectorized in PyArrow (~1.2×)
perf: incremental scan is batched: _seen updates, mark_datasets_seen, the _match_path index, the finalize cascade, orphan-tag detection, and last_update_date from cached mtime
perf: add_database re-applies cached metadata for unchanged tables in one batch
perf: enumeration assignment from frequencies looks up columns in O(1)
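
The include/exclude fix addresses a common Python pitfall: a `str` is itself iterable, so looping over a bare string yields single characters. A minimal sketch of the usual guard follows. The helper name `as_pattern_list` is hypothetical and is not taken from this project's code; the actual fix may look different.

```python
from typing import Iterable, List, Union


def as_pattern_list(value: Union[str, Iterable[str], None]) -> List[str]:
    """Hypothetical helper: normalize an include/exclude argument to a list."""
    if value is None:
        return []
    if isinstance(value, str):
        # A bare string is one pattern, not a sequence of characters.
        return [value]
    return list(value)


# Without the guard, list("*.csv") gives ['*', '.', 'c', 's', 'v'].
patterns = as_pattern_list("*.csv")
```

With this guard, `include="*.csv"` and `include=["*.csv"]` behave the same way.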
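
For the compact YYYYMMDD case in the time series fix, one possible approach is to look for a standalone run of exactly eight digits and validate it as a calendar date. This is only an illustrative sketch under that assumption: `find_compact_date` and the file names are hypothetical, and the project's real detection logic (including the granularity split) is not shown here.

```python
import re
from datetime import date, datetime
from typing import Optional

# Exactly eight digits, not embedded in a longer digit run.
_COMPACT_DATE = re.compile(r"(?<!\d)(\d{8})(?!\d)")


def find_compact_date(name: str) -> Optional[date]:
    """Hypothetical helper: return the first valid YYYYMMDD date in a name."""
    for match in _COMPACT_DATE.finditer(name):
        try:
            # strptime rejects impossible dates such as month 13.
            return datetime.strptime(match.group(1), "%Y%m%d").date()
        except ValueError:
            continue
    return None


found = find_compact_date("sales_20240131.csv")
```

Validating with `strptime` rather than the regular expression alone keeps eight-digit identifiers that are not real dates from being treated as time series tokens.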