docs: add column_value_anomalies test documentation#2183
docs: add column_value_anomalies test documentation#2183devin-ai-integration[bot] wants to merge 3 commits intodocsfrom
Conversation
Co-Authored-By: Yosef Arbiv <yosef.arbiv@gmail.com>
🤖 Devin AI EngineerI'll be helping with this pull request! Here's what you should know: ✅ I will automatically:
Note: I can only respond to comments from users who have write access to this repository. ⚙️ Control Options:
|
|
Preview deployment for your docs. Learn more about Mintlify Previews.
💡 Tip: Enable Workflows to automatically generate PRs for you. |
|
Important Review skippedAuto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: defaults Review profile: CHILL Plan: Pro Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
…config Co-Authored-By: Yosef Arbiv <yosef.arbiv@gmail.com>
Co-Authored-By: Yosef Arbiv <yosef.arbiv@gmail.com>
|
update the pr description and screenshots to the latest version (with seasonality, without buckets) |
Summary
Adds documentation for a new
column_value_anomaliestest — a row-level anomaly detection test that operates directly on raw column values rather than on aggregate metrics (min, max, average) per time bucket.This is a docs-first PR ahead of the implementation in
dbt-data-reliability(#982). The new page:column_anomaliestest with a comparison tabletimestamp_column,where_expression,anomaly_sensitivity,anomaly_direction,detection_period,training_period,seasonality,detection_delaydocs.jsonKey design decisions:
time_bucketis excluded — since this test operates on raw individual values (no per-bucket aggregation), time bucketing is not applicable. The test usestraining_periodanddetection_perioddirectly.seasonalityis included — allows separate baselines per day-of-week (e.g., weekend vs weekday value distributions), which is useful even without time-bucket aggregation.Screenshots
Mintlify preview: https://elementary-devin-1775975833-column-value-anomalies-docs.mintlify.app/data-tests/anomaly-detection-tests/column-value-anomalies
Review & Testing Checklist for Human
<Note>block, and config block render correctly. Verify the config block showsseasonalityand does NOT showtime_bucket.timestamp_columnshould be required — this doc says it's required, unlike other anomaly tests where it's "highly recommended." Confirm this is desired.ignore_small_changes,anomaly_exclude_metrics, anddimensionsthat other anomaly tests support. Verify whether any of these should be supported for this test type.Notes
<Note>block.code-quality,create_pylon_issue) are pre-existing issues unrelated to this PR.Link to Devin session: https://app.devin.ai/sessions/97ebe636d57244fb82a7452e1521604e
Requested by: @arbiv