Add AI-powered regex pattern detection for automatic schema enhancement #6
This PR implements an optional AI-powered pattern detection feature that automatically identifies regex patterns in data fields to enhance schema generation with appropriate validation matchers.
Overview
The data stream comparator now supports pattern detection that automatically generates regex patterns for common data types such as emails, phone numbers, and URLs. This feature works in two modes:
Key Features
Automatic Pattern Detection
Flexible Configuration
Built-in Pattern Recognition
The offline mode includes recognition for:
Implementation Details
patterndetection package with pluggable detector interface
Usage Example
The feature adds regex validation matchers to generated schemas and is optional, so existing configurations keep working unchanged.
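As a rough illustration of the pluggable detector idea, the sketch below shows one way an offline detector could map sample field values to a regex and attach it to a schema entry. It is not the PR's code: the names `PatternDetector`, `OfflineDetector`, `enhance_schema`, the pattern table, and the schema shape are all hypothetical, and Python is used only for illustration since the repository's language is not shown here.

```python
import re
from typing import Dict, List, Optional, Protocol, Sequence


class PatternDetector(Protocol):
    """Hypothetical detector interface: return a regex for a field, or None."""

    def detect(self, values: Sequence[str]) -> Optional[str]: ...


class OfflineDetector:
    """Checks sample values against a small table of built-in patterns."""

    # Illustrative, deliberately simple patterns (assumed, not the PR's table).
    BUILTIN: Dict[str, str] = {
        "email": r"^[^@\s]+@[^@\s]+\.[^@\s]+$",
        "url": r"^https?://\S+$",
        "phone": r"^\+?[0-9][0-9 ()-]{6,}$",
    }

    def detect(self, values: Sequence[str]) -> Optional[str]:
        if not values:
            return None  # nothing to infer from an empty sample
        for pattern in self.BUILTIN.values():
            # Accept a pattern only if every sampled value matches it.
            if all(re.fullmatch(pattern, v) for v in values):
                return pattern
        return None


def enhance_schema(
    samples: Dict[str, List[str]], detector: PatternDetector
) -> Dict[str, Dict[str, str]]:
    """Build a minimal schema, adding a 'pattern' matcher where one is detected."""
    schema: Dict[str, Dict[str, str]] = {}
    for field, values in samples.items():
        entry = {"type": "string"}
        pattern = detector.detect(values)
        if pattern is not None:
            entry["pattern"] = pattern
        schema[field] = entry
    return schema


if __name__ == "__main__":
    samples = {
        "contact": ["a@example.com", "b@example.org"],
        "name": ["Ann", "Bob"],
    }
    print(enhance_schema(samples, OfflineDetector()))
```

Under this assumed design, an AI-backed detector would be another class with the same `detect` method, so the schema generator would not need to know which mode is active. Fields with no detected pattern are left without a matcher, which is one way to keep existing schemas unchanged.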