Closed
Conversation
Collaborator
|
@gauthierpiarrette Thank you for your submission. Unfortunately, this PR doesn't meet our acceptance criteria: Repository age: The repository was created a week ago. We require a minimum of 3 months (or 6 months for Hidden Gem submissions) to ensure project stability. Please see our CONTRIBUTING.md for full requirements. You're welcome to resubmit once the project has matured and gained community traction. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Project
Timefence
Checklist
Add project-name* [project-name](url) - Description ending with period.Why This Project Is Awesome
Which criterion does it meet? (pick one)
Explain:
Temporal data leakage is a silent killer in ML: a standard LEFT JOIN can leak future data into training, inflating offline metrics while production fails. No error, no warning. Timefence is the only tool purpose-built to detect and fix this. It audits existing datasets or builds correct ones, runs on DuckDB with zero infrastructure, and plugs into CI/CD with one line.
How It Differs
If similar entries exist, what makes this one unique?
Data validation tools (Great Expectations, Soda) check schema, nulls, and distributions - not temporal correctness. Feature stores (Feast, Tecton) serve features at scale but don't validate point-in-time correctness. Timefence fills the gap: a lightweight, dedicated tool for catching when feature_time > label_time.