Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add include_cutoff_time arg #958

Closed
wants to merge 3 commits into from

Conversation

rightx2
Copy link
Contributor

@rightx2 rightx2 commented May 9, 2020

Add include_cutoff_time arg to control whether data at cutoff times are included in feature calculations and prevent traininig_window overlapping

Pull Request Description

There was a data overlapping problem when calculating the feature matrix: The data at cutoff time is used both in calculating features and in calculating target values(#918 ). This could be data cheating and affect the result as well. There was a trial to solve the issue (#930 ), but It still didn't solve the cheating problem. So, we decided to parameterize it to control whether data at cutoff times are included in feature calculations or not(#942 ) and this PR solves it.

@rightx2
Copy link
Contributor Author

rightx2 commented May 9, 2020

I will try with other pr (Missing update origin/master before sending PR)

@rightx2 rightx2 closed this May 9, 2020
@rightx2 rightx2 deleted the include_cutoff_time branch May 9, 2020 05:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant