Skip to content

Conversation

Copy link
Contributor

Copilot AI commented Jan 20, 2026

Plan to expose preprocessor APIs

  • Explore repository structure and understand preprocessing flow
  • Add public preprocess() method to AutoML class for task-level preprocessing
  • Add public preprocess() method to estimator classes for estimator-level preprocessing
  • Add tests for the new preprocessor APIs
  • Validate tests pass
  • Add documentation for the new API methods
  • Add example script demonstrating API usage
  • Run broader test suite to ensure no regressions
  • Address code review feedback
  • Fix formatting issues with pre-commit hooks

Summary

This PR successfully exposes the previously internal preprocessing methods as user-facing APIs, addressing issue #228.

Key Changes:

  1. AutoML.preprocess(X) - Task-level preprocessing API
  2. BaseEstimator.preprocess(X) - Estimator-level preprocessing API
  3. Comprehensive documentation and examples
  4. Full test coverage (8/8 tests passing)

Latest Changes:

  • Fixed formatting issues identified by pre-commit hooks (trailing whitespace, code formatting)
  • All tests continue to pass
Original prompt

This section details on the original issue you should resolve

<issue_title>API and documentation of preprocessors</issue_title>
<issue_description> There are two preprocessors: task-level and estimator-level. The former is automl._preprocess() and the latter is automl.model._preprocess(). The former should be called before the latter. Right now they are not exposed as user-facing APIs.
We can add some user-facing APIs in future version. If you have any ideas for that please feel free to suggest.

Originally posted by @sonichi in #228 (reply in thread)
</issue_description>

Comments on the Issue (you are @copilot in this section)


💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

Copilot AI and others added 4 commits January 20, 2026 04:19
Co-authored-by: thinkall <3197038+thinkall@users.noreply.github.com>
Co-authored-by: thinkall <3197038+thinkall@users.noreply.github.com>
Co-authored-by: thinkall <3197038+thinkall@users.noreply.github.com>
Co-authored-by: thinkall <3197038+thinkall@users.noreply.github.com>
Copilot AI changed the title [WIP] Expose task-level and estimator-level preprocessors as APIs Expose task-level and estimator-level preprocessors as public API Jan 20, 2026
Copilot AI requested a review from thinkall January 20, 2026 04:26
@thinkall thinkall marked this pull request as ready for review January 20, 2026 14:48
@thinkall
Copy link
Collaborator

@copilot, run "pre-commit run --all-files" to fix formatting issues.

Co-authored-by: thinkall <3197038+thinkall@users.noreply.github.com>
Copy link
Contributor Author

Copilot AI commented Jan 20, 2026

@copilot, run "pre-commit run --all-files" to fix formatting issues.

Done! Pre-commit hooks have been run and formatting issues fixed. Changes include:

  • Removed trailing whitespace
  • Applied Black code formatting
  • Fixed markdown formatting

All tests still pass. Commit: 254ef39

@thinkall thinkall merged commit d9e7403 into main Jan 21, 2026
16 checks passed
@thinkall thinkall deleted the copilot/add-api-for-preprocessors branch January 21, 2026 06:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

API and documentation of preprocessors

3 participants