-
Notifications
You must be signed in to change notification settings - Fork 590
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refactor: rework registration / file loading #5005
Conversation
e1ec4ff
to
a3b2256
Compare
Codecov Report
@@ Coverage Diff @@
## master #5005 +/- ##
===========================================
- Coverage 95.08% 82.94% -12.15%
===========================================
Files 401 210 -191
Lines 44763 23259 -21504
Branches 4391 3245 -1146
===========================================
- Hits 42563 19292 -23271
- Misses 1693 3579 +1886
+ Partials 507 388 -119
|
a3b2256
to
c30e22b
Compare
|
This will close #4906 This is definitely the best way to go about it, regex dispatcher was an issue when I was looking at this |
ec8117c
to
455b677
Compare
455b677
to
df8f6e4
Compare
404ffde
to
34f327e
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good! A few questions about coverage/testing.
48fd0e8
to
d1a025d
Compare
87ecd8a
to
4ac4045
Compare
Introduces explicit `read_` methods for various types of file / in-memory object loading. Removes the `RegexDispatcher` from the DuckDB, polars, and datafusion backends Should be backwards compatible with existing register functionality, plus users can now pass in iterables of CSV or parquet files (duckdb only), and also make use of the explicit `read_` methods if they choose (say if filenames are non-standard) Somehow, Datafusion can't handle a `count` so I've added a gross conditional to the tests for the moment. This also removes the last usage of RegexDispatcher, so I've removed that from `dispatch.py` and also removed the odo license file, which is now no longer required.
4ac4045
to
2ef837d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
![]()
Introduces explicit
read_methods for various types of file /in-memory object loading.
Removes the
RegexDispatcherfrom the DuckDB, polars, and datafusion backendsShould be backwards compatible with existing register functionality,
plus users can now pass in iterables of CSV or parquet files (duckdb only), and also
make use of the explicit
read_methods if they choose (say if filenamesare non-standard)
Somehow, Datafusion can't handle a
countso I've added a grossconditional to the tests for the moment.