-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow reading IPC files that use Arrow extension type #9373
Comments
I have created two PRs:
The Arrow2 PR needs to be merged first such that I can amend the Polars PR to depend on the official Arrow2 repository again. |
The above does not seem to work for extension types of type The |
To be honest, I have not been able to get the extension types to work smoothly with I'll close the PRs (they're still there in case anyone wants to pick up where I left) but I'm leaving this ticket open as a feature request. |
Problem description
This is partially related to #9112 but much narrower in scope.
As the official Apache Arrow specs state, an implementation reading an Arrow file using extension types should work by falling back to the known parent type.
Currently, using
LazyFrame::scan_ipc()
panics at the following line:polars/polars/polars-core/src/datatypes/field.rs
Line 158 in 9a17368
I suspect that the following
polars/polars/polars-core/src/datatypes/field.rs
Lines 146 to 155 in 9a17368
should be followed by something like
which would cause the function to recurse with the parent logical type when encountering an unknown extension.
The text was updated successfully, but these errors were encountered: