[Python] hdf5 reader/writer (with optional dependencies) #3520

Open
ritchie46 opened this issue May 28, 2022 · 4 comments
Labels: feature, good first issue

Comments

ritchie46 (Member) commented May 28, 2022

We can reduce friction by figuring out how to load HDF5 data into Polars memory as efficiently as possible.

ritchie46 added the feature and good first issue labels May 28, 2022
jorgecarleitao (Collaborator) commented May 28, 2022

If there is a backend for this in Rust, I think we could work it out in arrow2. It is quite an important format, imo.

ritchie46 (Member, Author) commented May 28, 2022

That's a good idea too!

ghuls (Collaborator) commented May 28, 2022

HDF5 has a very large specification: https://docs.hdfgroup.org/hdf5/develop/_f_m_t3.html
Because HDF5 is structured much like a filesystem, the same data can be stored in quite a lot of different ways.

Rust bindings to libhdf5 can be found at: https://github.com/aldanor/hdf5-rust
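
To illustrate the point about layouts, here is a minimal sketch in Python, assuming the `h5py` and `numpy` packages are installed. It stores the same two-column table twice in one file: once as a group holding one dataset per column, and once as a single dataset of records. The file name, group name, column names, and both helper functions are made up for illustration; a real reader would have to cope with these and many other layouts.

```python
import os
import tempfile

import h5py
import numpy as np


def write_two_layouts(path: str) -> None:
    """Store the same two-column table in two different HDF5 layouts."""
    ids = np.array([1, 2, 3], dtype="i8")
    values = np.array([0.5, 1.5, 2.5], dtype="f8")

    with h5py.File(path, "w") as f:
        # Layout A: a group acts like a directory, with one dataset per column.
        columnar = f.create_group("columnar")
        columnar.create_dataset("id", data=ids)
        columnar.create_dataset("value", data=values)

        # Layout B: a single dataset of records with a compound dtype.
        records = np.empty(3, dtype=[("id", "i8"), ("value", "f8")])
        records["id"] = ids
        records["value"] = values
        f.create_dataset("records", data=records)


def list_datasets(path: str) -> dict:
    """Walk the file like a directory tree and return {dataset path: shape}."""
    found = {}

    def visit(name, obj):
        # visititems calls this for every group and dataset below the root.
        if isinstance(obj, h5py.Dataset):
            found[name] = obj.shape

    with h5py.File(path, "r") as f:
        f.visititems(visit)
    return found


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example.h5")
    write_two_layouts(path)
    layout = list_datasets(path)
```

Both layouts hold the same values, but a reader that expects one dataset per column will not find the table stored as `records`, and vice versa.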

ritchie46 (Member, Author) commented May 30, 2022

I think we should explore both: a Rust backend under a feature flag, and a Python implementation as an optional dependency. I can imagine that the Rust backend would increase binary size quite a bit.

3 participants