Critical path to a formal pull request upstream #21
Some ticket updates:
Actions from today's meeting, where most of our conversation was around the expected behaviour when threading:
The problems I think we have are encapsulated here:

    import pyfive
    import s3fs

    # The endpoint is passed via client_kwargs; the first positional
    # argument of S3FileSystem is `anon`, not a URL.
    s3 = s3fs.S3FileSystem(client_kwargs={"endpoint_url": "http://some-s3-server/"})
    # thread zone 1
    with s3.open('my-bucket/my-file.txt', 'rb') as f:
        with pyfive.File(f) as hfile:
            uwind = hfile['zonal_velocity']
            ### thread zone 2
            r = uwind[x:y]  # where x and y are thread dependent
            ###
            rr = uwind[xx:yy]
            ## end of zone 2

We assume that threading higher up the stack would be OK, albeit expensive with caching etc.
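As a rough illustration of what "threading higher up the stack" could look like, here is a minimal sketch in which every thread opens its own private file handle and reads only its own range. It uses a plain local binary file from the standard library as a stand-in, not pyfive or s3fs, and the names `read_slice`, `threaded_read` and `demo` are hypothetical. With pyfive, the per-thread open is presumably where `pyfive.File(...)` would go, which is also why this pattern repeats the open and caching cost in every thread.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor


def read_slice(path, start, stop):
    """Open a private handle in the calling thread and read bytes [start, stop)."""
    # Each call opens its own handle, so no file position or buffer is
    # shared between threads. (Assumption: a pyfive-based version would
    # open its own pyfive.File here instead of a raw binary file.)
    with open(path, "rb") as f:
        f.seek(start)
        return f.read(stop - start)


def threaded_read(path, bounds, max_workers=4):
    """Read each (start, stop) range in a worker thread; results keep input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(read_slice, path, a, b) for a, b in bounds]
        return [fut.result() for fut in futures]


def demo():
    """Write a small file, read it back in threaded slices, and compare."""
    payload = bytes(range(256)) * 4
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(payload)
        path = tmp.name
    try:
        bounds = [(0, 256), (256, 512), (512, 1024)]
        parts = threaded_read(path, bounds)
        return b"".join(parts) == payload
    finally:
        os.remove(path)
```

The trade-off is the one noted above: nothing is shared, so nothing needs locking, but each thread pays for its own open and its own cache.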
Progress update:
Pseudo-chunking delivered in f450776. Killed the relevant branch.
We have a showstopper issue: variable-length strings. I was aware of this (but thought we could live without it for now, #16), but @davidhassell has shown it is a real problem for real data we use (#29).
(Vlen support dealt with.)
Pull request submitted.
Minutes of our meeting on the necessary steps before the upstream pull request:
cf-python (we are trying to ensure we don't have to make any foreseeable changes here because they could complicate the move upstream).
h5netcdf branch and archive the others.
DatasetID to satisfy h5netcdf unit tests, but we really don't want to do that at scale. @davidhassell is going to look at the relevant ticket (Why not caching h5py dataset? h5netcdf/h5netcdf#251) on h5netcdf (@bnlawrence will create another branch on our pyfive which can be used for re-exposing the issue).