xarray-beam icon indicating copy to clipboard operation
xarray-beam copied to clipboard

Distributed Xarray with Apache Beam

Results 30 xarray-beam issues
Sort by recently updated
recently updated
newest added

Currently we have (from https://xarray-beam.readthedocs.io/en/latest/read-write.html): ```python with beam.Pipeline() as p: p | xbeam.DatasetToChunks(ds, chunks={'time': 1000}) | beam.MapTuple(print_summary) ``` ``` Key(offsets={'lat': 0, 'lon': 0, 'time': 0}, vars=None) with Key(offsets={'lat': 0, 'lon':...

Hello! Newbie here. :) Any tips for someone who would like to try an xarray-beam rechunking pipeline on AWS? (USGS is wedded to AWS at the moment)

For consistency with `DatasetToChunks`, consider calling this `chunks`. You might also add a `split_vars` parameter? _Originally posted by @shoyer in https://github.com/google/xarray-beam/pull/31#r713514917_

I experimented a bit more with this based on @mjwillson's suggestion. Amazingly, it seems that uses file-like objects in Xarray _does_ actually work as used here, though making a local...

Hiya! In some source datasets the original chunks have irregular sizes. For example one netcdf file per year, where some years are leap years hence shorter than others. Although the...

At today's Pangeo Forge meeting, @alxmrs told us a bit more about Xarray Beam. I'm happy to note that we are aligning a bit around certain abstractions. I'm not proposing...

These could facilitate directly opening data from Zarr using idiomatic patterns in Xarray-Beam (e.g., using Xarray's lazy indexing machinery instead of dask). I'm imaging `open_zarr()` returning a tuple of values...