Shane Keller
Shane Keller
Hi, I agree that the documentation should be updated to make that clearer how to use. For now, see the [arbitrage example](https://github.com/enigmampc/catalyst/blob/master/catalyst/examples/arbitrage_with_interface.py) and have a look at the "exchange_name" parameter...
I'm not a contributor so I couldn't say. I'm just getting to know the repo myself!
@AakashKumarNain same question on this. What code did you end up using? How was the performance? I want to use a keras.utils.Sequence subclass to leverage [keras fit_generator](https://keras.io/models/sequential/#fit_generator), so I'm thinking...
I have a related question. What's the recommended way to use dask to read .tiff images stored at `gcs`? I posted my question here: https://stackoverflow.com/questions/58422292/is-it-possible-to-read-a-tiff-file-from-a-remote-service-with-dask
Thanks for the suggestions. I'll circle back after I explore. On Wed, Oct 16, 2019 at 6:36 PM Genevieve Buckley wrote: > So I don't really use google cloud storage,...
Hi, I spun up a Kubernetes cluster and am running dask on it using the helm chart. I followed the dask blog post you shared as a template. ``` import...
No worries. Thanks. On Thu, May 7, 2020 at 1:28 AM Genevieve Buckley wrote: > I'm sorry I completely missed your last message @skeller88 > > > My best guess...
This 2-pass shuffle algo could be a starting point for shuffling improvements: https://blog.janestreet.com/how-to-shuffle-a-big-dataset/?utm_source=share
@akhilvasvani those repos have some useful content, but I'm not seeing answers to these specific questions in there. Is there a specific file in each repo that has the answers?
Is `gcsfs` thread-safe? A dask worker could be running multiple threads. For example: ``` fs = gcsfs.GCSFileSystem(project='project_name') def read_from_gcs(filename): r = fs.cat(filename) return imageio.core.asarray(imageio.imread(r, 'TIFF')) delayed_read = dask.delayed(read_from_gcs, pure=True) ```