Jian Xiao
## Why are these changes needed? The actor handle held by the Ray client becomes dangling if the Ray cluster is shut down, and in that case if the user tries...
This test tries to simulate real user workloads. Right now it's a bit off, and we need to make the setup more realistic. Some feedback: - The batch...
Signed-off-by: jianoaix [email protected] ## Why are these changes needed? While it's recommended to use Ray Datasets or AIR to build batch prediction, there are use cases where users need to...
We have a guide now, but it's embedded in the NYC taxi data processing example: https://docs.ray.io/en/latest/data/examples/nyc_taxi_basic_processing.html#parallel-batch-inference When users come to Datasets, they may have a workload in mind, so our documentation...
This extracts lessons from Data on-call, where we saw user confusion around map_batches() regarding: - UDF needs to be picklable: this is an implicit requirement so far, and we...
## Why are these changes needed? This is a core API in Datasets for consuming the content held in a Dataset. We should add benchmark tests for this...