gpu-bdb issues

[Do Not Merge] Enable Query 27

This PR aims to re-enable query 27. It currently fixes the breakages, though we are still getting empty results. Don't know why.

VibhuJawa

Problem with load test

After generating test data using the PDGF tool and placing the .dat files under $DATA_DIR/SF, what should be the next step to run the benchmark? Running the load test generates .parquet...

MohammedElGharawy

Refactor to use public API `Series.str.find_multiple` in query 18

I noticed that gpu-bdb query 18 was using private methods from cudf for `find_multiple`: https://github.com/rapidsai/gpu-bdb/blob/f48c05d63d5cb4baa59708cb262506f6d9d3f4f1/gpu_bdb/bdb_tools/q18_utils.py#L21 and https://github.com/rapidsai/gpu-bdb/blob/f48c05d63d5cb4baa59708cb262506f6d9d3f4f1/gpu_bdb/bdb_tools/q18_utils.py#L127. cudf now has a public API that performs the same task: `Series.str.find_multiple`. This should be...

bdice

Add cpu backend to q28

ChrisJar

[CPU] ML Portion for GPU-BDB Queries

The queries below rely on cuML models for ML on the GPU. Depending on performance, we need to decide between a distributed (dask-ml) and a non-distributed (sklearn) implementation for the ML...

VibhuJawa

Update dockerfiles to build from ucx-py docker images

ucx-py recently added Dockerfiles that provide CUDA-enabled containers with all the prerequisites for building ucx+ib from source. We should update our images to use that as a central source...

ayushdg

[REVIEW] Enable using CPU backend with first set of dask queries

This PR enables using the CPU backend option with DataFrame queries 11, 12, 15, 16, 17 and 22. I also verified that the DataFrame versions of all of the other...

ChrisJar

Use unique name for interim result files

CPU backend for Queries 25, 26 and 30

Add CI testing to PRs
