Charles Blackmon-Luca
Rough attempt at implementing cuDF's `DataFrame.quantiles`; shares a lot of common logic with `sort_values`, as the indexer that sorts the dataframe by all columns is ultimately what is used to...
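The idea of reusing a sort indexer to pick quantile rows can be illustrated with a minimal pure-Python sketch. This is not cuDF's implementation: the helper names `sort_indexer` and `row_quantiles` are hypothetical, rows are modelled as plain tuples, and "lower" interpolation is an assumption made only to keep the example short.

```python
import math


def sort_indexer(rows):
    """Return the row positions that sort `rows` by all columns, left to right."""
    return sorted(range(len(rows)), key=lambda i: rows[i])


def row_quantiles(rows, qs):
    """Pick whole rows at the requested quantiles (assumed 'lower' interpolation)."""
    if not rows:
        raise ValueError("cannot take quantiles of an empty table")
    order = sort_indexer(rows)  # the same indexer a full sort would use
    n = len(rows)
    picked = []
    for q in qs:
        if not 0.0 <= q <= 1.0:
            raise ValueError(f"quantile {q} is outside [0, 1]")
        pos = math.floor(q * (n - 1))  # position within the sorted order
        picked.append(rows[order[pos]])
    return picked


rows = [(3, "c"), (1, "b"), (2, "a"), (1, "a")]
order = sort_indexer(rows)
result = row_quantiles(rows, [0.0, 0.5, 1.0])
```

Because the quantile rows are looked up through the indexer, the table itself never has to be materialized in sorted order, which is the overlap with `sort_values` that the description points at.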
Attempts to unblock issues with `_GroupBy.agg` when grouping on all of a dataframe's columns. Right now this is done by just using `_GroupBy._aca_agg` in cases where we have no non-group...
**What happened**: When attempting to compute `cumsum` on a series that is the result of a `map_partitions` call, I am getting an unexpected `AttributeError`: `Traceback (most recent call last):`...
Motivated by https://github.com/rapidsai/cudf/pull/10830#discussion_r870657523, this PR attempts to consolidate some repetitive aspects of dask-cudf's groupby code by introducing `_make_groupby_agg_call`, which replaces all `groupby_agg` calls made in groupby.py and takes as input the...
This PR adds a helper function, `check_groupby_result`, to dask-cudf's groupby tests; it is used in the basic tests to ensure that we are using dask-cudf's `groupby_agg` function to compute the result...
## Report incorrect documentation

**Location of incorrect documentation**
- https://github.com/rapidsai/cuml/blob/branch-22.04/notebooks/target_encoder_walkthrough.ipynb

**Describe the problems or issues found in the documentation**
This notebook depends on the [criteo 1 TB benchmark](https://github.com/rambler-digital-solutions/criteo-1tb-benchmark), but there...
Closes #6853. Follows the [steps](https://github.com/conda-incubator/setup-miniconda#caching-environments) outlined in the `setup-miniconda` docs to cache the conda environment between test runs. Note that (as pointed out by @gjoseph92), since we use unpinned CI...
This PR adds a table of basic statistics (max, min, mean) for the worker metrics (CPU, memory, # of file descriptors, etc.) to the Workers tab of the dashboard: ...
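As a rough illustration of the kind of summary such a table holds, the sketch below computes max, min, and mean per metric across workers. It is not the dashboard's code: `summarize_metrics`, the worker names, and the metric keys (`cpu`, `memory`, `num_fds`) are all hypothetical placeholders.

```python
from statistics import mean


def summarize_metrics(workers, metrics):
    """Compute max/min/mean of each metric across all workers that report it."""
    summary = {}
    for name in metrics:
        values = [w[name] for w in workers.values() if name in w]
        if not values:
            continue  # no worker reported this metric, so leave it out
        summary[name] = {
            "max": max(values),
            "min": min(values),
            "mean": mean(values),
        }
    return summary


# Hypothetical per-worker readings, keyed by worker name.
workers = {
    "worker-a": {"cpu": 10.0, "memory": 2_000, "num_fds": 30},
    "worker-b": {"cpu": 30.0, "memory": 4_000, "num_fds": 50},
}
table = summarize_metrics(workers, ["cpu", "memory", "num_fds"])
```

Each entry of `table` corresponds to one column of the statistics table, with one row per statistic.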
This PR changes the implementation of `DaskFunctions` to support overloaded UDF definitions: - the `return_type` attribute has been replaced with `return_types`, a `HashMap`, mapping the potential input types of a...
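The shift from a single `return_type` to a `return_types` map can be sketched as follows. This is only an illustration in Python, with a `dict` standing in for the `HashMap`; the class `OverloadedFunction`, its methods, and the type names are hypothetical, and the sketch assumes the map goes from a function's input types to the matching return type, which the truncated description suggests but does not spell out.

```python
class OverloadedFunction:
    """A UDF registration that allows one return type per input signature."""

    def __init__(self, name):
        self.name = name
        # Assumed layout: tuple of input type names -> return type name.
        self.return_types = {}

    def add_overload(self, input_types, return_type):
        """Register (or replace) the return type for one input signature."""
        self.return_types[tuple(input_types)] = return_type

    def resolve(self, input_types):
        """Look up the return type for a call with the given input types."""
        key = tuple(input_types)
        if key not in self.return_types:
            raise TypeError(f"{self.name} has no overload for inputs {key}")
        return self.return_types[key]


udf = OverloadedFunction("my_udf")
udf.add_overload(["BIGINT"], "BIGINT")
udf.add_overload(["DOUBLE"], "DOUBLE")
resolved = udf.resolve(["DOUBLE"])
```

Keying on the full tuple of input types means registering the same name twice adds an overload instead of overwriting the earlier definition.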