Results 28 issues of ChrisJar

**Describe the bug** The dask_cudf merge functions returns too few rows when both the dtype of the column being merged on is mismatched (eg: `int64` on the left and `int32`...

bug
dask
inactive-30d

**Is your feature request related to a problem? Please describe.** I wish I could use cuML to calculate euclidean distance on data with missing values in the same way sci-kit...

feature request
? - Needs Triage
inactive-30d

Closes #608 Blocked by: https://github.com/rapidsai/cudf/issues/11515 Note: currently, performing multiple aggregations at once seems to result in incorrect values. Ex: `SELECT STDDEV(a) AS s1, STDDEV_POP(a) AS s2 FROM df` returns the...

datafusion

#629 Implemented STDDEV_POP on cpu, but it currently fails on gpu due to: https://github.com/rapidsai/cudf/issues/11515#issuecomment-1212305118

enhancement
needs triage

**What happened**: Queries that include multiple distinct aggregations on the same column (ex: `SUM(a)` and `AVG(a)`) return incorrect results. **Minimal Complete Verifiable Example**: ```python import pandas as pd from dask_sql...

bug
needs triage

**What happened**: Joining tables backed by dask_cudf dataframes with multiple partitions causes the error `AttributeError: 'Int64Index' object has no attribute '_get_attributes_dict'` to be thrown **Minimal Complete Verifiable Example**: ```python import...

bug
needs triage

**What happened**: Using the `worker_module` argument when starting a cluster causes an unexpected argument failure. **Minimal Complete Verifiable Example**: This is a piece of code from the Dask cloud provider...

bug

**Describe the bug** Some dask custom aggregations (ex: a custom sum of squares aggregation) fail with dask_cudf. **Steps/Code to reproduce bug** ``` import cudf import dask_cudf import dask.dataframe as dd...

bug
? - Needs Triage
inactive-30d