Katharine Xiao

Results 20 issues of Katharine Xiao

### Problem Description Currently, SDV only supports individual column keys as primary keys. We should extend this to support composite keys (multiple columns that uniquely identify each row when put...

feature request
data:single-table
data:multi-table

### Problem Description CTGAN fails to finish in reasonable times on datasets like Airbnb and Housing Market. ### Expected behavior We should review CTGAN speed on big datasets by figuring...

feature request
data:single-table
feature:performance

### Problem Description When modeling a categorical column with two categories, sometimes the less frequent value is not sampled. ### Expected behavior Modeling the binary categorical column as a boolean...

feature request
feature:rdt

The field distribution mapping in GaussianCopula contains internally transformed columns (e.g. `col.value` and `col.is_null`). Should the user-facing API contain those columns? Is there a way to make it more user...

feature request
data:single-table

Remove support for py36

### Problem Description Add support for conditional sampling in HMA1 ### Expected behavior Users should be able to call `sample` with conditions in HMA1.

feature request
data:multi-table

### Problem Description Currently, we are fixing the random_state using a [decorator](https://github.com/sdv-dev/Copulas/blob/master/copulas/__init__.py#L39) that wraps around the desired functions (sample). This wrapper sets the global random state to the model's desired...

internal

### Problem Description Current, the summary metrics from the `sdgym summary` command are `% coverage, avg time, avg score`. We can update them to be more informative about the synthesizers'...

new feature