chispa issues

Handle nested nullability

4

When using `ignore_nullable=True` chispa still sees differences in ArrayType because there's a nullable difference in the inner type: `StructField(my_arr_col,ArrayType(StringType,false),false)` `StructField(my_arr_col,ArrayType(StringType,true),true)`

machielg

help wanted

good first issue

Add parameter to enable DataFramesNotEqualError to be raised without printing the differences between the two dataframes

1

When calling the assert_df_equality and assert_approx_df_equality it will be good to have the option to not display the get_string(). Sometimes the output might be to long or truncated.I think this...

slavyolov

good first issue

Update dataframe_comparer.py

1

ignore row and/or column order paramters for `assert_approx_df_equality` function

fraserpal

`ignore_column_order` param for `assert_approx_df_equality` function

3

It would be great if we could avoid column order checking when using `assert_approx_df_equality`

fraserpal

good first issue

SchemasNotEqualError not show more columns in one shcema

1

zhangabner

help wanted

good first issue

feat(dataframe): display original dataframes in compare

1

Hi there! Thank you very much for your great library. In order to debug faster and see what went wrong, I came up with a simple solution of displaying original...

azachar

fixed structfield comparison when dataType is array

1

The [newly created] test below fails. This because `are_structfields_equal` doesn't check for the case when the dataType is an array. If the dataType is array, then the nullability shouldn't matter...

dfarren

Add allow_nan_equality option to assert_approx_df_equality

10

This pull request solves the issue by making a fairly big change to the API. Now, rather than having two assertion functions for both DataFrame and column comparison, there is...

mitches-got-glitches

Add support for NaN equality within Arrays

2

When trying to `assert_df_equality` with `allow_nan_equality=True`, if the both DataFrames hold an array that contains some `nan` values then the comparer fails, even if the `nan`s are in the same...

mitches-got-glitches

good first issue

Add allow_nan_equality option to assert_approx_df_equality

3

I feel like this would be quite useful. Were there any design choices for why it wasn't included or would this be a useful addition? https://github.com/MrPowers/chispa/blob/500793efe14b1975b86fb1a923ee6cd68ba559d8/chispa/dataframe_comparer.py#L38-L40

mitches-got-glitches

chispa
chispa copied to clipboard

Metadata

Handle nested nullability

Add parameter to enable DataFramesNotEqualError to be raised without printing the differences between the two dataframes

Update dataframe_comparer.py

`ignore_column_order` param for `assert_approx_df_equality` function

SchemasNotEqualError not show more columns in one shcema

feat(dataframe): display original dataframes in compare

fixed structfield comparison when dataType is array

Add allow_nan_equality option to assert_approx_df_equality

Add support for NaN equality within Arrays

Add allow_nan_equality option to assert_approx_df_equality

← Metadata

Owner

Metadata

chispa chispa copied to clipboard

Metadata

← Metadata

Owner

Metadata

chispa
chispa copied to clipboard