Ed Andrews

Results 2 comments of Ed Andrews

here's a reproducible example of the issue using pyarrow ```python import pyarrow as pa import dask.dataframe as dd import pandas as pd # Generate a parquet with a nullable int...

Performance is the main motivation: - most of the data we work with is in parquet format where conversion from columnar to row-major pandas format tends to be a major...