flow
flow copied to clipboard
Flow PHP - data processing framework
I've been looking in the documentation for several days, but I can't find the way, nor examples. To extract data from a parquet hosted in the aws S3... I don't...
Because of the internal index in [PSRSimpleCache](https://github.com/flow-php/flow/blob/1.x/src/core/etl/src/Flow/ETL/Cache/PSRSimpleCache.php#L54) every single time we want to add something into the cache, we need to perform following operations: - check if an index exists...
https://parquet.apache.org/docs/file-format/data-pages/encodings/#a-namedeltaencadelta-encoding-delta_binary_packed--5
Parquet comes with very handy mechanism called "Column Statistics" which says for example what are the min/max values, total number of null values etc. By reading those statistics we won't...