Sam Stokes
Sam Stokes
Bottled Water's current JSON encoding (`--output-format=json`) is as per the [Avro JSON encoding spec](https://avro.apache.org/docs/1.8.0/spec.html#json_encoding), which encodes binary data as a JSON string by mapping bytes 1-1 to Unicode codepoints in...
Right now a few things make it tricky to trial Bottled Water on an existing database taking production traffic: - the initial snapshot does a `SELECT *` from all tables,...
As discussed at more length in PR #45, json.c does some work to parse Avro schemas and objects that duplicates work already done in protocol_client.c. This has performance implications (parsing...
Postgres has a [`bytea` type for storing arbitrary binary data](http://www.postgresql.org/docs/9.4/static/datatype-binary.html). Values of this type pass safely through Bottled Water in Avro mode; however in JSON mode (`--output-format=json`) they get truncated...
If you create a table with a few columns, then drop all the columns (leaving a table with no columns), then insert a row, the Bottled Water extension aborts the...
Postgres has a [fixed-precision `money` type](http://www.postgresql.org/docs/9.4/static/datatype-money.html) for storing amounts of currency. Unfortunately when a value of this type passes through Bottled Water, it gets written out multiplied by 100 (presumably...
## Changes Made Adds an OTel Gauge for the selectivity metric that the swordfish filter op calculates, in order to emit it to OTel as well as to the internal...
## Changes Made Allows setting `DAFT_PROGRESS_BAR=persist` in the native runner to keep the progress bar visible after the job finishes running (vs the default behavior where it shows during the...
### Is your feature request related to a problem? Extends #5446 to all supported providers. In #5543 we added an optional `dimensions` parameter to `embed_text` for specifying the number of...