jesterj icon indicating copy to clipboard operation
jesterj copied to clipboard

Implement JSON splitter, JSON scanner

Open dgoldenberg1234 opened this issue 9 years ago • 1 comments

We see both types of use-cases out there. JSON splitter is a common one: split JSON using a given JsonPath which identifies the start of a "document" within the JSON.

The other use-case is, one has a large JSON file(s) that one wants to treat as a "data source(s)"; so we'll want to have a scanner that reads it and does the same as the JSON splitter processor.

dgoldenberg1234 avatar Apr 05 '16 23:04 dgoldenberg1234

This issue will represent the specific case of building a scanner that parses JSON on the fly and emits documents during parsing (thus handling a json file that doesn't fit in memory, and isn't formatted with one document per line). Any other case can be handled by a Json split processor (once child docs are happy see #177) or by reading the file line by line (see #37).

nsoft avatar Apr 08 '23 16:04 nsoft