Fabian Steeg

Results 35 issues of Fabian Steeg

This is a WIP-PR to track the changes in metafacture-core used in oersi-etl (https://gitlab.com/oersi/oersi-etl) via matafacture-fix (sample workflows: https://gitlab.com/oersi/oersi-etl/-/blob/develop/data/). For merging these, it might make sense to cherry-pick into new,...

This is part of https://github.com/metafacture/metafacture.github.io/issues/10 Improve [content](https://github.com/metafacture/metafacture-documentation/blob/master/flux-commands.md) of [HelpPrinter.java](https://github.com/metafacture/metafacture-core/blob/master/metafacture-flux/src/main/java/org/metafacture/flux/HelpPrinter.java): - [ ] Group commands by type (Readers, Decoders, Encoders, Writers) - [ ] Explain signatures (a general explanation about how...

Flux
Documentation

This is part of https://github.com/metafacture/metafacture.github.io/issues/10 Add filtering support in flux-commands.md, e.g. generate filter 'buttons' for each type as input and output, e.g. `->StreamReceiver` for all decoders, `StreamReceiver->` for all encoders,...

Documentation

This is part of https://github.com/metafacture/metafacture.github.io/issues/10 Automated creation and publishing of [flux-commands.md](https://github.com/metafacture/metafacture-documentation/blob/master/flux-commands.md) (which is basically the output of `sh flux.sh > flux-commands.md`, see https://github.com/metafacture/metafacture-core/commit/4d80ab4a87eed552fb36de722765c2f1ae6e7923; Java implementation is in [HelpPrinter.java](https://github.com/metafacture/metafacture-core/blob/master/metafacture-flux/src/main/java/org/metafacture/flux/HelpPrinter.java)): - [...

Documentation
Dev Infrastructure

In metafacture-biblio, we depend on `org.dspace:oclc-harvester2:0.1.12` (see [details](https://github.com/metafacture/metafacture-core/blob/f8f7285cbe65cab8f4ffd52d183afd1f5bed0d28/metafacture-biblio/build.gradle#L24)). It's the only version of the OCLC harvester published to Central (see https://mvnrepository.com/artifact/org.dspace/oclc-harvester2). There is a GitHub repo at https://github.com/OCLC-Research/oaiharvester2 which contains...

Originally developed in OERSI. Will resolve #443.

Reads sitemap from URL, sends each `loc` URL to the receiver. e.g. `"https://hoou.de/sitemap.xml" | read-sitemap | open-http ...` in a Flux workflow to process every document linked in the sitemap....

In the [lobid-gnd reconciliation service](https://lobid.org/gnd/reconcile) we have an `extends.property_settings` item to `select` if a property should be added as an ID or a label. This only makes sense for non-reconciled...

API design

Use case is a workflow where we use `open-http` to POST records to an API. When working with large input data, we'd like to wait a short amount of time...