Ferdinand Schlatt
Ferdinand Schlatt
It might be helpful to add a note in the data section of the README that the passage corpus file needs to have ids starting at 0, are sorted and...
The preferred term for every match is also returned (useful for normalizing terms in a text). The RAM footprint is reduced by removing the sets in which the terms are...
When running `python -m quickumls.install` on an MRCONSO.RRF file with about 7M rows, the memory footprint continuously grows and some point the process is killed because of using too much...
I'm using quantulum to extract time units (5 days, 1 month, etc.), but ran into the issue that in my particular case, units are often given in reverse order: e.g....
I wanted to index a collection from in-memory documents. The collection loader already supports passing in-memory documents, the only issue was that the config would store and save the entire...
### Feature request Simplify downloading and streaming datasets locally. Specifically, perhaps add an option to `load_dataset(..., streaming="download_first")` or add better support for streaming symlinked or arrow files. ### Motivation I...
Enables streaming from local symlinks #7083 @lhoestq
When trying to login with SAML SSO with google (this is the only option possible for me), the authentication fails. An external browser window opens and notion is opened there....