Johannes Vass
I just read about SGLang's approach to constrained decoding. Did you consider adding that to vLLM instead of Outlines? See, for example, this blog post: https://lmsys.org/blog/2024-02-05-compressed-fsm/
I'm also hitting a memory leak with vllm 0.2.7. For me it's not limited to Ray; it also affects the API server itself, no matter whether I use...
For now, my workaround is to set a memory limit and restart vLLM automatically after an OOM.
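One way to sketch that workaround is with Docker's memory cap plus a restart policy; this is only an illustration, and the image name, port, memory limit, and model are assumptions you'd adapt to your own deployment:

```shell
# Cap host RAM for the container; if the leak drives usage past the limit,
# the kernel OOM-kills the process and --restart brings the server back up.
docker run -d \
  --name vllm-server \
  --gpus all \
  --memory 16g \
  --restart unless-stopped \
  -p 8000:8000 \
  vllm/vllm-openai:latest \
  --model mistralai/Mistral-7B-Instruct-v0.2   # illustrative model choice
```

The same idea works with a systemd unit (`MemoryMax=` and `Restart=always`) if you run vLLM outside containers.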
> I have a similar question that might be related to it. I see that it's not possible (at least via GUI) to remove files at document set/connector level (hence,...
Which exact settings of the `GEN_AI_` variables did you try? For me the following works with a self-hosted Huggingface TGI:

```
GEN_AI_MODEL_VERSION=""
GEN_AI_MODEL_PROVIDER="huggingface"
HUGGINGFACE_API_BASE="https://xyz"
GEN_AI_API_ENDPOINT="https://xyz"
```

Disclaimer: I am unsure...
@mad-mikey do you have an estimate for when you will be able to contribute this?
There is already another issue regarding this: #984
```
In [1]: import DeepInstruments
Using TensorFlow backend.
---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
in <module>()
----> 1 import DeepInstruments

/Users/johannesvass/ownCloud/Studium/2017S_Bachelorarbeit/ismir2016/DeepInstruments/__init__.py in <module>()
     51 import DeepInstruments.audio
     52 import DeepInstruments.descriptors
...
```
### Problem Analysis The issue seems to be a breaking change in the `tokenizers` library (probably https://github.com/huggingface/tokenizers/pull/1476) which prevents an XLM-Roberta tokenizer saved with a version >= `0.19.0` from being...
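If that diagnosis is right, a possible stopgap (a sketch, assuming the incompatibility is purely version-related and `0.19.0` is indeed the breaking release) is to pin `tokenizers` below it before saving the tokenizer:

```shell
# Assumption: tokenizer files written by tokenizers >= 0.19.0 fail to load
# in older versions. Pin below the suspected breaking release so saved
# tokenizer.json files remain loadable by older consumers.
pip install "tokenizers<0.19.0"
```

This only avoids the mismatch; the proper fix would be to re-save the tokenizer once all consumers are on a compatible `tokenizers` version.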