vody-am
@lvhan028 Google Translate has been working well, so yes, I will give it a shot :joy:
@lvhan028 while I am here -- is there anything special one has to set in order to use multiple GPUs? So far I have experimented with one container per GPU, but it...
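For the one-container-per-GPU setup mentioned above, the usual mechanism is NVIDIA's `CUDA_VISIBLE_DEVICES` environment variable; a minimal sketch (the GPU index `"1"` is just an example, not something from this thread):

```python
import os

# Restrict this process/container to a single GPU. This must be set before
# any CUDA-using framework (torch, etc.) is imported, otherwise the
# framework may have already enumerated all devices.
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

# Frameworks imported after this point see exactly one device, exposed as
# device index 0 inside the process.
```

In a container setup the same variable is typically passed at launch time (e.g. `docker run -e CUDA_VISIBLE_DEVICES=1 ...`) rather than set in code.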
@irexyc your observation is correct: limiting the batch size to `3` appears to alleviate this issue. It would be a good idea to make this a configurable parameter.
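As a sketch of what "configurable" could mean here, requests would be split into chunks no larger than a user-supplied cap; `chunk_requests` and `max_batch_size` are illustrative names, not an existing parameter of the project discussed in this thread:

```python
def chunk_requests(requests, max_batch_size=3):
    """Split pending requests into batches of at most max_batch_size.

    max_batch_size defaults to 3, the value that alleviated the issue above.
    """
    return [requests[i:i + max_batch_size]
            for i in range(0, len(requests), max_batch_size)]

# Seven pending requests get split into batches of 3, 3, and 1.
batches = chunk_requests(list(range(7)), max_batch_size=3)
```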
Oh, I also have an interest in reading SentencePiece tokenizers, in order to invoke the SigLIP text transformer in Rust! EDIT: using the library mentioned by Eric...
@erikreed I got it working with the base (not chat) model. The evaluation scripts serve as good examples, e.g.:

```py
from transformers import AutoModelForCausalLM, AutoTokenizer
import time
import torch

MODEL_ID...
```
Perhaps relevant under a separate issue, but I would like to chime in that I could help with measurement and performance work if some issues are created and/or a discussion is...
Confirming that `go build` with the above patch produces a working version on M4.