Mantis issues

Nice work! Does Mantis have an image separator when sending to the LLM?

6

Hi, I want to ask: does Mantis use an image separator between images when sending them to the LLM? From what I can tell, LLaVA doesn't have one, and the data used in Mantis doesn't provide a...

lucasjinreal

Support for Idefics3

Hi, thank you for your work on this library. I'd like to know if there's any planned support for Idefics3. This model seems to be better than Idefics2 for visual...

chris-tng

Supplement: When reading the local offline json file, the data is not filtered according to the max_image_size passed in.

BrenchCC

I tried to deploy Mantis on my own server for some tests. Do you have any suggestions for tools that can make Mantis run faster once deployed?

2

I am trying to add Mantis to the supported model list in vLLM or SGLang.

BrenchCC

Confusion about Eval and Instruct in README

Hello! I'm a big fan of your Mantis paper, I really like it! (and thanks for this repo!) I have a simple question, to clarify the reproducibility. In the README...

emanuelevivoli

Which transformers version should we use?

3

I am running into the below issue when I train VideoScore: Training model... Parameter Offload: Total persistent parameters: 706800 in 348 params 0%| | 0/576 [00:00

kabalao

[rank2]: AttributeError: 'Collator' object has no attribute 'tokenizer'

1

Hi, nice project, thanks for sharing it! I have been trying to run the classifier fine-tuning code, but I keep getting this error: .... [rank2]: Original Traceback (most recent call...

lis-kp

Regarding CUDA out of memory error only during validation

Hello, thank you so much for your work! I am trying to fine-tune the Mantis model for multi-image question answering. For the time being I just want to check if...

Aafiya-H

Great work! It's a very impressive and capable multimodal model. I was looking through the model files and noticed an implementation for qwen2_vl_vae. However, I couldn't find any corresponding experimental...

lian700

Update data.py

Does qwen2_vl_vae work?
