Chancharik Mitra
### Question
Hello! I would like to run an experiment using LLaVA-1.5 without images (to clarify: it is important for my work to specifically prompt LLaVA-1.5 without images, _not_ Vicuna-13B)...
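For reference, a minimal sketch of what such a text-only prompt could look like, assuming the Hugging Face `transformers` LLaVA integration rather than the original LLaVA repo; the checkpoint ID, prompt, and generation settings below are placeholders, not details from the question:

```python
# Sketch (assumption): text-only prompting of a LLaVA-1.5 checkpoint via
# Hugging Face transformers. No <image> token and no pixel_values are passed,
# so the vision tower is never invoked and the LLaVA-1.5 LM weights (not a
# plain Vicuna-13B checkpoint) answer from text alone.
import torch
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-13b-hf"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "USER: What is the capital of France?\nASSISTANT:"  # placeholder query
inputs = processor(text=prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(processor.decode(output[0], skip_special_tokens=True))
```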
Hello, Brady! Thanks for compiling so many great methods into this very helpful resource. Our paper presents a multimodal CoT method that has been out for a little while and...
I have successfully visualized individual tissue samples with a standard output structure using `SpatialFeaturePlot`, but I would like to visualize multiple samples from one of Space Ranger's _aggregated outputs_. The output...
Hello! First of all, this is really fascinating work. Thanks for the contribution. I wanted to reach out and ask if you could share the evaluation scripts for mPLUG-OWL2 you...
First of all, thanks for the contribution! I was interested in probing the results of your method on datasets like SEEDBench and MMBench. Do you have any scripts, special prompting...
Hello, and thanks for such a great contribution to the field of interleaved LMMs! This is really great work. I was wondering if there was an example of the format...
Hi, thanks for your fantastic video foundation model! I was interested in exploring the capabilities of InternVideo2-Chat for both images and video. According to the Huggingface code, the model can...
Hi, simple question about your fantastic model! I see that your multi-image demo uses image tokens while your single high-resolution image demo does not. Is the usage of image tokens...
This PR includes the following additions:
1. Support for video VQAScore inference for all previously supported T2V models (the existing image-level call pattern it extends is sketched below).
2. Addition of new interleaved image/video LMM models.
3. Updating of the README and...
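For context, and assuming this PR targets the `t2v_metrics` VQAScore library, the existing image-level scoring interface looks roughly like the sketch below; the checkpoint name and file paths are placeholders rather than values taken from the PR:

```python
# Illustrative sketch (assumption): the image-level VQAScore call pattern in
# t2v_metrics, which the video inference support in this PR would extend.
import t2v_metrics

# Placeholder checkpoint and paths, not values from the PR.
scorer = t2v_metrics.VQAScore(model="clip-flant5-xxl")
scores = scorer(
    images=["outputs/sample_0.png"],              # generated image to evaluate
    texts=["a red cube stacked on a blue sphere"],  # text prompt it should match
)
print(scores)  # one text-image alignment score per (image, text) pair
```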
Hello, thanks for contributing a very exciting model! I noticed that the interleaved and video inference examples given in the notebooks are set up with different configs as the model...