InternLM-XComposer
InternLM-XComposer copied to clipboard
Is the image token always necessary or not for InternLM-XComposer2.5?
Hi, simple question about your fantastic model! I see that your multi-image demo uses image tokens while your single high-resolution image demo does not. Is the usage of image tokens only necessary for multiple images? And does not using the image token somehow implicitly indicate high-resolution image inputs or no?