Results 3 issues of tic-top

MT-Bench | AGIEval | BBH MC | TruthfulQA | MMLU | HumanEval | BBH CoT | GSM8K

enhancement
regression

I want to convert this small 1.1B Llama 2 architecture model, [PY007/TinyLlama-1.1B-intermediate-step-240k-503b](https://huggingface.co/PY007/TinyLlama-1.1B-intermediate-step-240k-503b), to the llama2.c format. (Layers: 22, Heads: 32, Query Groups: 4, Embedding Size: 2048, Intermediate Size (SwiGLU): 5632) Then I...
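As a rough sketch of what those numbers imply for a conversion, the snippet below derives the per-head and key/value projection sizes. It assumes that "Query Groups" means the number of key/value heads in grouped-query attention. The class and field names are illustrative only and are not taken from either repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelShape:
    """Hypothetical container for the hyperparameters quoted in the issue."""
    n_layers: int = 22
    n_heads: int = 32       # query heads
    n_kv_heads: int = 4     # assumption: "Query Groups" == number of KV heads
    dim: int = 2048         # embedding size
    hidden_dim: int = 5632  # SwiGLU intermediate size

    @property
    def head_dim(self) -> int:
        # Each query head gets an equal slice of the embedding dimension.
        return self.dim // self.n_heads

    @property
    def kv_dim(self) -> int:
        # Output width of the key and value projections under GQA.
        return self.n_kv_heads * self.head_dim

    @property
    def heads_per_group(self) -> int:
        # Number of query heads that share one key/value head.
        return self.n_heads // self.n_kv_heads


shape = ModelShape()
print(shape.head_dim, shape.kv_dim, shape.heads_per_group)
```

Under that assumption the key and value projections are narrower than the query projection (256 versus 2048 output features), so an exporter that assumes equal query and key/value head counts would misread the weight shapes.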

# What does this PR do?

#30877

Implementation of Kosmos-2.5 in transformers. https://huggingface.co/kirp/kosmos2_5/blob/main/README.md

# Usage

```python
from PIL import Image
import requests
import torch
from transformers import AutoProcessor, AutoModelForVision2Seq, AutoConfig...
```

run-slow