Qingyu Song

Results 3 comments of Qingyu Song

@amiryanj Hi, Did you check the distributions of train-set, validation-set, and test-set? Are the proportions between different kinds of trajectories the same? For instance, turning left, turning right, or going...

`max_tokens` cannot ensure an exact number of predicted tokens. Sometimes, a model predicts less than `max_tokens `.

Similar problem of Qwen2.5-3B-Instruct with Q2_K on Win laptops: https://github.com/ggml-org/llama.cpp/discussions/12378