Jun Zhang

Results 3 comments of Jun Zhang

Same Question

> > > I tried again, this time using the format exactly as shown in the Qwen2.5-VL blog. However, the bbox still shifts upward 😭 > > > > >...

> @jzhang38 @jbaron34 During decoding these 16 tokens can freely attend to each other with no mask between them (they just cannot attend to future token maps like the next...