Jun Zhang
Results: 3 comments of Jun Zhang
Same Question
> > > I tried again, this time using the format exactly as shown in the Qwen2.5-VL blog. However, the bbox still shifts upward 😠 > > > > >...
> @jzhang38 @jbaron34 During decoding, these 16 tokens can freely attend to each other with no mask between them (they just cannot attend to future token maps like the next...
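
For anyone skimming, the attention pattern described in that comment (tokens in the same token map see each other freely, but not later token maps) can be sketched as a block-wise causal mask. This is only an illustrative sketch, not the model's actual code: the function name `block_causal_mask` and the block sizes `[1, 4, 16]` are assumptions made for the example, with 16 taken from the comment.

```python
def block_causal_mask(block_sizes):
    """Return mask[i][j] == True when token i may attend to token j.

    block_sizes is a hypothetical list of token-map sizes, in decoding order.
    """
    # Label every token with the index of the token map (block) it belongs to.
    block_ids = [b for b, size in enumerate(block_sizes) for _ in range(size)]
    n = len(block_ids)
    # A token may attend to its own block and to earlier blocks, never to later ones.
    return [[block_ids[j] <= block_ids[i] for j in range(n)] for i in range(n)]


# Assumed example: three token maps with 1, 4, and 16 tokens.
mask = block_causal_mask([1, 4, 16])
```

With these assumed sizes, the last 16 rows and columns form a fully unmasked square (those tokens attend to each other in both directions), while rows for the earlier token maps are masked for every column belonging to a later map.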