Fast-LLM icon indicating copy to clipboard operation
Fast-LLM copied to clipboard

[hybrid_dev] Hybrid dev branch

Open RaymondLi0 opened this issue 6 months ago • 0 comments

✨ Description

For tracking: Hybrid-SSM dev branch

Outstanding issues

  • Missing preprocessing when flash-attn is disabled for vision-encoder. -> KeyError: 'image_encoder_attention_mask' (is an issue to run the tests)
  • when vision_encoder.image_break_token is not set, shape-mismatch error in the Multimodal-embedding layer.
  • debug-layer-outputs/gradients hangs.

RaymondLi0 avatar Aug 07 '25 17:08 RaymondLi0