Alex2025Job

Results 3 issues of Alex2025Job

Maybe there is an issue by loading the pre-trained model (line 61 ckpt_utils.py) - size mismatch for token_embed.weight: copying a param with shape torch.Size([88, 256]) from checkpoint, the shape in...

Hi, just have a question regarding the training speed using different GPUs. We have tested the training speed with A100 and H100 (single GPU for test) using the same training...

Hi, just a short question regarding the API for streaming mode. do you also provide easier APIs for streaming mode? Actually only two API is needed for me, 1. send...

request