noob-ctrl
@tgale96 After I run the `dmoe_46m_8gpu.sh` script, the saved model is in the following format, with a `model_optim_rng.pt` in each folder: I want to merge these weights into a...
@tgale96 
@ShinoharaHare Hi, have you solved this problem?
@laixinn I deployed the model service on 2 H20s. After deploying according to the command you showed, an error message was displayed when requesting the API. The error info as...
@laixinn I solved this problem, thanks. In addition, I would like to ask whether there is any comparative experimental data for the Deepseek-R1-FP8 model?