JetMoE icon indicating copy to clipboard operation
JetMoE copied to clipboard

Pretraining dataset and code request

Open hitalex opened this issue 1 year ago • 5 comments

Will the pretraining datasets and corresponding code be open-sourced?

Thanks!

hitalex avatar Apr 10 '24 03:04 hitalex

Hi, thanks for the great work. I'd also be interested particularly in training code or at least if you can share some multi-node settings details. What tech did you use for parallelization across GPU nodes?

tranhd95 avatar Apr 10 '24 20:04 tranhd95

Same. Looking forward to the open-source training code and details.

liqiangniu avatar Apr 12 '24 02:04 liqiangniu

Thanks for the amazing work and the paper. Really would love to explore your training code.

shamanez avatar Apr 14 '24 20:04 shamanez

+1

geronimi73 avatar Apr 15 '24 10:04 geronimi73

https://huggingface.co/jetmoe/jetmoe-8b/discussions/5

Zengyi-Qin avatar Apr 23 '24 02:04 Zengyi-Qin