Chenning Li
The error is caused by random.choice(): the weights assigned for sampling do not sum to 1 (in fact, they are all zeros). An intuitive fix is to avoid the...
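The comment above is cut off, so the fix it goes on to propose is not known. As a rough sketch of one possible guard, assuming the call in question is NumPy's `numpy.random.choice` (or the equivalent `Generator.choice`) with a `p` argument, something like the following falls back to uniform sampling when the weights are all zeros. The helper name `safe_choice` is hypothetical and not taken from the project.

```python
import numpy as np

def safe_choice(items, weights, rng=None):
    """Pick one item by weight; hypothetical helper, not from the original code."""
    rng = rng if rng is not None else np.random.default_rng()
    w = np.asarray(weights, dtype=float)
    total = w.sum()
    if not np.isfinite(total) or total <= 0:
        # Weights sum to zero (e.g. all zeros) or are not finite:
        # passing them as p would raise, so sample uniformly instead.
        p = None
    else:
        # Normalize so the probabilities sum to 1, as choice() requires.
        p = w / total
    idx = rng.choice(len(items), p=p)
    return items[idx]
```

Whether a uniform fallback is appropriate depends on why the weights became zero in the first place; raising a clearer error at that point may be the better choice in some cases.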
Hi, have you implemented the C version of NanoGPT? It would be really helpful if you could share more details. Thanks.
Additionally, it appears that the functionality to parse the text files Transformer_HybridParallel.txt and Transformer_HybridParallel_Fwd_In_Bckwd.txt is missing.
Thank you for your response. I appreciate the instructions on the Wiki and find them clear. I'm just interested in whether I could obtain the measured traces from your end,...
Understood. I'm currently in need of some traces for transformers and LLAMA involving tens of nodes. Once again, I really appreciate your outstanding work!