akamaster
akamaster
**Chapter number or note title:** 10. Maximum Flows & MInimum Cuts **Page number:** 327 **Error description:** footnote is not complete, current text is the following: > I learned this story...
Would you please share speed up ratios and final ranks on ResNet20 (CIFAR) obtained in Coordinating Filters for Faster Deep Neural Networks paper?
**Describe the bug** DeepSpeed(DS) optimized bloom model inference produces incorrect logits hurting overall model accuracy. Numerical differences on final logits for the phrase "This is test" when compared to pure...
**Describe the bug** DeepSpeed optimized OPT inference produces garbage if using HFOPTLayerPolicy for kernel injection. Below are the results of l2 and l1 difference when HFOPTLayerPolicy is used vs. when...
**Describe the bug** DeepSpeed(DS) optimized GPT-NEOX-20B model produces incorrect logits and loss, which hurts overall model accuracy. Numerical differences on final logits for a test phrase (see code) is as...