ml-ane-transformers
ml-ane-transformers copied to clipboard
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
Results
3
ml-ane-transformers issues
Sort by
recently updated
recently updated
newest added
re https://github.com/apple/ml-ane-transformers/pull/3#issuecomment-1519099643 I'd already monkeypatched this in my own project in order to use a QA model, so here's a PR happy to make any tweaks required
Hi, I'm attempting to duplicate the pytorch LayerNorm functionality, and the formula that pytorch uses is clearly `(out * weight) + bias`, which does not match the code in `LayerNormANE`....