katmagic
Results: 1 comment of katmagic
rransom says: word-slicing the algorithm using SSE2, or perhaps even bit-slicing it, would be a much bigger win than just not computing half of the output bits.
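For readers unfamiliar with the term, here is a minimal sketch of what bit-slicing means. The comment does not say which algorithm is being discussed, so this is only a toy illustration on 32-bit addition and rotation, not that algorithm. All function names are hypothetical. The idea is to transpose the data so that slice i holds bit i of every independent instance; a single bitwise operation then advances all instances at once. Word-slicing is the coarser variant of the same layout, where each SIMD lane (for example, each 32-bit lane of an SSE2 register) holds the same word from a different instance.

```python
def to_bitsliced(values, width):
    """Transpose: slice i collects bit i of every value (instance j -> bit j)."""
    slices = [0] * width
    for j, v in enumerate(values):
        for i in range(width):
            slices[i] |= ((v >> i) & 1) << j
    return slices


def from_bitsliced(slices, count):
    """Inverse transpose: rebuild the per-instance integers."""
    values = [0] * count
    for i, s in enumerate(slices):
        for j in range(count):
            values[j] |= ((s >> j) & 1) << i
    return values


def bitsliced_add(a, b):
    """Ripple-carry addition modulo 2**width, on all instances in parallel."""
    carry = 0
    out = []
    for x, y in zip(a, b):  # least significant slice first
        out.append(x ^ y ^ carry)
        carry = (x & y) | (carry & (x ^ y))
    return out


def bitsliced_rotl(slices, r):
    """Rotate left by r bits: in sliced form this is only a re-indexing."""
    w = len(slices)
    return [slices[(i - r) % w] for i in range(w)]


# Four independent 32-bit instances processed together.
xs = [0xFFFFFFFF, 1, 0x80000000, 12345]
ys = [1, 2, 0x80000000, 67890]
sums = from_bitsliced(
    bitsliced_add(to_bitsliced(xs, 32), to_bitsliced(ys, 32)), len(xs)
)
assert sums == [(x + y) & 0xFFFFFFFF for x, y in zip(xs, ys)]
```

In this layout rotations cost nothing, while additions become a chain of per-bit logic, so whether bit-slicing or word-slicing pays off depends on the mix of operations in the algorithm in question.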