BitNet-Transformers icon indicating copy to clipboard operation
BitNet-Transformers copied to clipboard

How long does inference on CPU cost?

Open ghost opened this issue 1 year ago • 0 comments

Training may be on CPU, but deployment has to be on CPU for high scalability.

ghost avatar Apr 05 '24 05:04 ghost