Sachin Gururangan

Results 14 comments of Sachin Gururangan

Another area i'm interested in is type-level markers encoding 'squareness' and 'singularity'. Could be as simple as equality constraints on the indices of Dimension. Singularity may be out of scope...

Hi there! Sorry for the delay. As @insanitybit mentioned, this crate was mostly just a proof of concept, haven't really had time to maintain it. Pull requests welcome for migrating...

We are working on releasing intermediate checkpoints and will let you know

just a few datapoints from OpenLM, with default hparams: we get ~2.5K tokens/sec/GPU on 256 A100s for OpenLM-7B, and ~9.5K tokens/sec/GPU on 128 A100s for OpenLM-1B. ~11.5K tokens/sec/GPU on 32...