Maxime Alvarez

Results 12 comments of Maxime Alvarez

Looks like the gradient_descent.py script expects a tabular CSV. You have to remove the header (the first line of Advertising.csv), remove the ids (the first column of Advertising.csv), and then...

To install all the dependencies: `pip install matplotlib numpy scikit-learn tensorflow keras umap-learn` Please note that `tensorflow` is not mandatory but you need a backend for Keras. Tensorflow is the...

> > apparently it needs 8 GPU (80GB vram) to run > > Say less. That's 3x 4090s. No, it's not 80GB VRAM total, it's 8 GPU with 80 GB...

> > How about running it on TPU (v4 or v5)? > > How to get TPU for most people? Google Colab Pro+

> > > > apparently it needs 8 GPU (80GB vram) to run > > > > > > > > > Say less. That's 3x 4090s. > > >...

> I was a bit surprised to see jax being used with Grok-1. If you're going to rebuild everything from scratch, JAX is an excellent choice. JAX was made to...

> Is there a scientific paper accompanying this release? I've searched but couldn't find one. I find it odd that the weights would be released but not the research. Because...

That's a technical report at best though.