Clay
Results
2
issues of
Clay
It seems to me that training with the sparse model(`model=model.to_sparse()`) costs way more time than dense model. I have tried the code in 2020 and 2021, but the results are...
Firstly, thank you for the amazing work on this project. I am very interested in using ThunderKittens with my AMD GPU, specifically the W7900. Could you please let me know...