Support for s4 and s8 GEMM in Python?
Hi,
CUTLASS already has a Python interface (https://github.com/NVIDIA/cutlass/blob/main/python/README.md), so I am wondering whether there is a plan to support s4 and s8 GEMM in Python. If not, are there any resources I could use to implement them myself?
Thanks!
The CUTLASS Python interface does support s8 GEMMs. Unit tests that show examples of using these are here and here.
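For checking the output of an s8 GEMM, a NumPy reference can help. The sketch below is not CUTLASS code: it assumes the usual convention of int8 inputs with int32 accumulation, and `reference_s8_gemm` is a hypothetical helper name used only for illustration.

```python
import numpy as np

def reference_s8_gemm(a, b, c=None, alpha=1, beta=0):
    """Reference D = alpha * (A @ B) + beta * C for int8 inputs.

    Assumption: accumulation and output are int32, which is a common
    choice for s8 GEMMs; adjust if your kernel uses something else.
    """
    assert a.dtype == np.int8 and b.dtype == np.int8
    # Widen first so that products and partial sums do not wrap around in int8.
    acc = a.astype(np.int32) @ b.astype(np.int32)
    d = alpha * acc
    if c is not None:
        d = d + beta * c.astype(np.int32)
    return d.astype(np.int32)

rng = np.random.default_rng(0)
A = rng.integers(-128, 128, size=(16, 32), dtype=np.int8)
B = rng.integers(-128, 128, size=(32, 8), dtype=np.int8)
D = reference_s8_gemm(A, B)  # compare this against the GPU result
```

Comparing a kernel's output against `D` with `np.array_equal` should be an exact check, since integer GEMM has no rounding error as long as the accumulator does not overflow.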
The CUTLASS Python interface does not currently support s4. You can see this issue for some details related to this: https://github.com/NVIDIA/cutlass/issues/756
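If you experiment with s4 on your own, one early step is representing 4-bit signed values on the host, since NumPy has no int4 dtype. The sketch below packs two values per byte and unpacks them with sign extension. The nibble order (low nibble first) and the helper names `pack_s4` and `unpack_s4` are assumptions for illustration, not something taken from CUTLASS.

```python
import numpy as np

def pack_s4(values):
    """Pack signed 4-bit values in [-8, 7] into bytes, two per byte.

    Assumption: the first value of each pair goes into the low nibble.
    """
    v = np.asarray(values, dtype=np.int8)
    assert v.size % 2 == 0 and v.min() >= -8 and v.max() <= 7
    # Reinterpret as unsigned and keep the low 4 bits (two's complement nibble).
    nib = (v.view(np.uint8) & 0x0F).reshape(-1, 2)
    return (nib[:, 0] | (nib[:, 1] << 4)).astype(np.uint8)

def unpack_s4(packed):
    """Inverse of pack_s4: returns int8 values in [-8, 7]."""
    p = np.asarray(packed, dtype=np.uint8)
    lo = (p & 0x0F).astype(np.int8)
    hi = (p >> 4).astype(np.int8)
    out = np.stack([lo, hi], axis=1).reshape(-1)
    # Sign-extend: nibbles 8..15 represent -8..-1.
    return np.where(out >= 8, out - 16, out).astype(np.int8)

vals = [-8, -1, 0, 7, 3, -4]
packed = pack_s4(vals)
restored = unpack_s4(packed)
```

Unpacking to int8 and running an s8 reference GEMM on the result is one way to produce expected outputs for an s4 kernel.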
Thank you so much for the prompt response! This is helpful.