Mustafa Ali

Results: 7 comments by Mustafa Ali

> Some default configuration values may be missing. Add the 'default configuration values ...' part (line 185) of cache.cfg to the other cfg files. It may work. I had the same problem...

Hi @nellie-wu, following up on your comments about matching the sparse throughput described in the public GPU documentation: how can we get the peak throughput from the stats of...

Nope, hoping we get a reply from the contributors.

Sorry, somehow I missed your comment. Yes, I looked at the CUTLASS implementation and it is similar to yours. I like yours because it teaches beginners like me to learn...

Oh cool, I only tried the CUTLASS profiler and couldn't find it. Could you point me to an example of that?

I am also seeing a significant performance difference from the paper, especially with causal. It would be great if a benchmarking script were provided that can reproduce the results from the paper.