Andrew Palumbo
Andrew Palumbo
Tests pass on my system: ``` Mahout JVM Sparse multiplication time: 1914 ms. Mahout JCuda Sparse multiplication time: 195 ms. - sparse mmul at geometry of 1000 x 1000 %*%...
@nsakharnykh @rawkintrevo I intend to have `dense` hammered out on Sunday.
@nsakharnykh , @rawkintrevo, I ran out of time tonight to finish out `dense %*% dense` and `dense %x% sparse`; went down a rabbit hole woth the NVIDIA `c` api docs...
@nsakharnykh I have my MAHOUT-1974 branch that is almost complete with dense, etc (less the column major issues. We'd discussed just making a PR against this. but It may be...
@nsakharnykh https://github.com/andrewpalumbo/mahout/tree/MAHOUT-1974/cuda ^^ P.S. this is still WIP so there's alot of garbage in it..
Great, thanks. I figured you were there, and very busy, I'll keep working on my end, and there should be no (or few conflicts).. no rush, since my branch is...
@rawkintrevo I asked @nsakharnykh to just go ahead and push this to the mahout/CUDA branch, since he's already up at GTC, and we're pushing this through as quickly as possible,...
need to rebase
@nsakharnykh @pat @rawkintrevo FYI `Sparse Sparse` vlaues are correct, `dense dense` is implemented but untested.
@nsakharnykh sorry for the state of this branch, I tend to commit a lot on this project, and leave a lot of [WIP]s in when jumping around to other branches....