BLASX
How are GPU tasks handled by CPU threads?
This might be a naive question. The paper mentions that a GPU task can be bound to a CPU thread. Are there any references that discuss this in more detail, or keywords I should search for? Is this related to multi-GPU task allocation?
Thank you!!!
Also, why do we still need the CPU here? I assumed we were just combining several GPUs to do the computation.
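
To make the question concrete, here is a minimal Python sketch of what I imagine "a GPU task bound to a CPU thread" might look like. This is only a guess at the general pattern, not BLASX's actual implementation: the GPUs are simulated (the "kernel" is a plain multiplication), and `run_on_device` and `run_tasks` are hypothetical names made up for illustration. Each CPU thread owns one device ID and pulls tasks from a shared queue.

```python
import queue
import threading


def run_on_device(device_id, tasks, results):
    """Host (CPU) thread that drives one simulated GPU (hypothetical helper)."""
    while True:
        try:
            task_id, a, b = tasks.get_nowait()
        except queue.Empty:
            return  # no work left for this device
        # Assumption: in a real runtime, this is where the host thread would
        # select its device, copy inputs over, launch the kernel, and wait.
        # Here the "kernel" is just a multiplication done on the CPU.
        results.put((task_id, device_id, a * b))


def run_tasks(task_list, num_devices=2):
    """Start one host thread per simulated device and collect the results."""
    tasks = queue.Queue()
    for task in task_list:
        tasks.put(task)
    results = queue.Queue()

    workers = [
        threading.Thread(target=run_on_device, args=(d, tasks, results))
        for d in range(num_devices)
    ]
    for w in workers:
        w.start()
    for w in workers:
        w.join()

    # Map each task ID to (device that ran it, computed value).
    out = {}
    while not results.empty():
        task_id, device_id, value = results.get()
        out[task_id] = (device_id, value)
    return out


if __name__ == "__main__":
    for task_id, (device_id, value) in sorted(
        run_tasks([(0, 2, 3), (1, 4, 5), (2, 6, 7)]).items()
    ):
        print(f"task {task_id} ran on device {device_id}: {value}")
```

If this picture is roughly right, the CPU threads are there to feed work to the GPUs and collect results rather than to do the computation themselves, but I would appreciate confirmation.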