CUDA-Programming icon indicating copy to clipboard operation
CUDA-Programming copied to clipboard

为什么 d_NL[(count++) * N + n1] 是合并访问的?

Open FeiGSSS opened this issue 9 months ago • 1 comments

Image Image

老师您好,

我不明白为什么 d_NL[(count++) * N + n1] 一定能保证合并内存访问? 每个 n1 的 count 增长不是同步的,也就是说每个线程对于 d_NL 的访存地址可能差距好几个 N。那为什么这样能实现合并访问呢? 还是说,只能保证 count = 0 时是合并的?

FeiGSSS avatar Apr 28 '25 09:04 FeiGSSS

谢谢,您发给我的邮件已经收到,我会尽快处理。Thank you,the email you sent me has been received and I will handle it as soon as possible.王景博fever wong

fever-Wong avatar Apr 28 '25 09:04 fever-Wong