Chang Liu

Results: 32 comments by Chang Liu

Add-ons:
- [ ] Some P0 cases multi-gpu refactor: https://github.com/dmlc/dgl/issues/4409
- [ ] Some P0 cases link prediction refactor: https://github.com/dmlc/dgl/issues/4410
- [ ] Refactor RGAT example: https://github.com/dmlc/dgl/issues/4411

> Therefore, I think putting the example in either examples/pytorch/gin or examples/pytorch/ogb will make it hard for users to find it. I think we should probably create a folder examples/pytorch/multigpu/ and...

Given the small size of this problem (GINDataset/IMDBBINARY) and the very small number of epochs (5) in use, you wouldn't be able to observe the benefits of multi-processing and multi-GPU runs. The...

> @BarclayII @chang-l Do we have examples showing perf benefit of using multi-gpus?

I believe the current multi-GPU examples, such as graphsage and rgcn, can show a decent speed-up by using more GPUs....

@yaox12 Thank you for sharing your code. I tested it on my side (A5000, CUDA 11.7), but I observed different results:

| feat dim | fp32 (ms) | fp16 (ms) |
...
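(The benchmark script shared by @yaox12 is not reproduced in this thread. As a hedged illustration only, the sketch below shows how such fp32-vs-fp16 GPU timings are commonly collected with CUDA events; the matmul workload, tensor shapes, and `feat_dim` value are assumptions, not the actual benchmark.)

```python
import torch

def time_op(fn, warmup=10, iters=100):
    # Warm up to exclude one-time kernel selection / allocator effects.
    for _ in range(warmup):
        fn()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn()
    end.record()
    torch.cuda.synchronize()  # wait for all queued kernels to finish
    return start.elapsed_time(end) / iters  # average milliseconds per call

feat_dim = 256  # hypothetical feature dimension
x32 = torch.randn(10000, feat_dim, device="cuda")
w32 = torch.randn(feat_dim, feat_dim, device="cuda")
x16, w16 = x32.half(), w32.half()

print("fp32:", time_op(lambda: x32 @ w32), "ms")
print("fp16:", time_op(lambda: x16 @ w16), "ms")
```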

> Is "examples/pytorch/rgcn/entity_sample_multi_gpu.py" ready for review? No, not yet. Thank you for your review. I will address tomorrow.

`entity_sample_multi_gpu.py` is ready for review, with changes similar to `entity_sample.py`. Note that I replaced the host-side collective communication module (implemented with `mp.queue.put()/get()` + `gc.collect()`) with device-side collective communication using NCCL (`dist.reduce`). Updated PR description...
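(The actual change lives in the PR. Purely as a rough sketch of the pattern described above, i.e. aggregating a per-rank tensor on-device with `dist.reduce` instead of shuttling it through `mp.Queue` on the host, assuming a process group has already been initialized with the `nccl` backend; the `reduce_loss` helper name is hypothetical.)

```python
import torch
import torch.distributed as dist

def reduce_loss(loss: torch.Tensor, dst: int = 0) -> torch.Tensor:
    """Sum a per-rank scalar across GPUs via NCCL and average on `dst`.

    Replaces the host-side pattern of pushing tensors through
    mp.Queue.put()/get() (plus gc.collect()) with a single on-device
    collective, avoiding device-to-host copies entirely.
    """
    buf = loss.detach().clone()       # don't mutate the autograd graph
    dist.reduce(buf, dst=dst, op=dist.ReduceOp.SUM)
    if dist.get_rank() == dst:
        buf /= dist.get_world_size()  # only dst holds the valid result
    return buf
```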

> I calculated mean and std based on your tables. You can put them in the PR description instead. Why did you not include the results for AM with entity_sample.py and...

@mufeili I synced up to master. I think it is ready to merge once CI passes.

@jermainewang In fact, this issue has not been resolved because of outdated DGL dataloader APIs. It seems `train_sampling.py` relies on at least two features that have since been deprecated: (1) Set...
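(The list of deprecated features is truncated above, so they are not restated here. For context, the sketch below shows the newer sampling interface that recent DGL releases moved to, `dgl.dataloading.DataLoader` with `NeighborSampler`; the random graph, train-node IDs, fanouts, and batch size are placeholders, not values from `train_sampling.py`.)

```python
import dgl
import torch

# Placeholder graph and training-node IDs for illustration only.
g = dgl.rand_graph(1000, 5000)
train_nids = torch.arange(100)

# Sample 10 neighbors for the first layer and 25 for the second.
sampler = dgl.dataloading.NeighborSampler([10, 25])
dataloader = dgl.dataloading.DataLoader(
    g, train_nids, sampler,
    batch_size=32, shuffle=True, drop_last=False, num_workers=0,
)

for input_nodes, output_nodes, blocks in dataloader:
    pass  # run the model forward/backward on the sampled blocks
```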