Reduce device memory usage for CAGRA's graph optimization process (reverse graph creation)

Open anaruse opened this issue 10 months ago • 5 comments

CAGRA improves search accuracy by merging the forward and reverse graphs, so a reverse graph is created as part of the graph optimization process. Currently, the GPU is used to create the reverse graph at high speed, but when building a graph for a huge dataset, there may not be enough device memory to create it.

As a countermeasure for insufficient device memory, this PR adds an implementation that creates the reverse graph on the CPU.
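For readers unfamiliar with the idea, here is a minimal sketch of host-side reverse graph creation. It is not the code in this PR: the function name `build_reverse_graph`, the `max_degree` cap, and the rank-by-rank visiting order are assumptions made for illustration only. It assumes the forward graph is an adjacency list in host memory with each node's neighbors ordered by rank.

```python
def build_reverse_graph(forward, max_degree):
    """Build a reverse graph on the host (illustrative sketch, not cuVS code).

    forward[u] lists the neighbors of node u, ordered by rank.
    The result reverse[v] lists nodes u that have an edge u -> v,
    truncated to at most max_degree entries (hypothetical cap).
    """
    num_nodes = len(forward)
    reverse = [[] for _ in range(num_nodes)]
    degree = max((len(nbrs) for nbrs in forward), default=0)

    # Visit edges rank by rank so that each node's best-ranked
    # forward edges claim reverse slots first (an assumption here).
    for rank in range(degree):
        for u, nbrs in enumerate(forward):
            if rank >= len(nbrs):
                continue
            v = nbrs[rank]
            if len(reverse[v]) < max_degree:
                reverse[v].append(u)
    return reverse


# Tiny example: 4 nodes, forward out-degree 2.
forward = [[1, 2], [2, 0], [0, 3], [2, 1]]
reverse = build_reverse_graph(forward, max_degree=2)
print(reverse)
```

In this sketch the whole structure lives in host memory, so its size is bounded by system RAM rather than device memory, at the cost of the speed the GPU path provides.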

anaruse avatar Apr 21 '25 12:04 anaruse

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

copy-pr-bot[bot] avatar Apr 21 '25 12:04 copy-pr-bot[bot]

/ok to test bfc45cd

cjnolet avatar Apr 22 '25 23:04 cjnolet

/ok to test b0f648b

cjnolet avatar Apr 24 '25 16:04 cjnolet

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

copy-pr-bot[bot] avatar May 27 '25 17:05 copy-pr-bot[bot]

Thanks for the review, Tamas. I think I've addressed everything you pointed out. Could you double-check?

anaruse avatar May 29 '25 10:05 anaruse