ucx icon indicating copy to clipboard operation
ucx copied to clipboard

UCS: stats performance enhancements (contig. allocation)

Open alex--m opened this issue 4 years ago • 3 comments

Signed-off-by: Alex Margolin [email protected]

What

Make both the stats nodes and the counters are more or less contiguous in (virtual) memory.

Why ?

An effort to reduce the average latency of accessing stats, and also to put all the counters in a single buffer so it could be RDMA-ed instead of UDP-ed.

How ?

Both filter and regular stats nodes are allocated from a memory pool (the first chunk should cover common UCX initialization), and the counters are allocated in a flexible array (so there's one buffer containing all of them).

alex--m avatar May 09 '21 15:05 alex--m

Can one of the admins verify this patch?

swx-jenkins3 avatar May 09 '21 15:05 swx-jenkins3

ok to test

yosefe avatar May 09 '21 16:05 yosefe

ok to test

Hi @yosefe ,

Can anyone review this PR?

Regards and thanks, Shuki

shuki-zanyovka avatar Sep 09 '21 12:09 shuki-zanyovka