ucx
UCP/PROTO: Calculate all stages perf during the final proto initialization
What
This is the first stage of the MULTI mode performance calculation rework. It adds CTRL message overheads to the performance estimates of RNDV protocols, and removes performance calculation from the RTS/RTR protocols, which now rely fully on the remote_proto_lookup output.
Why?
Currently, protocols such as rndv/get/zcopy and rndv/put/zcopy take only the RNDV + ACK stages into account when estimating performance. For a more precise calculation, the CTRL message overhead should contribute to the protocol performance too, since it includes both sender and receiver overhead.
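To illustrate the difference, here is a minimal sketch of summing per-stage overheads with and without the CTRL stage. It is not the UCX implementation: `stage_cost_t`, `stages_total_overhead` and the cost values used with them are hypothetical, and the real performance model tracks more than two overhead terms.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-stage cost, for illustration only (not a UCX type). */
typedef struct {
    const char *name;          /* stage label, e.g. "CTRL", "RNDV", "ACK" */
    double      send_overhead; /* sender-side cost, arbitrary units */
    double      recv_overhead; /* receiver-side cost, arbitrary units */
} stage_cost_t;

/* Sum sender and receiver overheads over all given stages. */
static double stages_total_overhead(const stage_cost_t *stages, size_t count)
{
    double total = 0.0;
    size_t i;

    assert((stages != NULL) || (count == 0));

    for (i = 0; i < count; ++i) {
        total += stages[i].send_overhead + stages[i].recv_overhead;
    }

    return total;
}
```

With made-up stages `{CTRL, RNDV, ACK}`, passing only the last two entries mimics the current RNDV + ACK estimate, while passing all three also accounts for the CTRL message cost on both sides.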