map_benchmark icon indicating copy to clipboard operation
map_benchmark copied to clipboard

Exclude the time of inserts from the time of finds

Open renzibei opened this issue 3 years ago • 1 comments

The timing part of the current find has some issues to discuss. The time measured now includes the time used for inserts. The hash table is slowly expanded to its final size during the measurement. This brings two problems:

  1. The time we measured is not the time for find function. It's the time for find and insert. And when the number of elements in the hash table is larger, the ratio of the number of insert operations to the number of find operations is larger. So this problem is especially noticeable when the number of elements is large (e.g. RandomFind_500000).
  2. The time of RandomFind_N is not the time measured on a hash table of size N. It's the average time for all hash tables of size 1 to size N. This makes the results less informative for users who want to know the performance of a hash table of size N.

In summary, I think the method of measuring time in the find section should be improved.

renzibei avatar Jul 07 '22 05:07 renzibei

I think it's a good ideal to improve.

ktprime avatar Jul 07 '22 09:07 ktprime