search-benchmark-game issues

Use same BM25 k1/b parameters across engines.

1

The k1 and b parameters of BM25 can influence what hits may be dynamically pruned and thus performance numbers, so it would be good to use the same values across...

jpountz

Recommend scoring hits with BM25(k1=0.9,b=0.4).

4

Currently different engines use different parameters for BM25, e.g. Tantivy and Lucene use (k1=1.2,b=0.75) while PISA uses (k1=0.9,b=0.4). Robertson et al. had initially suggested that 1.2/0.75 would make good defaults...

jpountz
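
To make the effect of the two parameter sets mentioned above concrete, here is a minimal sketch of a per-term BM25 score in a common textbook form. It is not taken from any of the benchmarked engines: the function name, the argument names, and the exact IDF variant are assumptions, and real engines differ in details such as how document lengths are encoded and whether the constant (k1 + 1) factor is kept.

```python
import math

def bm25_term_score(tf, doc_len, avg_doc_len, doc_count, doc_freq, k1, b):
    """Score one term in one document with a textbook BM25 formula (hypothetical helper)."""
    # IDF variant that stays positive even for very frequent terms (assumed here).
    idf = math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))
    # Length normalization: b=0 ignores document length, b=1 normalizes fully.
    norm = 1 - b + b * doc_len / avg_doc_len
    # Term-frequency saturation: k1 controls how quickly extra occurrences stop mattering.
    return idf * tf * (k1 + 1) / (tf + k1 * norm)

def bm25_upper_bound(doc_count, doc_freq, k1):
    """Limit of the score as tf grows without bound; a simple per-term score ceiling."""
    idf = math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))
    return idf * (k1 + 1)

# Made-up statistics for illustration: the same document scored under both parameter sets.
stats = dict(tf=3, doc_len=150, avg_doc_len=100, doc_count=1000, doc_freq=10)
score_default = bm25_term_score(**stats, k1=1.2, b=0.75)
score_alt = bm25_term_score(**stats, k1=0.9, b=0.4)
print(score_default, score_alt)
```

Because both the individual scores and the per-term ceiling depend on k1 and b, dynamic pruning algorithms that skip documents based on score upper bounds can skip different amounts of work under different parameters, which is why comparing engines with mismatched values can skew the numbers.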

PISA should compute top hits for task TOP_10_COUNT

It seems to me that the pisa-0.8.2 engine forces evaluation of all hits with the TOP_10_COUNT task, but it doesn't collect them into a priority queue as I would expect....

jpountz
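
As a rough illustration of the behavior the issue above expects, the following sketch counts every hit while keeping only the 10 best in a priority queue. It assumes that TOP_10_COUNT means "return the total hit count plus the top 10 hits by score"; the function name and the `(doc_id, score)` input shape are hypothetical, and this is not PISA's or the benchmark's actual code.

```python
import heapq

def top_k_with_count(scored_hits, k=10):
    """Return (total hit count, top-k hits by descending score).

    scored_hits is any iterable of (doc_id, score) pairs (assumed shape).
    """
    heap = []  # min-heap of (score, doc_id); heap[0] is the weakest retained hit
    count = 0
    for doc_id, score in scored_hits:
        count += 1  # every hit is counted, even if it never enters the heap
        if len(heap) < k:
            heapq.heappush(heap, (score, doc_id))
        elif score > heap[0][0]:
            # The new hit beats the weakest retained hit, so swap it in.
            heapq.heapreplace(heap, (score, doc_id))
    # Best score first; ties broken by ascending doc_id for a stable order.
    ranked = sorted(heap, key=lambda hit: (-hit[0], hit[1]))
    return count, [(doc_id, score) for score, doc_id in ranked]

# Made-up hits for illustration.
hits = [(doc_id, float(doc_id % 7)) for doc_id in range(100)]
total, top = top_k_with_count(hits)
print(total, top)
```

Since the count requires visiting every matching document anyway, the priority queue adds only a comparison per hit in the common case, so collecting the top hits alongside the count is cheap relative to the scoring itself.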

add range queries

Add a new field id_num, and add range queries over id_num. ![Screenshot_20221229_120230](https://user-images.githubusercontent.com/1109503/209901905-41804126-71bd-4018-8e76-531265547215.png)

PSeitz