torchbench icon indicating copy to clipboard operation
torchbench copied to clipboard

TorchMetrics for higher reproducibility !

Open tchaton opened this issue 4 years ago • 2 comments

Dear @RJT1990,

Awesome project there !!!

I looked internally and I have seem metrics being manually implemented without any testing.

This makes me pretty scary in term of reproducibility and accurate reporting.

I think you should consider using https://github.com/PytorchLightning/metrics as the tool for benchmarking the runs.

There are extremely well tested metrics which works automatically in distributed settings and plain PyTorch.

Best, T.C

tchaton avatar May 05 '21 08:05 tchaton

I smell @tchaton is volunteering to make it for you guys :rabbit:

Borda avatar May 05 '21 08:05 Borda

Heya,

As discussed yesterday, we are not maintaining sotabench (and associated tools) at this stage, and our focus is elsewhere - particularly on lighter forms of capturing results for the main Papers with Code website.

On testing: this was an experimental product. As such, the emphasis was on extracting user signal rather than committing wholly to a particular implementation. I.e. "manual implementation" was sufficient for our objectives at the time :).

Thanks!

Ross

RJT1990 avatar May 05 '21 10:05 RJT1990