Dilip Sequeira
The format looks reasonable (I would slightly prefer colors to postfix characters, but I expect there are UI accessibility concerns). A larger problem is having rules for transforming from what...
I agree inference is rarely the last pipeline step. However, if your accelerator is a general-purpose programmable device, it's realistic for it to run post-processing too - for example,...
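To sketch the kind of thing I have in mind (hypothetical PyTorch-style code, not taken from any particular submission): a post-processing step such as a per-voxel argmax can stay on the device, so only the compact label map crosses back to the host.

```python
import torch

def segment(model, volume, device="cuda"):
    # Hypothetical sketch: run both inference and the argmax
    # post-processing step on the accelerator, so only the compact
    # per-voxel label map is copied back to the host.
    with torch.no_grad():
        logits = model(volume.to(device))     # (N, C, D, H, W) logits
        labels = torch.argmax(logits, dim=1)  # per-voxel class ids
    return labels.to("cpu", torch.uint8)      # small host transfer
```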
The timeline for getting that into 1.1 seems quite short, given there's no proposal yet.
And regarding 3DUNet not being in Server... that's correct, but latency is still relevant for 3DUNet in Edge Single Stream.
It's significant only for benchmarks where the output size is large. Today, that's only segmentation.
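To put rough numbers on that (illustrative tensor shapes, not the official benchmark dimensions): a dense volumetric segmentation output is several orders of magnitude larger than a classification output.

```python
# Back-of-the-envelope output sizes; shapes are illustrative only.

def output_bytes(shape, bytes_per_elem=4):
    """Bytes in a dense fp32 output tensor of the given shape."""
    n = 1
    for dim in shape:
        n *= dim
    return n * bytes_per_elem

print(output_bytes((1000,)))             # classification logits: ~4 KB
print(output_bytes((3, 128, 128, 128)))  # 3D segmentation logits: ~25 MB
```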
I'm sure we can, but what are we looking for, and how would we act on it? MLPerf has, historically, set some fairly arbitrary bounds on the timed portion of...
That would be my preference regardless of this question. If we do that, does that mean we should assume the answer is (1) above?
The hyperparameter question is somewhat off-topic here; I've opened a new issue: https://github.com/mlcommons/inference_policies/issues/216
I agree. This class of system-programming optimizations applies to generic (broader-than-ML) accelerators, and it would be counterproductive to exclude them.
Agreed on all counts.