Add enforce_max_duration setting
This PR introduces a new enforce_max_duration setting to the LoadGen test configuration. This allows users to control whether exceeding max_duration should terminate query issuance early and how minimum query count validation is applied.
Key Changes
• Exposes enforce_max_duration in Python bindings and test settings (default: true).
• IssueQueryController stops early and logs only when enforcement is enabled.
• Results logic: the min_query_count check is skipped when enforcement is disabled.
• Effective settings logging updated to include the new flag.
• Submission checker requires official submissions to enable enforcement.
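The behavior described in the list above can be sketched in plain Python. This is only an illustration of the intended logic, not the LoadGen implementation: `SketchSettings` and the three helper functions are hypothetical names, the field `max_duration_ms` is an assumed name for the duration limit, and treating a value of 0 as "no limit" is an assumption made for this sketch.

```python
from dataclasses import dataclass


@dataclass
class SketchSettings:
    """Hypothetical stand-in for the test settings; not the real LoadGen class."""
    max_duration_ms: int = 0           # 0 means "no limit" in this sketch (assumption)
    min_query_count: int = 100
    enforce_max_duration: bool = True  # default stated in this PR


def should_stop_issuing(settings: SketchSettings, elapsed_ms: int) -> bool:
    """Stop issuing early only if enforcement is on and the limit was exceeded."""
    if not settings.enforce_max_duration:
        return False
    if settings.max_duration_ms <= 0:
        return False
    return elapsed_ms > settings.max_duration_ms


def min_query_count_ok(settings: SketchSettings, queries_issued: int) -> bool:
    """Skip the min_query_count check when enforcement is disabled."""
    if not settings.enforce_max_duration:
        return True
    return queries_issued >= settings.min_query_count


def valid_for_official_submission(settings: SketchSettings) -> bool:
    """Official submissions must keep enforcement enabled."""
    return settings.enforce_max_duration
```

With enforcement disabled, a run that passes `max_duration_ms` keeps issuing queries and is not rejected for a low query count, but the same settings would fail the submission check in this sketch.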
The changes are taken from the mobile_update branch (https://github.com/mlcommons/inference/commits/mobile_update/), which is now outdated and cannot be merged into master without resolving conflicts.
Motivation
Until now, we have maintained this change in a separate branch called mobile_update. Keeping it there makes it difficult to update the LoadGen version, so we want to merge the change into the master branch.
Related issues:
https://github.com/mlcommons/mobile_app_open/pull/798
https://github.com/mlcommons/inference/pull/1621
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅
@pgmpablo157321 @freedomtan Please review this PR.
LGTM