Add optional energy efficiency reporting schema for inference benchmarks#2587
Add optional energy efficiency reporting schema for inference benchmarks#2587hongping-zh wants to merge 1 commit into
Conversation
|
MLCommons CLA bot: |
|
recheck |
1 similar comment
|
recheck |
|
Quick update: I have contacted support@mlcommons.org to resolve CLA mapping for GitHub account "hongping-zh". Waiting for support-side refresh, then I will run |
|
recheck |
|
Has there been a discussion in the inference WG regarding if solely measuring the accelerator power consumption is a useful (and non-misleading) reporting metric? High-performance inference requires coordination, processing and data movement tasks to be performed on the CPUs, and system DRAM and network usage also consumes significant power. I can see how this data would be valuable to augment entire-system power measurements, but I have concerns about it being presented on its own. Also, it is important to ensure that measurements are taken of cumulative power draw, rather than instantaneous power draw, since the latter can easily result in misleading results. Careful rules (and verified implementations) are needed to prevent power measurements from easily being gamed. |
|
Thank you, David — this is an important concern, and I agree. My intent is not for accelerator-only measurements to replace whole-system power or energy measurements. For high-performance inference, CPU coordination, host DRAM, networking, storage, and data movement can all be significant, and accelerator-only numbers would be misleading if presented as total system energy efficiency. A better framing for this PR is therefore as an optional accelerator-level energy breakdown / supplementary reporting schema. The intended use is to augment whole-system measurements where available, and to provide attribution/debugging information about the accelerator-side behavior of a run, rather than to define a standalone system-level efficiency metric. I also agree on cumulative energy. The schema should define fields such as I can update the README/schema wording to make this explicit, for example:
Would this framing address your concern, or would you prefer that the fields be renamed more explicitly as accelerator-level fields to avoid ambiguity? |
Summary
This PR proposes an optional energy-efficiency reporting schema for MLPerf Inference results.
It adds a standalone schema package under
energy-reporting/and does not modify existing benchmark logic, submission flow, or current compliance requirements.Motivation
During multi-round technical discussion in Issue #2558, several design directions converged:
energy_per_token_joulesfor LLM workloadsenergy_per_query_joulesfor CV workloadsnvml,dcgm,rocm_smi,rapl,external_analyzer)This PR translates those discussion outcomes into a concrete, reviewable schema artifact.
Scope of this PR (intentionally minimal)
This PR includes only:
energy-reporting/mlperf_energy_schema_v6.json(JSON Schema, draft 2020-12)
energy-reporting/README.md(field definitions, examples, validation-rule summary)
This PR does not include:
Compatibility / Impact
Validation
Schema and examples were validated locally:
(Validation logs can be provided if reviewers request them.)
Request for Comments (RFC)
This PR is submitted as an RFC to collect Working Group feedback on field design and integration direction before any broader implementation steps.
Feedback is especially welcome on:
cc @JiwaniZakir @arav-agarwal2
References