Add optimize-edge-ai-power-performance.md#414
Conversation
Signed-off-by: Pratik Agrawal <pratik.agarwal@aveva.com>
|
Hi @pratik98, Many thanks for submitting this pattern—the edge AI power-performance problem is exactly the kind of practical, high-impact challenge our community needs to address. The problem statement is really clear, and you've identified power-aware workload scheduling as a solid core solution. I can see the work that's gone into this. I wanted to feedback on a few things that will help get this over the line: Category and Frontmatter The pattern should be recategorised as Operations (runtime scheduling and deployment-time decisions). Also, could you confirm your LinkedIn profile URL for the Scope: Inference vs. Training The title and problem statement cover both inference and training, but the solution gives almost all the real estate to inference scheduling—workload deferral, battery-level policies, DVFS. Training only gets one bullet: "Schedule on-device model updates and federated learning rounds during charging periods." On-device training (federated learning, incremental updates) has quite different constraints and tooling from inference. I'd suggest one of two paths:
Which direction feels right to you? Cost Impact Section This is a required section in our template, but it's currently missing. I'd like you to add a section explaining how this pattern affects operational spend. Consider angles like:
Be honest about trade-offs rather than painting an all-positive picture. This helps implementers understand the full business case. Monitoring and Feedback—More Concrete Guidance The Monitoring subsection is quite sparse. "Track power-performance metrics," "implement feedback loops," and "user-configurable policies" read more like a checklist than actionable guidance. Could you flesh this out with examples? For instance:
Considerations—Hardware Assumptions Your assumptions note that "edge devices have mechanisms to monitor power consumption, battery level, and thermal state." That's true for modern smartphones and many IoT platforms, but some older or specialist hardware lacks these sensors. Worth adding a consideration: what do implementers do if power monitoring isn't available? Is there a fallback? References Solid selection overall. If there's a real-world implementation or case study (e.g. a mobile OS or IoT framework doing this), that would ground the pattern nicely. Next Steps Could you please:
Don't worry about getting every detail perfect on first pass—this is exactly the kind of feedback we iterate on. I'm happy to help you shape any of these sections further if you'd like to talk through them. Feel free to ask if anything needs clarification. Many thanks again for contributing. This is valuable work. Note: This initial review was generated using a Claude AI skill for pattern evaluation. The recommendations reflect our review standards, but a team member will follow up with any further feedback. |
Title
Optimize power-performance for edge AI inference and training
Version
Designation of iteration on the pattern. This will initially be assigned by the patterns working group
Submitted By
Pratik Agrawal
Published Date
The date this version of the pattern is published. This will be provided by the patterns working group upon approval
Intent
Edge AI workloads for inference and training consume significant power on resource-constrained devices. Optimizing power-performance characteristics through Power-Aware Workload Scheduling reduces energy consumption and extends battery life while maintaining acceptable accuracy and latency.
Tags
Pre-defined list of tags which might apply to the pattern (e.g. Cloud, Web)
Problem
Edge devices running AI inference and training workloads face unique power constraints compared to cloud or data center deployments. These devices—including smartphones, IoT sensors, embedded systems, and industrial edge computers—often operate on battery power or have strict thermal limits. Running AI models continuously at peak performance can rapidly drain batteries, cause thermal throttling, and reduce device lifespan.
Traditional AI deployment strategies optimize for accuracy and latency without considering power consumption as a first-class metric. This approach is unsustainable for edge deployments where:
The challenge is balancing the competing demands of inference latency and power consumption while adapting to dynamic conditions like battery level, thermal state, and workload priority.
Solution
Implement power-aware AI deployment strategies for edge devices that dynamically optimize the trade-off between performance and energy consumption:
1. Power-Aware Workload Scheduling
2. Monitoring and Feedback
SCI Impact
How will this pattern affect an SCI score of an application and why
Optimizing power-performance for edge AI impacts SCI as follows:
E: Reduced energy consumption through power-aware workload scheduling and continuous monitoring directly decreases operational energy. For battery-powered devices, improved energy efficiency extends battery life and reduces charging frequency, further lowering total energy consumption.I: Power-aware scheduling can leverage time-shifting to run intensive workloads during periods of cleaner grid energy. Edge devices that charge during off-peak hours or use renewable energy sources can reduce their carbon intensity.M: Extending battery life through efficient power management reduces battery replacement frequency, decreasing embodied carbon. Lower thermal stress from optimized power consumption extends overall device lifespan, reducing the need for hardware replacements and their associated embodied carbon.Assumptions
Pros & Cons
Discussion section for pros and cons of this pattern