# The IoT Energy Challenge: A Software Perspective

Kyriakos Georgiou, Samuel Xavier-de-Souza, and Kerstin Eder

Abstract—The Internet of Things (IoT) sparks a whole new world of embedded applications. Most of these applications are based on deeply embedded systems that have to operate on limited or unreliable sources of energy, such as batteries or energy harvesters. Meeting the energy requirements for such applications is a hard challenge, which threatens the future growth of the IoT. Software has the ultimate control over hardware. Therefore, its role is significant in optimizing the energy consumption of a system. Currently, programmers have no feedback on how their software affects the energy consumption of a system. Such feedback can be enabled by energy transparency, a concept that makes a program's energy consumption visible, from hardware to software. This letter discusses the need for energy transparency in software development and emphasizes on how such transparency can be realized to help tackle the IoT energy challenge.

Index Terms—Energy efficiency, Internet of Things (IoT), software metrics.

### I. Introduction

THE INTERNET of Things (IoT) is no longer just a buzzword in the media; it is becoming a reality. The emergence of IoT leads us into a new era of innovation and creativity. This sets high expectations on both the research community and industry for delivering the necessary technological advancements that will allow for the materialization of new IoT applications.

Powering billions of embedded devices deployed into the environment is one of the biggest challenges that IoT faces. Battery-based solutions tend to be impractical and costly due to the need of recharging or replacement. Energy harvesting appears as a viable option for many IoT applications, but it comes with two caveats. First, it is often an unreliable source of energy. Second, there is still a large gap between the energy it can deliver and the required energy budget for many IoT applications. For mission critical IoT applications, such as health-care, completing a task before running out of energy budget is vital.

Traditionally, hardware innovation has been the safe heaven to achieve sufficiently large savings of energy in information and communication technology. Similarly, hardware innovation is currently the prominent response to tackle the energy

Manuscript received June 9, 2017; accepted July 13, 2017. Date of publication August 16, 2017; date of current version September 7, 2018. This work was supported in part by the ARTEMIS Joint Undertaking (Project: EMC2) under Grant 621429, and in part by the Royal Society Newton Advanced Fellowship under Grant NA160108. This manuscript was recommended for publication by T. Mitra. (Corresponding author: Kyriakos Georgiou.)

K. Georgiou and K. Eder are with the University of Bristol, Bristol BS8 1UB, U.K. (e-mail: kyriakos.georgiou@bristol.ac.uk; kerstin.eder@bristol.ac.uk).

S. Xavier-de-Souza is with the Universidade Federal do Rio Grande do Norte, Natal 59078-900, Brazil (e-mail: samuel@dca.ufrn.br).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/LES.2017.2741419

challenge IoT faces. New ultralow-energy embedded devices were introduced, and existing technologies were customized to create new, more energy efficient versions, such as the Bluetooth Low Energy. These are well suited for energy-critical applications. But, is that all we can do to tackle this challenge?

It is estimated that up to 80% of the total energy consumption of an embedded system depends on how software drives hardware [1]. Inefficient software can drive energy-efficient hardware to waste the system's energy budget. Furber [2], principal designer of the ARM microprocessor, gave an interview in 2010 and stated:

"Programmers will not be able to afford to be ignorant about the energy cost of the programs they write ... You need tools that give you feedback and tell you how good your decisions are. Currently the tools don't give you that kind of feedback."

These tools are now needed more than ever to overcome the IoT energy challenge. Programmers have very limited information on how much energy their programs consume, and which parts use the most energy. This has two important implications for the development of IoT applications.

- Much guesswork is needed, and thus poor energy-related choices are only identified at a late stage when the system malfunctions due to a failure to meet the system's energy requirements.
- The high level of expertise needed and the lack of energy-aware development tools significantly reduce the number of embedded developers who are able to deliver energy-critical systems.

Because of this, the whole process of developing energy-constrained applications becomes difficult and costly. There is a need for tools that expose the software's effect on the energy consumption of a system. Such energy transparency will allow programmers, toolchains, and runtime systems to make energy-aware decisions in order to meet the strict energy constraints of the IoT.

## II. ENABLING ENERGY TRANSPARENCY IS DIFFICULT

Various layers have been introduced in the system stack, abstracting away complex details to make programming easier. This prevents programmers from understanding the impact of their coding choices on the way hardware is utilized at runtime. The task of enabling the required energy transparency can be seen as equivalent to the task of reverse engineering the code transformations that take place between and at each software abstraction layer. This will allow the propagation of resource usage information from hardware to software. Such a reverse engineering process is a hard challenge and highly dependent on the architecture and compiler choice. Instead, novel energy transparency techniques are needed which can

1943-0663 © 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications\_standards/publications/rights/index.html for more information.

approximate the energy consumption of a program at different software abstraction levels, without the need for reverse engineering. For these techniques to enable energy-aware software development, the following requirements must be met.

- 1) Both Actual Energy Consumption and Bounds Must Be Provided: Energy consumption bounds will guide developers in meeting strict energy budget requirements. Actual energy consumption estimates are necessary to serve as a benchmark to identify potential energy optimizations.
- 2) Energy Transparency at Multiple Levels of Software Abstraction: Software developers and toolchains can significantly influence the energy consumption of a program mainly at three software abstraction levels; the source code, the compiler's intermediate representation (IR), and the instruction set architecture (ISA).
- 3) Fine-Grained Energy Characterization of Software: Identifying energy hot-spots requires the ability to attribute the energy consumption estimates to the various basic software components, such as control flow graph basic blocks.
- 4) Target and Programming Language Independence: Target and programming language agnostic techniques must be provided in a common framework to enable practical and cost effective energy-aware development for a large number of embedded architectures.
- 5) Explore Multithreaded and Multicore Architectures:
  Novel multicore, multithreaded embedded architectures emerged over the last decade, driven by the increasing demand for more computing power. As this trend is expected to grow in the future, it is important to consider such architectures and to fully exploit the potential savings of parallelization when the number of cores increases and the cores' voltage and frequency scale.
- 6) Enable Design Space Exploration: Developers and toolchains need to apply multiobjective optimizations to find the optimum balance between the available resources, such as execution time, energy, code size, and number of cores and threads used. To enable this, energy transparency techniques have to provide sufficiently accurate feedback on the effect that each different configuration has on the resources of interest.
- 7) Fast and Easy to Deploy: Energy consumption estimation speed is critical to the iterative software optimization process. Furthermore, energy transparency techniques should be easy to deploy and use.

#### III. EXISTING APPROACHES AND LIMITATIONS

This section examines the state of the art of energy transparency techniques and their limitations in regards of the above requirements.

## A. Measuring Energy Consumption

While physical measurements are potentially most accurate to determine the energy consumption of a program, they fail to meet many of the requirements set in Section II. First, measurements typically require sophisticated equipment and hardware knowledge that most software developers lack. Second, most hardware components have no provisions for



Fig. 1. Software abstraction level and energy estimation accuracy tradeoff.

energy measurements; thus custom modifications are needed to probe their power supply. These make physical measurements difficult to deploy and use. Furthermore, extracting energy consumption bounds with end-to-end measurements is inappropriate in most cases, as the whole input space would need to be exhaustively searched. Finally, fine-grained energy characterization of software can be challenging. Usually, this requires expensive measuring equipment, such as oscilloscopes, that can support a high sampling frequency.

## B. Estimating Energy Consumption

Estimating the energy consumption of a program for a particular hardware platform requires two main elements: 1) an analysis technique and 2) a way to convey energy information to the analysis. The latter is typically achieved via energy modeling. An energy model statically captures the dynamic behavior of a processor in regards to its energy consumption characteristics; for example, it can associate energy costs with atomic units in a program, such as ISA instructions, or to various execution events, such as a cache miss.

To perform energy estimation at a software level of abstraction, an energy model is required at the same level. An energy analysis technique automatically inherits any precision loss existing in the energy model that the technique utilizes. Fig. 1 illustrates the tradeoff between the energy consumption estimation accuracy and the level of software abstraction. Modeling at a lower level is always more accurate as it is closer to the hardware, where the actual power dissipation occurs. On the other hand, when moving to higher levels of abstraction, the amount of program information, such as types and loop structures, increases. Such information can be crucial for static code analysis and optimization. However, when moving from source code to the ISA level, much of this information is lost due to the various transformation and optimization passes that occur at the respective levels of the software stack.

In the next sections, the state of the art of energy consumption estimation techniques is examined.

- 1) Profiling-Based Energy Consumption Estimation: In this case, estimation is performed by collecting execution statistics and utilizing them with an appropriate energy model. Such a model needs to provide energy information for the various entities that occur in the execution statistics. Three main techniques are used to collect execution statistics for energy consumption estimation.
- a) Simulation: This is typically performed at low hardware design levels, such as the register-transfer level (RTL) [3]. Thus, it is difficult to achieve fine-grained energy consumption attribution to the various software components.

Moreover, modeling and profiling at such low levels is impractical for most commercial embedded processors since essential circuit information, such as the effective capacitance of major architectural blocks, is not available. To account for these issues, simulation-based energy estimation has been performed at the ISA level [4]. Energy models at this level can be constructed for deeply embedded commercial processors by treating the hardware as a black box [4]. ISA energy models are less accurate than lower-level models, but ISA simulation is considerably faster than hardware simulation and allows for fine-grained energy characterization of software.

Energy modeling and profiling at the ISA level are insufficient for more complex architectures or system-level energy consumption estimation. This is because it is difficult to statically capture the behavior of performance-enhancing hardware components, such as caches.

Data-sensitive energy models are needed to account for the input data effect when estimating the actual energy consumption. These account for the inter-instruction effects caused by the operand values used for each instruction. Combining these models with dynamic analysis techniques that can track the data used across the whole execution can yield more accurate estimations. Simulation can provide this information but at the expense of increased estimation time.

Generally, simulation-based energy consumption estimation tends to be slow. This makes it difficult for the technique to be incorporated in software development tools, were instant feedback is required for an iterative process of optimizing energy consumption.

b) Code instrumentation: This is performed by instrumenting the code with instructions that extract execution statistics at runtime. The main challenge is to extract the statistics out of the hardware while minimizing the instrumentation overhead that can significantly impact the estimation's accuracy. Therefore, the amount of execution statistics collected is typically less compared to simulation. This makes the retrieved energy consumption estimations less accurate than simulation-based estimations. A major advantage of this method is that it is significantly faster than simulation-based energy estimation.

Recent work [5] demonstrated a new profiling technique that collects execution statistics at the compiler's IR level. This was combined with a dynamic mapping technique that lifts an ISA energy model to the compiler's IR to retrieve energy estimations at the IR level. The technique guarantees no energy overhead in the estimations due to instrumentation code, and achieved an average accuracy of 2.5%.

c) Performance monitoring counters: Statistical performance monitoring counter (PMC)-based estimation is preferable for more complex architectures were ISA-level modeling and analysis is insufficient to accurately capture their complexity. PMC execution statistics can be used to construct energy models and estimate the energy consumption of multithreaded and multicore architectures [6]. Run-time power estimation can be enabled using PMC [7]. This allows for energy-aware decisions to be made at runtime.

For a given processor, energy modeling and profiling using PMCs can be challenging due to the limited types of the events that can be monitored and the restricted number of counters that can be sampled simultaneously. The same constraints

apply when trying to port the PMC-based modeling and estimation techniques to a new target. Moreover, runtime use of such PMC-based energy estimation methods could cause a significant overhead on the system's performance [6].

- 2) SRA-Based Energy Consumption Estimation: Similar to physical measurements, it is equally impractical to capture energy consumption bounds using profile-based estimation. Static resource analysis (SRA) offers a better alternative. The following two techniques have been used for statically estimating energy consumption bounds.
- a) Automatic complexity analysis: This has been used to create cost relations that capture energy in terms of program input size at both ISA and compiler's IR levels [8]. The technique can be fully automatic and programming-language independent. The main drawback is the difficulty to extract a closed-form solution for the cost relations of a program. Therefore, the approach does not scale well to large programs with complex structure.
- b) Implicit path enumeration technique (IPET): This is the most popular method for worst case execution time (WCET) analysis [9]. In [10], the technique was used to extract energy consumption bounds on a simulated processor. More recently in [5], IPET was applied at the ISA level of a multithreaded, multicore embedded architecture, using an ISA energy model, and at the compiler's IR level, using a mapping technique that lifts ISA energy models to the compiler's IR level. The authors also demonstrated how the technique can be used for design space exploration for two concurrency patterns: 1) task-farms and 2) pipelined programs.

Like WCET estimation, SRA-based energy estimation works best with predictable architectures and software. Using, SRA to analyze multithreaded programs with complex communication patterns is still a hard challenge. Furthermore, currently, there is no practical method to perform average-case static analysis [11].

## IV. OUTSTANDING CHALLENGES

Techniques that are based on energy modeling, codeinstrumentation and SRA do not need difficult to deploy physical energy measurements and expensive simulations. Moreover, they can provide fine-grained energy characterization of software at multiple levels of abstraction with good accuracy. A combination of SRA and code-instrumentation techniques provides both bounds and actual energy consumption. Furthermore, new energy estimation techniques, such as those presented in [5], are compiler and architecture agnostic, provided an ISA energy model exists. Therefore, such techniques can be relatively easily integrated into development toolchains to provide feedback-directed energy optimization. However, there is still a number of outstanding challenges that need to be addressed.

Currently, there is no practical solution to provide tight upper energy consumption bounds. SRA approaches combined with worst case energy models can lead to significant overestimation [10]. Symbolic simulation at the RTL could retrieve tighter bounds [12], but such approaches require sensitive architectural information, typically not available for commercial processors. To retrieve tight energy bounds at the ISA level, data-sensitive energy models and static analysis would be required. These have to identify the worst case data input.

Recent work demonstrated that finding the data that will trigger the worst case energy consumption is an NP-hard problem and that no practical method can approximate (within reasonable time in general) tight energy consumption upper bounds within any level of confidence [13].

Energy models are typically characterized while using a constant power source. This is an ideal condition, as there are no significant variations in the power supplied to the processor. This is not an issue when the energy estimations retrieved are used to optimize the energy consumption at development time. However, IoT applications typically run on a battery or on an energy harvester. To be able to use the energy estimations for making real-time decisions, the power profile of the power source for a specific application has to be taken into consideration by both energy modeling and energy consumption estimation techniques.

Activities external to the processor, such as sensing and communicating, can consume significantly more energy than the computation part of an embedded system. Computation usually controls such activity. Therefore, the energy usage of peripherals and I/O operations can be profiled and included as part of the energy cost of the computation that triggers them. This will allow existing energy estimation techniques to provide system-wide energy estimates. Such an approach is more feasible for systems with predictable behavior.

While existing SRA-based estimation techniques can handle the complexity of deeply embedded architectures, they generally do not scale well to multithreaded/multicore architectures. Concurrent software introduces complexities over traditional sequential variants, which SRA is inherently limited to cope with. The task is even harder than estimating WCET since all concurrent computations contribute to the worst-case energy scenario, while, for WCET analysis, only the computation that causes the worst case across all threads needs to be considered. Therefore, SRA support for multithreaded and multicore software is currently limited to a range of simpler concurrent software patterns. Both hardware and toolchain vendors need to work closely together to provide novel architectures and methods that will allow energy transparency by design.

The inter-thread and inter-core activity must be considered to enable exploiting the interplay between performance and energy on multithreaded, manycore and multicore architectures that support voltage and frequency scaling. Finding optimal configurations based on runtime execution statistics and a power model for the architecture is more practical than using simulation. This is because simulation is typically several orders of magnitude slower than hardware execution. Furthermore, an energy model that accounts for voltage and frequency scaling and parallelism can answer optimization questions such as how many cores and at what frequency they should run for a given performance target [14], [15]. Hardware vendors should provide more support to enable such techniques; for example more PMCs. Finally, the combination of both static and runtime analysis must be explored to realize energy savings beyond those that can be achieved when these approaches are used in isolation.

Perhaps, the biggest challenge of enabling energy-aware software development lies in the tool-vendor's and programmer's perception. Traditionally, optimizing energy consumption has been treated as a side effect of improving execution

time. Toolchains have been long focused on improving execution time, and developers are left assured that a compiler will do the best possible job on optimizing their code for both time and energy. The IoT energy challenge is an opportunity to start treating energy consumption as a first class citizen while developing software.

#### V. Conclusion

Hardware offers energy saving capabilities for the software to exploit. The responsibility lies in the hands of the system engineer to program and configure the selected device in the most energy efficient way for the task at hand. Development toolchains need to be enhanced with energy transparency that provides the necessary feedback to enable energy-aware software development. Hardware vendors need to provide architectures and information, such as energy models, that will support energy transparency. Finally, tool vendors and programmers need to take a more active role in delivering energy efficient systems. We cannot keep ignoring the software role in the energy consumption of a system.

#### REFERENCES

- [1] G. Luo, B. Guo, Y. Shen, H. Liao, and L. Re, "Analysis and optimization of embedded software energy consumption on the source code and algorithm level," in *Proc. EMC*, Dec. 2009, pp. 1–5.
- [2] "A conversation with Steve Furber," Queue, vol. 8, no. 2, pp. 1–8, Feb. 2010. [Online]. Available: http://dl.acm.org/citation.cfm?id=1716385
- [3] W. Ye, N. Vijaykrishnan, M. Kandemir, and M. J. Irwin, "The design and use of simplepower: A cycle-accurate energy estimation tool," in *Proc. DAC*, Los Angeles, CA, USA, 2000, pp. 340–345.
  [4] V. Tiwari, S. Malik, A. Wolfe, and M. T.-C. Lee, "Instruction level
- [4] V. Tiwari, S. Malik, A. Wolfe, and M. T.-C. Lee, "Instruction level power analysis and optimization of software," J. VLSI Signal Process. Syst. Signal Image Video Technol., vol. 13, nos. 2–3, pp. 223–238, 1996.
- [5] K. Georgiou, S. Kerrison, Z. Chamski, and K. Eder, "Energy transparency for deeply embedded programs," ACM Trans. Archit. Code Optim., vol. 14, no. 1, pp. 1–26, Mar. 2017.
- [6] S. Schubert, D. Kostic, W. Zwaenepoel, and K. G. Shin, "Profiling soft-ware for energy consumption," in *Proc. GreenCom*, Besançon, France, Nov. 2012, pp. 515–522.
- [7] G. Contreras and M. Martonosi, "Power prediction for Intel XScale® processors using performance monitoring unit events," in *Proc. ISLPED*, San Diego, CA, USA, 2005, pp. 221–226.
- [8] U. Liqat et al., "Inferring parametric energy consumption functions at different software levels: ISA vs. LLVM IR," in Foundational and Practical Aspects of Resource Analysis. Cham, Switzerland: Springer, 2016, pp. 81–100.
- [9] R. Wilhelm et al., "The worst-case execution-time problem—Overview of methods and survey of tools," ACM Trans. Embedded Comput. Syst., vol. 7, no. 3, pp. 1–53, Apr. 2008.
- [10] R. Jayaseelan, T. Mitra, and X. Li, "Estimating the worst-case energy consumption of embedded software," in *Proc. RTAS*, San Jose, CA, USA, Apr. 2006, pp. 81–90.
- [11] J. M. Townley, "Practical programming for static average-case analysis: The MOQA investigation," Ph.D. dissertation, Dept. Comput. Sci., Univ. College Cork, Cork, Ireland, 2013.
- [12] H. Cherupalli, H. Duwe, W. Ye, R. Kumar, and J. Sartori, "Determining application-specific peak power and energy requirements for ultra-low power processors," SIGARCH Comput. Archit. News, vol. 45, no. 1, pp. 3–16, Apr. 2017.
- [13] J. Morse, S. Kerrison, and K. Eder, "On the limitations of analysing worst-case dynamic energy of processing," CoRR, Tech. Paper abs/1603.02580, 2016.
- [14] C. A. Barros, L. F. Q. Silveira, C. A. Valderrama, and S. Xavier-de-Souza, "Optimal processor dynamic-energy reduction for parallel workloads on heterogeneous multi-core architectures," *Microprocessors Microsyst.*, vol. 39, no. 6, pp. 418–425, 2015.
- [15] G. Hager, J. Treibig, J. Habich, and G. Wellein, "Exploring performance and power properties of modern multi-core chips via simple machine models," *Concurrency Comput. Pract. Exp.*, vol. 28, no. 2, pp. 189–210, 2016