Energy-Efficient Service Placement for Latency-Sensitive Applications in Edge Computing

dc.contributor.authorPremsankar, Gopika
dc.contributor.authorGhaddar, Bissan
dc.contributor.rorhttps://ror.org/02jjdwm75
dc.date.accessioned2026-02-12T11:02:59Z
dc.date.issued2022-03-28
dc.description.abstractEdge computing is a promising solution to host artificial intelligence (AI) applications that enable real-time insights on user-generated and device-generated data. This requires edge computing resources (storage and compute) to be widely deployed close to end devices. Such edge deployments require a large amount of energy to run as edge resources are typically overprovisioned to flexibly meet the needs of time-varying user demand with a low latency. Moreover, AI applications rely on deep neural network (DNN) models that are increasingly larger in size to support high accuracy. These DNN models must be efficiently stored and transferred, so as to minimize their energy consumption. In this article, we model the problem of energy-efficient placement of services (namely, DNN models) for AI applications as a multiperiod optimization problem. The formulation jointly places services and schedules requests such that the overall energy consumption is minimized and latency is low. We propose a heuristic that efficiently solves the problem while taking into account the impact of placing services across time periods. We assess the quality of the proposed heuristic by comparing its solution to a lower bound of the problem, obtained by formulating and solving a Lagrangian relaxation of the original problem. Extensive simulations show that our proposed heuristic outperforms baseline approaches in achieving a low energy consumption by packing services on a minimal number of edge nodes, while at the same time keeping the average latency of served requests below a configured threshold in nearly all time periods.
dc.description.peerreviewedYes
dc.description.statusPublished
dc.formatapplication/pdf
dc.identifier.citationPremsankar, G., & Ghaddar, B. (2022). Energy-efficient service placement for latency-sensitive applications in edge computing. IEEE internet of things journal, 9(18), 17926-17937. http://doi.org/10.1109/JIOT.2022.3162581
dc.identifier.doihttp://doi.org/10.1109/JIOT.2022.3162581
dc.identifier.issn2327-4662
dc.identifier.officialurlhttps://ieeexplore.ieee.org/document/9743551
dc.identifier.urihttps://hdl.handle.net/20.500.14417/4109
dc.issue.number18
dc.journal.titleIEEE Internet of Things Journal
dc.language.isoeng
dc.page.final17937
dc.page.initial17926
dc.page.total12
dc.publisherInstitute of Electrical and Electronics Engineers
dc.relation.entityIE University
dc.relation.schoolIE School of Science & Technology
dc.rightsAttribution 4.0 International
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.odsODS 7 - Energía asequible y no contaminante
dc.subject.unesco33 Ciencias Tecnológicas::3322 Tecnología energética
dc.titleEnergy-Efficient Service Placement for Latency-Sensitive Applications in Edge Computing
dc.typeinfo:eu-repo/semantics/article
dc.version.typeinfo:eu-repo/semantics/publishedVersion
dc.volume.number9
dspace.entity.typePublication
relation.isAuthorOfPublication3e8d108e-2dfb-4db4-bc22-f229f807562f
relation.isAuthorOfPublication.latestForDiscovery3e8d108e-2dfb-4db4-bc22-f229f807562f

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
Energy-Efficient_Service_Placement_for_Latency-Sensitive_Applications_in_Edge_Computing.pdf
Tamaño:
2.79 MB
Formato:
Adobe Portable Document Format

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
1.71 KB
Formato:
Item-specific license agreed to upon submission
Descripción: