REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints

Francesco Corti, Balz Maag, Joachim Schauer, Ulrich Pferschy, Olga Saukh

Publication: Working paper › Preprint

Abstract

Deep models deployed on edge devices frequently encounter resource variability, which arises from fluctuating energy levels, timing constraints, or prioritization of other critical tasks within the system. State-of-the-art machine learning pipelines generate resource-agnostic models that are incapable of adapting at runtime. In this work we introduce Resource-Efficient Deep Subnetworks (REDS) to tackle model adaptation to variable resources. In contrast to the state-of-the-art, REDS use structured sparsity constructively by exploiting the permutation invariance of neurons, which allows for hardware-specific optimizations. Specifically, REDS achieve computational efficiency by (1) skipping sequential computational blocks identified by a novel iterative knapsack optimizer, and (2) leveraging simple math to re-arrange the order of operations in the REDS computational graph to take advantage of the data cache. REDS support conventional deep networks frequently deployed on the edge and provide computational benefits even for small and simple networks. We evaluate REDS on six benchmark architectures trained on the Google Speech Commands, FMNIST and CIFAR10 datasets, and test them on four off-the-shelf mobile and embedded hardware platforms. We provide a theoretical result and empirical evidence for the outstanding performance of REDS in terms of the submodels’ test set accuracy, and demonstrate an adaptation time in response to dynamic resource constraints of under 40 μs, using a 2-layer fully-connected network on an Arduino Nano 33 BLE Sense.
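The permutation invariance the abstract refers to is a standard property of fully-connected layers: reordering the hidden neurons of a layer, together with the corresponding rows of its weights and the matching columns of the next layer's weights, leaves the network's output unchanged. A minimal NumPy sketch of this property for a 2-layer MLP (all shapes and names here are illustrative, not taken from the REDS implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 2-layer MLP: 4 inputs -> 8 hidden neurons (ReLU) -> 3 outputs.
W1, b1 = rng.normal(size=(8, 4)), rng.normal(size=8)
W2, b2 = rng.normal(size=(3, 8)), rng.normal(size=3)
x = rng.normal(size=4)

def forward(W1, b1, W2, b2, x):
    h = np.maximum(W1 @ x + b1, 0.0)  # hidden activations
    return W2 @ h + b2

# Permute the hidden neurons: rows of (W1, b1) and columns of W2
# are reordered consistently, so the output is unchanged.
perm = rng.permutation(8)
y_ref = forward(W1, b1, W2, b2, x)
y_perm = forward(W1[perm], b1[perm], W2[:, perm], b2, x)
assert np.allclose(y_ref, y_perm)
```

Because outputs are invariant under such reorderings, neurons can be sorted (e.g. by importance) so that a contiguous prefix of each layer forms a valid subnetwork, which is what makes skipping whole sequential blocks hardware-friendly.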
Original language: English
Pages: 1-26
Number of pages: 26
Publication status: Submitted - 30 Oct 2023
