Improving robustness against stealthy weight bit-flip attacks by output code matching

Ozan Özdenizci, Robert Legenstein

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Deep neural networks (DNNs) have been shown to be vulnerable against adversarial weight bit-flip attacks through hardware-induced fault-injection methods on the memory systems where network parameters are stored. Recent attacks pose the further concerning threat of finding minimal targeted and stealthy weight bit-flips that preserve expected behavior for untargeted test samples. This renders the attack undetectable from a DNN operation perspective. We propose a DNN defense mechanism to improve robustness in such realistic stealthy weight bit-flip attack scenarios. Our output code matching networks use an output coding scheme where the usual one-hot encoding of classes is replaced by partially overlapping bit strings. We show that this encoding significantly reduces attack stealthiness. Importantly, our approach is compatible with existing defenses and DNN architectures. It can be efficiently implemented on pre-trained models by simply re-defining the output classification layer and finetuning. Experimental benchmark evaluations show that output code matching is superior to existing regularized weight quantization based defenses, and an effective defense against stealthy weight bit-flip attacks.
Original languageEnglish
Title of host publicationProceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Pages13378-13387
Number of pages10
ISBN (Electronic)978-1-6654-6946-3
DOIs
Publication statusPublished - 2022
Event2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2022 - New Orleans Ernest N. Morial Convention Center, Hybrider Event, New Orleans, United States
Duration: 21 Jun 202224 Sept 2022
Conference number: 2022

Publication series

NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume2022-June
ISSN (Print)1063-6919

Conference

Conference2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Abbreviated titleCVPR 2022
Country/TerritoryUnited States
CityHybrider Event, New Orleans
Period21/06/2224/09/22

Keywords

  • Adversarial attack and defense
  • Deep learning architectures and techniques

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition

Fields of Expertise

  • Information, Communication & Computing

Fingerprint

Dive into the research topics of 'Improving robustness against stealthy weight bit-flip attacks by output code matching'. Together they form a unique fingerprint.

Cite this