These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
Early backdoor attacks against machine learning set off an arms race in
attack and defence development. Defences have since appeared demonstrating some
ability to detect backdoors in models or even remove them. These defences work
by inspecting the training data, the model, or the integrity of the training
procedure. In this work, we show that backdoors can be added during
compilation, circumventing any safeguards in the data preparation and model
training stages. The attacker can not only insert existing weight-based
backdoors during compilation, but also a new class of weight-independent
backdoors, such as ImpNet. These backdoors are impossible to detect during the
training or data preparation processes, because they are not yet present. Next,
we demonstrate that some backdoors, including ImpNet, can only be reliably
detected at the stage where they are inserted and removing them anywhere else
presents a significant challenge. We conclude that ML model security requires
assurance of provenance along the entire technical pipeline, including the
data, model architecture, compiler, and hardware specification.