Skip to main content

Model Extraction Resistance

Definition

Model Extraction Resistance describes the ability of a machine learning model to prevent unauthorized parties from reconstructing or copying its underlying parameters or architecture through repeated queries. Attackers attempt to reverse-engineer a model by observing its outputs to various inputs, effectively stealing the model. Defenses aim to make this process computationally prohibitive or inaccurate, protecting the intellectual property and proprietary algorithms embedded within the model. It safeguards the investment in model development from illicit duplication.