Stealing Machine Learning Models via Prediction APIs


3 Oct 2016 | Florian Tramèr, Fan Zhang, Ari Juels, Michael K. Reiter, Thomas Ristenpart
The paper "Stealing Machine Learning Models via Prediction APIs" by Florian Tramer explores the vulnerability of machine learning (ML) models to extraction attacks, particularly in the context of ML-as-a-service (MLaaS) systems. These systems allow users to train models on sensitive data and charge others for predictions. The authors investigate *model extraction attacks*, where an adversary with no prior knowledge of the model's parameters or training data aims to duplicate the model's functionality. Key findings include: 1. **Simple Equation-Solving Attacks**: For logistic regression, neural networks, and decision trees, the authors demonstrate efficient attacks that use non-adaptive, random queries to solve for the model's parameters. These attacks are successful against popular MLaaS providers like Amazon and BigML. 2. **Path-Finding Attacks on Decision Trees**: A novel attack method is introduced that exploits the rich information provided by prediction APIs, such as confidence values and partial feature vectors. This method allows the adversary to discover the structure of decision trees by treating confidence values as pseudo-identifiers for tree paths. 3. **Countermeasures**: The authors evaluate the effectiveness of omitting confidence values from model outputs as a countermeasure. While this approach is less effective than equation-solving attacks, it still poses a significant threat. 4. **Implications**: The attacks can leak private information about training data and enable evasion of security applications, such as spam or fraud detection. The paper highlights the need for careful deployment of ML models and the development of new countermeasures to protect against model extraction attacks.The paper "Stealing Machine Learning Models via Prediction APIs" by Florian Tramer explores the vulnerability of machine learning (ML) models to extraction attacks, particularly in the context of ML-as-a-service (MLaaS) systems. These systems allow users to train models on sensitive data and charge others for predictions. The authors investigate *model extraction attacks*, where an adversary with no prior knowledge of the model's parameters or training data aims to duplicate the model's functionality. Key findings include: 1. **Simple Equation-Solving Attacks**: For logistic regression, neural networks, and decision trees, the authors demonstrate efficient attacks that use non-adaptive, random queries to solve for the model's parameters. These attacks are successful against popular MLaaS providers like Amazon and BigML. 2. **Path-Finding Attacks on Decision Trees**: A novel attack method is introduced that exploits the rich information provided by prediction APIs, such as confidence values and partial feature vectors. This method allows the adversary to discover the structure of decision trees by treating confidence values as pseudo-identifiers for tree paths. 3. **Countermeasures**: The authors evaluate the effectiveness of omitting confidence values from model outputs as a countermeasure. While this approach is less effective than equation-solving attacks, it still poses a significant threat. 4. **Implications**: The attacks can leak private information about training data and enable evasion of security applications, such as spam or fraud detection. The paper highlights the need for careful deployment of ML models and the development of new countermeasures to protect against model extraction attacks.
[slides and audio] Stealing Machine Learning Models via Prediction APIs