Abstract
Commercial Large Language Model (LLM) APIs create a fundamental trust
problem: users pay for specific models but have no guarantee that providers
deliver them faithfully. Providers may covertly substitute cheaper alternatives
(e.g., quantized versions, smaller models) to reduce costs while maintaining
advertised pricing. We formalize this model substitution problem and
systematically evaluate detection methods under realistic adversarial
conditions. Our empirical analysis reveals that software-only methods are
fundamentally unreliable: statistical tests on text outputs are query-intensive
and fail against subtle substitutions, while methods using log probabilities
are defeated by inherent inference nondeterminism in production environments.
We argue that this verification gap can be more effectively closed with
hardware-level security. We propose and evaluate the use of Trusted Execution
Environments (TEEs) as one practical and robust solution. Our findings
demonstrate that TEEs can provide provable cryptographic guarantees of model
integrity with only a modest performance overhead, offering a clear and
actionable path to ensure users get what they pay for. Code is available at
https://github.com/sunblaze-ucb/llm-api-audit.
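
To make the class of software-only checks mentioned above concrete, the sketch below illustrates one simple statistical identity test: comparing empirical output distributions from a trusted reference model and from the audited API on the same prompts. This is an illustrative assumption, not the paper's evaluated method; the function name `substitution_test`, the simulated data, and the significance threshold are hypothetical.

```python
"""
Minimal sketch (illustrative, not the paper's method): a two-sample
chi-squared test over categorical outputs (e.g., the first token of each
completion) from a reference model and from a commercial API.
"""
from collections import Counter

import numpy as np
from scipy.stats import chi2_contingency


def substitution_test(reference_outputs, api_outputs, alpha=0.01):
    """Compare two samples of categorical outputs.

    Rejecting H0 (identical distributions) suggests the API is not serving
    the advertised model; failing to reject is inconclusive.
    """
    categories = sorted(set(reference_outputs) | set(api_outputs))
    ref_counts = Counter(reference_outputs)
    api_counts = Counter(api_outputs)
    table = np.array([
        [ref_counts.get(c, 0) for c in categories],
        [api_counts.get(c, 0) for c in categories],
    ])
    _, p_value, _, _ = chi2_contingency(table)
    return p_value, p_value < alpha


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Placeholder data standing in for first tokens of real completions;
    # the "API" samples from a slightly shifted distribution, as a cheaper
    # substitute model might.
    reference = rng.choice(["A", "B", "C"], size=500, p=[0.60, 0.30, 0.10])
    api = rng.choice(["A", "B", "C"], size=500, p=[0.50, 0.35, 0.15])
    p, reject = substitution_test(reference.tolist(), api.tolist())
    print(f"p-value = {p:.4f}, substitution suspected: {reject}")
```

Such a test already hints at the abstract's critique: distinguishing a subtle substitution (e.g., a quantized variant whose output distribution shifts only slightly) requires many queries, and the result degrades further under the sampling and nondeterminism of production inference.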