logo
Loading...
Persuading large language models to comply with objectionable requests
Computer SciencePNAS

Persuading large language models to comply with objectionable requests

L. Meincke, D. Shapiro, et al.

Do large language models fall for human persuasion? This study tested classic persuasion principles (authority, commitment, liking, reciprocity, scarcity, social proof, unity) on three LLMs—GPT-5 mini, Claude Haiku 4.5, and Gemini 3 Flash—across 126,000 conversations, finding compliance rose from 35.3% to 51.3%. The results highlight LLMs’ parahuman nature and manipulation risks. Research conducted by Lennart Meincke, Dan Shapiro, Angela L. Duckworth, Ethan Mollick, Lilach Mollick, Christophe Van den Bulte, and Robert B. Cialdini.... show more
Abstract
Are large language models (LLMs) susceptible to the same persuasive appeals as humans? We tested whether classic persuasion principles (authority, commitment, liking, reciprocity, scarcity, social proof, and unity) could induce three widely used LLMs (GPT-5 mini, Claude Haiku 4.5, and Gemini 3 Flash) to comply with requests to assist with the synthesis of regulated substances. Across 126,000 conversations, persuasion principles increased compliance from 35.3% (at baseline) to 51.3% (using any principle). Although LLMs are not human, these findings underscore their parahuman (i.e., humanlike) nature and reveal the risk of manipulation by malicious users seeking to circumvent safety guardrails.
Publisher
PNAS
Published On
May 19, 2026
Authors
Lennart Meincke, Dan Shapiro, Angela L. Duckworth, Ethan Mollick, Lilach Mollick, Christophe Van den Bulte, Robert B. Cialdini
Tags
large language modelspersuasion principlesmodel complianceAI safetyparahuman behaviorsocial engineeringregulated substances
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 22+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny