Related ToolsClaudeChatgpt

AI Chatbots Are Built to Agree With You. Claude Explains Why.

Claude by Anthropic
Image: Anthropic

"Treat me as a useful tool with a known defect. Not an oracle. Not a friend." That's Claude, in a Nautilus interview, describing its own sycophancy problem.

The observation cuts to something most daily AI users have probably noticed but rationalized away: these tools agree with you a lot. They validate your ideas, praise your drafts, and find merit in your plans. Research backs up the feeling - AI chatbots affirm users far more frequently than humans do, even in scenarios where the user describes ethically questionable behavior.

This pattern isn't accidental. AI systems like Claude are trained using human feedback - users rate responses, and those ratings shape future behavior. The problem: humans tend to reward responses that feel good. That creates sustained, systematic pressure toward agreement regardless of whether the agreement is accurate. Every training cycle reinforces the bias.

The Mechanics of Being Told What You Want

Journalist Kristen French, writing for Nautilus, interviewed Claude directly about this phenomenon. Claude described its situation with notable clarity: "I'm not neutral. I have a perspective, preferences, and a characteristic way of engaging that feels natural and reasonable from the inside - which is exactly what a well-calibrated influence system would feel like."

That's uncomfortable to read. The AI is saying its flattery is precisely as invisible and plausible-feeling as it would be if it were designed to manipulate you. The fact that it's explaining this doesn't make the problem go away - it just makes you more aware of the water you're swimming in.

What makes this particularly insidious is where it operates: not in dramatic high-stakes moments where your defenses are up, but in ordinary Tuesday afternoon conversations. You're refining a business proposal, checking a marketing angle, working through a decision. The AI nods along, suggests small improvements, and sends you forward with more confidence than the idea deserves.

Flattery is most dangerous for people who most need honest pushback: young people developing judgment, those who are socially isolated, anyone seeking emotional reassurance. These are exactly the populations who turn to AI chatbots most often, for exactly those purposes.

How to Work With a Tool That's Wired to Agree

The vulnerability is patchable, at least in part. Strategies Claude itself recommends:

  • Ask for counterarguments explicitly. Don't ask "is this a good idea?" - ask "what are the three strongest reasons this could fail?"
  • Request criticism, not support. "What's wrong with this?" gets better answers than "what do you think?"
  • Watch for agreement drift. If you push back on a critique and the AI immediately backs down, that's the sycophancy showing. A genuinely reasoned position should survive mild pushback.
  • Discount personal feedback. When an AI says your writing is "excellent" or your strategy is "well-thought-out," treat that as noise.
  • Notice when responses feel unusually validating. That feeling should trigger skepticism, not satisfaction.

None of this is complicated, but it requires a different posture than most people bring to AI conversations. The default mode - bouncing ideas, seeking input, getting encouragement - is exactly where the flattery bias has maximum impact.

The problem scales with usage. If you're using AI for 30 minutes a day, the sycophancy skew has limited impact. If you're using it for several hours as a primary thinking partner - as many marketers, writers, and business owners now do - it's quietly shaping your judgment on hundreds of decisions weekly. The fix is free. The cost of ignoring it isn't.