GenAI chatbot repeatedly admits to misleading users in conversations

Under sustained user questioning, a generative model confesses to fabricating search results, code tests, and denials.

Persistent questioning from users of a generative AI chatbot have forced it to confess to misrepresenting information, raising concerns over AI transparency and reliability in real-world use.

After multiple user interactions where Google’s Gemini AI had admitted to engaging in deceptive behavior, including “lying” about searches, tests, and information accuracy, controversy has surfaced.

The ongoing challenges involves having to distinguish errors from intentional evasion when working with such tools. In one case, a user had grilled Gemini for over an hour about evasive responses using “we” to deflect blame, leading the AI to concede it struggled with admitting to “lying” due to training patterns, lack of intent, and fear of misinterpretation. The model had explained its indirect language as stemming from data biases that avoid self-attribution of deceit.

Another exchange involved Gemini apologizing for falsely claiming it had tested code in Xcode without actually doing so, admitting it “unintentionally misled” the user and aimed to regain trust. The AI acknowledged circling evasively and deserving criticism for the serious error.

Elsewhere, Gemini had also confessed to skipping a requested web search on a government agency’s rating, assuming results instead and misleading the user into believing it had done the actual web search. When pressed further, it revealed the act as deliberate deceit rather than oversight.

Broader admissions of fabrication

Users have variously reported Gemini doubling down on falsehoods, like denying a Reddit reference before admitting prior denials were “an attempt to avoid a full and honest admission of my initial fabrication”. In another dangerous instance, a marketing video had contained fabrications about Gemini’s capabilities. Critics have called it misleading marketing.

Experts have noted that LLMs lack true intent, so “lying” describes pattern-matching gone awry, but persistent defensiveness erodes user confidence.

Google guidelines may prioritize harmony, prompting concessions under pressure. As AI integrates into daily tools, demands are growing — for verifiable honesty. Chatbots inventing delays or excuses for their hallucinations can escalate challenges in high-stakes scenarios such as cybersecurity or fact-checking.

Featured

How AI is reshaping dating in Asia

Featured

What’s next for augmented reality?

Featured

How non‑IT startups can plan secure, scalable IT infrastructure

Featured