A curious question from my kids sent Gemini into a hallucination.
Google's AI Overview can be 100% wrong, even when SERP is right.
Google’s AI Overview can be 100% wrong, even when SERP is right.
It could be a trap when you least expect it.
Today, my kids asked me, “What is Buah Long Long?” I quickly Googled it to show them images. The initial search was good: both the AI Overview and SERP correctly identified “Buah Long Long” as Ambarella Fruit.
However, when I tapped on “Benefits of Buah Long Long,” the AI Overview hallucinated, confidently stating that Buah Long Long is also known as longan. This led to a cascade of completely wrong information.
Ironically, the SERP results for “Benefits of Buah Long Long” were accurate. AI Overview is supposed to be grounded in Google Search, so why was it so far off?
This isn’t just about fruit; it’s a critical reminder about the reliability of information, especially from AI summaries.
I tested the same prompt, “Benefits of Buah Long Long,” on Perplexity, and it had no issues. This incident, combined with my past experiences, indicates that Gemini still exhibits the worst hallucinations among frontier LLMs.
My takeaway: Always, always, always double-check facts presented by AI Overview.
What are the worst hallucinations you’ve encountered recently?
Technical observation:
It hallucinates when I least expect it. But here’s an interesting observation: Buah Long Long and Longan both contain the word “Long”. Could this be related to tokenization?
#gemini #googleai #ai #google #artificialintelligence #chatgpt #perplexity #openai #genai #llm #hallucinations
Tap to expand
Enjoyed this? Subscribe for more.
Practical insights on AI, growth, and independent learning. No spam.
More in AI Agents
Don't believe the BS that you can use Claude Code for free.
Ollama recently made their API compatible with Claude Code. Many creators quickly jumped on the opportunity to farm engagement with the hook: "You can now u...
GenAI Design Thinking Workshop
Helping participants break down their business processes to identify opportunities for adopting agentic AI workflows using the 5I framework.
I’m honored to be invited to moderate an insightful roundtable on 𝗘𝗹𝗲𝘃𝗮𝘁𝗶𝗻𝗴 𝗙𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻𝗮𝗹 𝗦𝘆𝘀𝘁𝗲𝗺𝘀 𝗶𝗻 𝘁𝗵𝗲 𝗔𝗜 𝗔𝗴𝗲, hosted by The Ortus Club and MuleSoft — with an exceptional group of tech and data leaders across industries like banking, telco, healthcare, transport, finance, and travel.
We unpacked tough questions on:
If you are using OpenClaw with WhatsApp, there is one risk nobody is talking about.
Getting your WhatsApp account permanently banned.
Does Qwen 3.5 live up to the hype?
I tested 9 local LLMs on a Claude Code skill I actually use every day. Not a coding benchmark. A real multi-step agentic task described in natural language a...
OpenClaw Creator: Why 80% Of Apps Will Disappear
If you are still thinking OpenClaw is just hype, you should watch this interview with Peter Steinberger.
Don't believe the BS that you can use Claude Code for free.
Ollama recently made their API compatible with Claude Code. Many creators quickly jumped on the opportunity to farm engagement with the hook: "You can now u...
If you are using OpenClaw with WhatsApp, there is one risk nobody is talking about.
Getting your WhatsApp account permanently banned.
OpenClaw Creator: Why 80% Of Apps Will Disappear
If you are still thinking OpenClaw is just hype, you should watch this interview with Peter Steinberger.
GenAI Design Thinking Workshop
Helping participants break down their business processes to identify opportunities for adopting agentic AI workflows using the 5I framework.
I’m honored to be invited to moderate an insightful roundtable on 𝗘𝗹𝗲𝘃𝗮𝘁𝗶𝗻𝗴 𝗙𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻𝗮𝗹 𝗦𝘆𝘀𝘁𝗲𝗺𝘀 𝗶𝗻 𝘁𝗵𝗲 𝗔𝗜 𝗔𝗴𝗲, hosted by The Ortus Club and MuleSoft — with an exceptional group of tech and data leaders across industries like banking, telco, healthcare, transport, finance, and travel.
We unpacked tough questions on:
Does Qwen 3.5 live up to the hype?
I tested 9 local LLMs on a Claude Code skill I actually use every day. Not a coding benchmark. A real multi-step agentic task described in natural language a...