😵 GPT-5 x Gemi2.5 Lost in Fake Maze-crosoft
MUST-see 7 Major Flaws in GPT-5 Called HackedGPT

Google just showed us how Gemini became a math genius, thanks for this gift. Also: all of Anthropic’s models now get retirement plans, just like us after years of work, lol.
What's on FIRE 🔥
AI INSIGHTS
Gemini DeepThink crushed it at this year’s International Math Olympiad (IMO). Now the full paper behind those results shows why: a brand-new math benchmark suite called IMO-Bench!
It’s a mega test suite for evaluating how well AI does real math reasoning, built with input from IMO medalists. Here’s what’s inside:
IMO-AnswerBench: 400 tough short-answer problems (with paraphrased versions to block memorized answers)
IMO-ProofBench: 60 full problems where AI has to show step-by-step work
IMO-GradingBench: to check how well AI models can grade math proofs
And it uses an AnswerAutoGrader that can handle messy outputs and still match human judgment 98.9% of the time. Some fun takeaways:
Popular math datasets are mostly saturated. This benchmark forces multi-step reasoning
AI graders? Actually matching human judges pretty closely
Gemini’s edge was real. It trained with this benchmark in the loop. That’s why it crushed math problems other models stumbled on. The IMO-Bench is now public.
AI SOURCES FROM AI FIRE
1. Stop wasting money on paid AI tools. Get Google's FREE AI arsenal (Gemini, NotebookLM, and AI Studio) that rivals paid competitors
2. How to build your "AI employees" (and stop being like an intern). A full guide to build specialized AI employees that actually do your work.
3. 24 AI secrets most people miss. Are you making these mistakes? Specific methods to make AI a true personal assistant for your work and life.
4. This "Data Library" 5-step process can make you real money. Not complex businesses. You can make money by selling simple data.
TODAY IN AI
AI HIGHLIGHTS
🤖 OpenAI and Anthropic are hiring a new breed of AI specialist: FDEs, or forward deployed engineers (demand for this role is up 800% in 2025!). If you’re still in the AI field, it’s one of the best options, here.
📊 Bonus: Here are 16 charts that break down the AI boom (with detailed explanation)
🖼️ Gemini now creates custom presentations with a single prompt & your own notes. You can easily refine the slides using follow-up prompts. See how to use it here.
😇 Anthropic now gives its AI models retirement plans & exit interviews, and Sonnet 3.6 even shared its final wishes. Every version will be preserved forever. Say bye!
🚨 Tenable just exposed 7 major flaws in GPT-5 called HackedGPT. They enable silent data theft and even long‑term memory hijacking. It’s worth taking a look here.
🧠 Microsoft built a fake “Magentic Marketplace” to test real‑world AI agents. But even GPT‑5, GPT‑4o, and Gemini struggled hard. Here's what they found.
💰 AI Daily Fundraising: Giga secured $61M, led by Y Combinator & Redpoint Ventures, to expand its enterprise voice AI & focus on real-time customer support.
CERTIFIED GOOGLE AI COURSE PICK
Free Course: Generative AI Leader Professional Certificate
Built for business leaders, not just techies
Covers the real-world gen AI stack from tools to strategy
Teaches how to build gen AI agents using Google tools (Gemini, NotebookLM, AI Studio)
Used by employees at P&G, Capgemini, L'Oréal, and more
Get a Google Cloud certificate to prove your skills
Insider Tip: 69 Q&As for the Generative AI Leader exam with explanations.
👉 Start the course or bookmark it
P.S. This course is one of the most practical gen AI foundations (and the cert is legit & powerful). Try to pass the exam, or reply to this email if you want us to help (no cost).
AI CHART
You’ve probably heard that AI is getting cheaper. But prices are actually collapsing. For the most powerful models today (GPT-4.5-level and up), the price is falling by roughly 900× per year:
Top-tier LLMs (GPT-4.5+ on PhD-level tasks): From ~$10 per million tokens in 2022 → ~$0.01 by Q4 2025 → 900× price drop/year
Mid-tier models (strong science reasoning): ~$1 → ~$0.01 → 40× drop/year
Basic models (simple tasks): Already cheap, still fell 9× per year
It’s plotted on a log scale, and it still looks like a freefall. Call it “Moore’s Law meets Jevons Paradox”: when something becomes more efficient, we use more of it. Google says even 7-year-old TPUs are running at 100% utilization.
New use cases pop up faster than we can ship hardware, because now everything can be an LLM use case.
The same thing happened with transistors: at $0.000000001 each, they became cheap enough for disposable sensors in shipping tags. Now swap “transistor” for “token”, and here we are.
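To make the “× per year” figures above a bit more concrete, here is a minimal Python sketch of how a constant per-year drop factor relates to a start price and an end price. The function names and the rounded 3-year window (2022 to late 2025) are our own assumptions for illustration and are not taken from the chart. The chart's per-year rates may be measured over different windows, so this simple endpoint arithmetic is not expected to reproduce them.

```python
def annualized_drop(start_price: float, end_price: float, years: float) -> float:
    """Constant per-year factor f such that start_price / f**years == end_price."""
    if start_price <= 0 or end_price <= 0 or years <= 0:
        raise ValueError("prices and years must be positive")
    return (start_price / end_price) ** (1.0 / years)


def price_after(start_price: float, per_year_factor: float, years: float) -> float:
    """Price after `years` if it is divided by `per_year_factor` every year."""
    if start_price <= 0 or per_year_factor <= 0:
        raise ValueError("price and factor must be positive")
    return start_price / per_year_factor ** years


if __name__ == "__main__":
    # Hypothetical inputs: $10 -> $0.01 per million tokens over an assumed 3 years.
    factor = annualized_drop(10.0, 0.01, 3.0)
    print(f"Implied constant drop: {factor:.1f}x per year")

    # Hypothetical inputs: a 9x-per-year drop applied to a $10 price for 2 years.
    print(f"Price after 2 years: ${price_after(10.0, 9.0, 2.0):.3f} per million tokens")
```

The takeaway is that per-year factors compound: even the “slow” 9× rate turns a price into roughly one eightieth of itself in two years.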
We read your emails, comments, and poll replies daily
How would you rate today’s newsletter? Your feedback helps us create the best newsletter possible.
Hit reply and say Hello – we'd love to hear from you!
Like what you're reading? Forward it to friends, and they can sign up here.
Cheers,
The AI Fire Team






