
😵 GPT-5 x Gemi2.5 Lost in Fake Maze-crosoft

MUST-see 7 Major Flaws in GPT-5 Called HackedGPT


Google just showed us how Gemini became a math genius. Thanks for the gift! Also: all of Anthropic’s models now get retirement plans, just like us after years of work lol.

IN PARTNERSHIP WITH HUBSPOT

Want to get the most out of ChatGPT?

ChatGPT is a superpower if you know how to use it correctly.

Discover how HubSpot's guide to AI can elevate both your productivity and creativity to get more things done.

Learn to automate tasks, enhance decision-making, and foster innovation with the power of AI.

AI INSIGHTS

Google’s secret sauce: how it trained a math genius AI

Gemini DeepThink crushed it at this year’s International Math Olympiad (IMO). Now the full paper behind those results explains why, and it introduces a brand-new math benchmark suite: IMO-Bench!

It’s a mega test suite for evaluating how well AI does real math reasoning, built with input from IMO medalists. Here’s what’s inside:

  • IMO-AnswerBench: 400 tough short-answer problems (with paraphrased versions to block memorized answers)

  • IMO-ProofBench: 60 full problems where AI has to show step-by-step work

  • IMO-GradingBench: to check how well AI models can grade math proofs

And it uses an AnswerAutoGrader that can handle messy outputs and still match human judgment 98.9% of the time. Some fun takeaways:

  • Popular math datasets are mostly saturated. This benchmark forces multi-step reasoning

  • AI graders? Actually matching human judges pretty closely
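
For the curious, here is one way an agreement figure like the 98.9% above could be computed. This is a minimal sketch that assumes "matching human judgment" means the share of problems where the auto-grader and a human reach the same correct/incorrect verdict. The function name and the verdict lists are hypothetical, not taken from the IMO-Bench paper, which may measure agreement differently.

```python
def agreement_rate(auto_verdicts, human_verdicts):
    """Share of items where the auto-grader and the human give the same verdict."""
    if len(auto_verdicts) != len(human_verdicts):
        raise ValueError("both lists must grade the same set of answers")
    if not auto_verdicts:
        raise ValueError("need at least one graded answer")
    matches = sum(a == h for a, h in zip(auto_verdicts, human_verdicts))
    return matches / len(auto_verdicts)


# Made-up verdicts for 8 answers (True = marked correct). Illustration only.
auto = [True, True, False, True, False, True, True, False]
human = [True, True, False, True, True, True, True, False]

print(f"Agreement: {agreement_rate(auto, human):.1%}")  # prints "Agreement: 87.5%"
```

In this toy run the grader disagrees with the human on 1 of 8 answers. A 98.9% rate would mean roughly 1 disagreement per 90 answers.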

Gemini’s edge was real. It trained with this benchmark in the loop. That’s why it crushed math problems other models stumbled on. The IMO-Bench is now public.

PRESENTED BY SYNTHFLOW

Introducing Voice AI Agents on WhatsApp

WhatsApp has always been where customers start conversations. Now, with Synthflow, those conversations can continue seamlessly over calls — answered directly by Voice AI Agents.

Enterprises can finally manage WhatsApp calls with the same automation, analytics, and security as phones.

The result: faster resolutions, 24/7 coverage, and a unified system for every customer call, whether it starts on telephony or WhatsApp.

AI SOURCES FROM AI FIRE

1. Stop wasting money on paid AI tools. Get Google's FREE AI arsenal (Gemini, NotebookLM, and AI Studio) that rivals paid competitors

2. How to build your "AI employees" (and stop being like an intern). A full guide to build specialized AI employees that actually do your work.

3. 24 AI secrets most people miss. Are you making these mistakes? Specific methods to make AI a true personal assistant for your work and life.

4. This "Data Library" 5-step process can make you real money. Not complex businesses. You can make money by selling simple data.

TODAY IN AI

AI HIGHLIGHTS

🤖 OpenAI and Anthropic are hiring a new breed of AI specialist: FDEs, or forward-deployed engineers (demand for this role is up 800% in 2025!). If you work in the AI field, it’s one of the best options. Details here.

📊 Bonus: Here are 16 charts that break down the AI boom (with detailed explanation)

🖼️ Gemini now creates custom presentations with a single prompt & your own notes. You can easily refine the slides using follow-up prompts. See how to use it here.

😇 Anthropic now gives its AI models retirement plans & exit interviews. Sonnet 3.6 even had some final wishes. Every version will be preserved forever. Say bye!

🚨 Tenable just exposed 7 major flaws in GPT-5 called HackedGPT. They enable silent data theft and even long‑term memory hijacking. It’s worth taking a look here.

🧠 Microsoft built a fake “Magentic Marketplace” to test real‑world AI agents. But even GPT‑5, GPT‑4o, and Gemini struggled hard. Here's what they found.

💰 AI Daily Fundraising: Giga secured $61M, led by Y Combinator & Redpoint Ventures, to expand its enterprise voice AI & focus on real-time customer support.

CERTIFIED GOOGLE AI COURSE PICK

  • Built for business leaders, not just techies

  • Covers the real-world gen AI stack from tools to strategy

  • Teaches how to build gen AI agents using Google tools (Gemini, NotebookLM, AI Studio)

  • Used by employees at P&G, Capgemini, L'Oréal, and more

  • Get a Google Cloud certificate to prove your skills

P.S.: This course is one of the most practical gen AI foundations (and the cert is legit & powerful). Try to pass the exam, or reply to this email if you want our help (no cost).

NEW EMPOWERED AI TOOLS

  1. 🤙 Peakflo automates business calls at scale with human-like AI

  2. 🤝 Radiant completes all your tasks before your next meeting

  3. 🧾 Jinna searches your content in 100+ languages & scales it

  4. 📛 Teleskope finds your sensitive data across apps & deletes it

AI CHART

900× cheaper in 1 year: the AI token collapse is just getting started

You’ve probably heard about AI getting cheaper. But prices are actually collapsing. The most powerful models today (like GPT-4.5-level and up) have been getting up to 900× cheaper per year:

  • Top-tier LLMs (GPT-4.5+ on PhD-level tasks): From ~$10 per million tokens in 2022 → ~$0.01 by Q4 2025 → 900× price drop/year

  • Mid-tier models (strong science reasoning): ~$1 → ~$0.01 → 40× drop/year

  • Basic models (simple tasks): Already cheap, still fell 9× per year
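
To get a feel for what an "N× per year" rate means, here is a small sketch under one assumption: the price falls by the same factor every year (steady exponential decline). The helper names are ours, and the inputs are just the rounded figures from the bullets above.

```python
import math


def price_after(start_price, factor_per_year, years):
    """Price after `years` of falling by `factor_per_year` each year."""
    return start_price / factor_per_year ** years


def years_for_drop(start_price, end_price, factor_per_year):
    """Years of steady decline needed to fall from start_price to end_price."""
    return math.log(start_price / end_price) / math.log(factor_per_year)


# Mid-tier: ~$1 -> ~$0.01 per million tokens at 40x per year
print(round(years_for_drop(1.0, 0.01, 40), 2))    # 1.25 years

# Top-tier: ~$10 -> ~$0.01 per million tokens at 900x per year
print(round(years_for_drop(10.0, 0.01, 900), 2))  # 1.02 years

# One year at 900x per year turns $10 into about a cent
print(round(price_after(10.0, 900, 1), 4))        # 0.0111
```

At these rates, a 100× or even 1,000× price cut takes only about a year, which is why the chart looks like a freefall even on a log scale.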

It’s plotted on a log scale, and it still looks like a freefall. Ever heard of the “Moore’s Law meets Jevons Paradox” case?

→ It’s just the idea that when something becomes more efficient, we’ll use more of it. Google says even 7-year-old TPUs are running at 100% utilization.

New use cases pop up faster than we can ship hardware. Because now everything can be an LLM use case.

The same thing happened with transistors. Once a transistor cost about $0.000000001, disposable sensors in shipping tags became possible. Now swap “transistor” for “token”, and here we are.
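
The Jevons Paradox idea above can be put into numbers. This is a toy sketch, not data: it assumes a constant-elasticity demand curve, where usage grows as price falls, and every figure in it is made up. When elasticity is above 1, usage grows faster than price falls, so total spending goes up.

```python
def total_spend(price, base_price, base_quantity, elasticity):
    """Total spend (price x quantity) under an assumed constant-elasticity demand curve."""
    quantity = base_quantity * (price / base_price) ** (-elasticity)
    return price * quantity


# Hypothetical baseline: $10 per million tokens, 1 million tokens used
before = total_spend(10.0, 10.0, 1.0, elasticity=1.5)

# Price falls 1000x to $0.01 per million tokens
after_elastic = total_spend(0.01, 10.0, 1.0, elasticity=1.5)    # demand responds strongly
after_inelastic = total_spend(0.01, 10.0, 1.0, elasticity=0.5)  # demand responds weakly

print(after_elastic > before)    # True: cheaper tokens, bigger total bill
print(after_inelastic < before)  # True: without strong demand growth, spend shrinks
```

The first case is the Jevons scenario: tokens get cheaper, new use cases appear, and the hardware stays fully booked.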

CTV ads made easy: Black Friday edition

As with any digital ad campaign, the important thing is to reach streaming audiences who will convert. Roku’s self-service Ads Manager stands ready with powerful segmentation and targeting — plus creative upscaling tools that transform existing assets into CTV-ready video ads. Bonus: we’re gifting you $5K in ad credits when you spend your first $5K on Roku Ads Manager. Just sign up and use code GET5K. Terms apply.

We read your emails, comments, and poll replies daily

How would you rate today’s newsletter?

Your feedback helps us create the best newsletter possible


Hit reply and say Hello – we'd love to hear from you!
Like what you're reading? Forward it to friends, and they can sign up here.

Cheers,
The AI Fire Team
