Details
- In a demo video, Google AI Developers showcased Gemini 3 Flash handling complex function calling: the model sequenced the tasks needed to prepare ramen while reasoning across 100 ingredients and 100 tools simultaneously.
- Gemini 3 Flash, released December 17, 2025, combines Gemini 3 Pro's frontier reasoning with Flash-level speed, latency, and cost efficiency, outperforming Gemini 2.5 Pro on benchmarks like GPQA Diamond (90.4%) and SWE-bench Verified (78%).
- It excels in agentic workflows, multimodal understanding (text, audio, images, video), and coding, enabling near real-time analysis for tasks like video plans, data extraction, and interactive apps.
- It is now the default model in the Gemini app and Google Search's AI Mode, providing faster, more accurate responses worldwide at no extra cost while using 30% fewer tokens on average than Gemini 2.5 Pro.
- It is available to developers and enterprises via Vertex AI and Gemini CLI, powering applications at less than a quarter of Gemini 3 Pro's cost and with higher rate limits.
- Companies like Box, JetBrains, Figma, Salesforce, and Workday report breakthroughs in accuracy for extraction tasks, handwriting recognition, and complex data processing.
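
The tool-sequencing pattern described in the ramen demo can be illustrated with a minimal sketch. This is not Google's demo code and does not call any Gemini API: the tool names, the `run_plan` helper, and the hard-coded `PLAN` (standing in for the ordered function calls a model would emit) are all hypothetical.

```python
from typing import Callable, Optional

# Hypothetical tools: each takes the current kitchen state and returns a new one.
def boil_water(state: dict) -> dict:
    return {**state, "water": "boiling"}

def cook_noodles(state: dict) -> dict:
    # Depends on boil_water having run first.
    if state.get("water") != "boiling":
        raise RuntimeError("noodles need boiling water")
    return {**state, "noodles": "cooked"}

def assemble_bowl(state: dict) -> dict:
    # Depends on cook_noodles having run first.
    if state.get("noodles") != "cooked":
        raise RuntimeError("bowl needs cooked noodles")
    return {**state, "bowl": "ready"}

# Registry mapping the names a model would emit to callable tools.
TOOLS: dict[str, Callable[[dict], dict]] = {
    "boil_water": boil_water,
    "cook_noodles": cook_noodles,
    "assemble_bowl": assemble_bowl,
}

def run_plan(plan: list[str], state: Optional[dict] = None) -> dict:
    """Execute tool calls in order, threading the state through each call."""
    state = dict(state or {})
    for name in plan:
        tool = TOOLS.get(name)
        if tool is None:
            raise KeyError(f"unknown tool: {name}")
        state = tool(state)
    return state

# Assumed stand-in for a model's ordered function calls.
PLAN = ["boil_water", "cook_noodles", "assemble_bowl"]
result = run_plan(PLAN)
```

The point of the sketch is that ordering matters: calling `cook_noodles` before `boil_water` fails, which is the kind of dependency a model must respect when it selects among many tools.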
Impact
Google's Gemini 3 Flash demo underscores its edge in agentic AI: the model handles intricate tool orchestration, such as the ramen task, more efficiently than its predecessors, which puts pressure on rivals like OpenAI's o1 series and Anthropic's Claude to match real-time reasoning at lower cost. By making Gemini 3 Flash the default in the Gemini app and Search, Google accelerates mainstream adoption, lowering barriers for everyday users and developers while bringing frontier capabilities into billions of interactions. The model's efficiency, three times faster than Gemini 2.5 Pro with comparable PhD-level performance, shifts market dynamics toward scalable agentic systems, easing GPU bottlenecks and enabling on-device-like responsiveness in cloud workflows. Over the next 12 to 24 months, it could redirect R&D toward hybrid speed-intelligence models and boost enterprise funding for AI agents in coding, analysis, and planning, in line with broader trends in multimodal safety and real-time search.