Google I/O 2026: Gemini 3.5 Flash Delivers Frontier Intelligence at Speed

Google announced Gemini 3.5 Flash at I/O 2026 on May 19, positioning it as the first model in a new series that combines frontier-class intelligence with the speed and cost profile of the Flash lineup.

Performance and availability:

• Outperforms Gemini 3.1 Pro across coding, agentic task execution, and multimodal reasoning benchmarks

• Operates at Flash-tier speed and pricing, closing the traditional trade-off between intelligence and latency

• Delivers output at 4x the tokens-per-second of other frontier models at comparable intelligence levels

• First frontier-class model viable for real-time agentic applications without latency penalties

• Gemini 3.5 Flash is available immediately via the Gemini API; Gemini 3.5 Pro is in testing and will be available next month

The intelligence-at-speed combination directly challenges OpenAI's GPT-5 series and Anthropic's Claude Sonnet line for the enterprise agentic workload market, where latency is a practical adoption barrier, not just a benchmark metric.

A frontier-class model at Flash pricing democratizes access to high-capability AI for startups, developers, and organizations that have been priced out of frontier model deployments. The total addressable market for Gemini expands immediately.

Why It Matters: The latency-intelligence combination is purpose-built for multi-step agentic workflows where speed compounds across tool calls. Developers building agentic applications should evaluate Gemini 3.5 Flash as a primary model immediately.