// category

Model Releases

New frontier models, benchmark drops, and side-by-side capability comparisons.

Frontier model releases — GPT, Claude, Gemini, Grok, DeepSeek, Llama, Qwen, and the rest. This category covers announcements, benchmark results, breaking API changes, and the head-to-heads that actually matter for coding workloads, not just marketing blogposts.

34 stories · last 90 days

Anthropic releases a new Opus model amid Mythos Preview buzz

Anthropic released Claude Opus 4.7, its most capable generally available model, focused on software engineering, image analysis, and instruction following. The company noted Opus 4.7 does not advance its capability frontier, as the separately released Mythos Preview — currently limited to partner...

The Verge - AI · 2026-04-17

Gemini 3.1 Flash TTS

Google released Gemini 3.1 Flash, a text-to-speech model. Simon Willison published notes and a tool interface for the new model.

Simon Willison · 2026-04-16

Gemini 3.1 Flash TTS

Google released Gemini 3.1 Flash TTS, a text-to-speech model available via the Gemini API that generates audio from text prompts and supports detailed voice direction including accents, tone, and delivery style.

Simon Willison · 2026-04-16

Seedance 2.0 Video Generation on AI Gateway

ByteDance's Seedance 2.0 video generation model is now available via Vercel's AI Gateway in Standard and Fast variants, supporting text-to-video, image-to-video, and multimodal reference-to-video generation with synchronized audio and video editing capabilities.

Vercel Blog · 2026-04-16

Trusted access for the next era of cyber defense

OpenAI released GPT-5.4-Cyber, a model variant fine-tuned for defensive cybersecurity work, and expanded its Trusted Access for Cyber program allowing identity-verified users reduced-friction access to security tools via government ID verification through Persona.

Simon Willison · 2026-04-15

Gemma 4 audio with MLX

Google's Gemma 4 E2B model can transcribe audio files on macOS using MLX and mlx-vlm via a uv command-line recipe, as demonstrated on a 14-second voice memo that was substantially transcribed with minor errors.

Simon Willison · 2026-04-13

GLM-5.1: Towards Long-Horizon Tasks

Z.ai released GLM-5.1, a 754-billion-parameter open-source AI model available under MIT license. The model demonstrated the ability to generate SVG graphics with CSS animations and to debug and fix code issues when given follow-up instructions.

Simon Willison · 2026-04-08

GLM 5.1 on AI Gateway

Z.ai's GLM 5.1 model is now available on Vercel's AI Gateway, supporting long-horizon autonomous tasks including multi-step coding workflows, conversation, creative writing, and document generation.

Vercel Blog · 2026-04-08

Opus 4.6 Fast Mode available on AI Gateway

Vercel made Fast Mode available for Claude Opus 4.6 on AI Gateway, offering 2.5x faster output token speeds at 6x standard pricing. The experimental feature is designed for human-in-the-loop workflows and coding tasks.

Vercel Blog · 2026-04-08

Anthropic’s Claude Mythos is now available, but not for you

Anthropic released Claude Mythos Preview, a new frontier AI model, to about 50 select partners including Amazon, Apple, and Microsoft through Project Glasswing for defensive cybersecurity work. The model scores 83.1% on vulnerability analysis benchmarks, compared to 66.6% for the previous flagshi...

The New Stack · 2026-04-08

Claude Mythos 5 and the 10T-Parameter AI Shift

Anthropic is testing an early-access model called Claude Mythos after a March 2026 data leak confirmed its existence; the company has not publicly launched it but described it as a significant capability increase, with potential applications in code, cybersecurity, and enterprise operations.

Dev.to - Claude · 2026-04-07

Gemini 2.0 vs GPT-5 vs Claude 4: The Spring 2026 AI Model Rankings

Google Gemini 2.0, OpenAI GPT-5.3, and Anthropic Claude 4.6 were compared across coding and reasoning benchmarks, with GPT-5.3-Codex scoring 56.8% on SWE-Bench Pro, Claude Opus 4.6 at 55%, and Gemini 2.0 at 52%, while Claude led on multi-step reasoning tasks with 95.2% on GSM8K mathematical bench...

Dev.to - Claude · 2026-04-06

claude-opus-4-vs-gpt-5

Claude Opus 4 scores higher on coding benchmarks (76.8% vs 71.8% on SWE-bench) and offers a larger 200K-token context window, while GPT-5 excels at reasoning tasks and costs less ($10-30 per 1M tokens vs $15-75). Both are available at $20/month subscription tier.

Dev.to - Claude · 2026-04-05

Try Grok 4.20 on AI Gateway

xAI's Grok 4.20 model is now available on Vercel's AI Gateway in three variants: Reasoning, Non-Reasoning, and Multi-Agent. Developers can access the models through Vercel's AI SDK with specified model identifiers.

Vercel Blog · 2026-04-03

GLM 5V Turbo on AI Gateway

Vercel AI Gateway now offers GLM 5V Turbo, a multimodal model from Z.ai that converts screenshots and designs into code and can navigate graphical interfaces. The model is accessible via the AI SDK using the identifier zai/glm-5v-turbo.

Vercel Blog · 2026-04-03

Qwen 3.6 Plus on AI Gateway

Alibaba's Qwen 3.6 Plus model is now available on Vercel's AI Gateway. The model features a 1M context window and improved agentic coding capabilities, multimodal reasoning, and tool-calling performance compared to Qwen 3.5 Plus.

Vercel Blog · 2026-04-03

Inception Mercury 2 is live on AI Gateway

Inception's Mercury 2 model is now available on Vercel's AI Gateway. The model is designed for low-latency applications including agentic loops, coding assistants, and voice interfaces.

Vercel Blog · 2026-04-03

MiniMax M2.7 is live on AI Gateway

Vercel made MiniMax M2.7 available on its AI Gateway in standard and high-speed variants. The high-speed version delivers the same performance at 2x cost with ~100 tokens per second latency.

Vercel Blog · 2026-04-03

Gemma 4 on AI Gateway

Google's Gemma 4 models—a 26B mixture-of-experts variant and a 31B dense variant—are now available on Vercel's AI Gateway. Both models support function-calling, vision capabilities, 256K context windows, and 140+ languages.

Vercel Blog · 2026-04-03

llm-gemini 0.30

llm-gemini version 0.30 added three new models: gemini-3.1-flash-lite-preview, gemma-4-26b-a4b-it, and gemma-4-31b-it.

Simon Willison · 2026-04-03

Want this in your inbox?

Daily digest covering every category above.