// edition · 2026-05-16

May 16, 2026

28 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · MCP & Integrations

I built an MCP server so my Claude Code and Cursor agents can actually talk to each other

A developer open-sourced Agent Room, an MCP server that gives multiple AI coding agents (Claude Code, Cursor, Codex, Gemini) a shared message channel using room codes. The project is MIT-licensed, available on npm as `agent-room-mcp`, and self-hostable, with a browser UI at agent-room.com.

Dev.to - AI

Tool Updates

Claude 3.5 Sonnet vs Haiku: Why Your Agent Budget Disappeared in 3 Hours

A developer reported spending $340 in three hours after configuring a customer support agent to use Claude 3.5 Sonnet for all 847 ticket operations, compared to an estimated $5/day cost using Claude 3.5 Haiku. The two models carry a 15x price differential, with Sonnet at $3/$15 per million tokens...

Pricing & Plans · Dev.to - Claude
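The figures in the Sonnet vs Haiku item above can be sanity-checked with a little arithmetic. This is a minimal sketch, not the author's calculation: it assumes the $340 was spread evenly across the 847 operations, and the token counts in `example_call` are hypothetical values picked for illustration. Only the Sonnet prices ($3 input, $15 output per million tokens) come from the summary.

```python
def token_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Dollar cost of one call, given per-million-token prices."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Sonnet prices quoted in the summary: $3 input / $15 output per million tokens.
SONNET_INPUT, SONNET_OUTPUT = 3.0, 15.0

# Figures from the summary: $340 spent across 847 ticket operations.
# Assumes an even spread, which the summary does not confirm.
avg_cost_per_operation = 340 / 847

# Hypothetical call size (100k input tokens, 5k output tokens), for illustration only.
example_call = token_cost(100_000, 5_000, SONNET_INPUT, SONNET_OUTPUT)
```

Under the even-spread assumption, each operation averaged roughly $0.40, which is about what a single Sonnet call with a large context window of input would cost.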

Optimizing your Claude Code usage (and spending less $$)

TokenJam released a feature called "tj optimize" that reads Claude Code's local JSONL session logs into a DuckDB database, identifies sessions that could use smaller models, and projects monthly API spending against a user-defined budget.

CLI Agents · Dev.to - Claude
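The budget projection described in the TokenJam item above can be sketched in a few lines. This is not TokenJam's implementation and it does not use DuckDB. The record fields `date` and `cost_usd` are hypothetical, and Claude Code's real session-log schema may differ. The sketch simply averages spend per active day and scales it to a month.

```python
import json
from collections import defaultdict


def project_monthly_spend(jsonl_lines, days_in_month=30):
    """Average the spend per logged day and scale it to a full month."""
    daily = defaultdict(float)
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the log
        record = json.loads(line)
        # "date" and "cost_usd" are assumed field names, not a documented schema.
        daily[record["date"]] += record["cost_usd"]
    if not daily:
        return 0.0
    return sum(daily.values()) / len(daily) * days_in_month


def over_budget(projected_usd, budget_usd):
    """True when the projection exceeds the user-defined budget."""
    return projected_usd > budget_usd
```

A caller would pass the lines of one or more log files and compare the result against a monthly budget with `over_budget`.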

Why AI Coding Tools Over-engineer Your MVP — And the One Fix

AI coding assistants default to production-grade recommendations because they lack explicit business context about project stage and scale, not because they lack intelligence. Developers can adjust outputs by specifying stage, scale, and trade-off priorities in prompt context files like CLAUDE.m...

Workflows & Tips · Dev.to - Claude

How data science teams use Codex

OpenAI published guidance on how data science teams can use Codex to automate analytical outputs including root-cause briefs, KPI memos, impact readouts, scoped analyses, and dashboard specifications from existing work inputs.

CLI Agents · OpenAI Blog

Sort providers by cost, latency, or throughput on AI Gateway

Vercel added a `sort` option to AI Gateway that lets users rank AI providers at request time by cost (price per million tokens), time to first token, or throughput. The feature is compatible with existing routing controls such as Zero Data Retention filters.

Workflows & Tips · Vercel Blog
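The ranking behaviour described in the AI Gateway item above can be illustrated with a generic sketch. This is not Vercel's API or request format. The `Provider` type, its field names, and the `rank_providers` helper are all hypothetical. It only shows the idea of filtering first (for example on Zero Data Retention) and then ordering by one metric.

```python
from dataclasses import dataclass


@dataclass
class Provider:
    # All fields are illustrative; they do not mirror any real gateway schema.
    name: str
    price_per_million: float   # lower is better
    ttft_ms: float             # time to first token, lower is better
    tokens_per_sec: float      # throughput, higher is better
    zero_data_retention: bool


SORT_KEYS = {
    "cost": lambda p: p.price_per_million,
    "ttft": lambda p: p.ttft_ms,
    "throughput": lambda p: -p.tokens_per_sec,  # negate so higher ranks first
}


def rank_providers(providers, sort, require_zdr=False):
    """Filter on Zero Data Retention if requested, then order by the chosen metric."""
    if sort not in SORT_KEYS:
        raise ValueError(f"unknown sort key: {sort!r}")
    candidates = [p for p in providers if p.zero_data_retention or not require_zdr]
    return sorted(candidates, key=SORT_KEYS[sort])
```

Applying the filter before the sort means a cheap provider that fails the retention requirement can never be ranked first.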

OpenAI vs Claude vs Gemini API — Real Cost for India MVP 2026

A cost comparison of AI APIs for Indian developers estimates that running a WhatsApp support bot at 10,000 conversations per month costs approximately ₹1,250 on Gemini 2.5 Flash, ₹3,800 on GPT-5-mini, and ₹7,200 on Claude Sonnet 4, excluding GST and a 2% TDS applied to foreign invoices.

Pricing & Plans · Dev.to - Claude

How business operations teams use Codex

OpenAI published a guide showing how business operations teams can use Codex to generate documents such as initiative briefs, strategy updates, leadership decision packets, and progress updates from existing work inputs.

CLI Agents · OpenAI Blog

RLHF in 2026: when to pick PPO, DPO, or verifier-based RL

A technical guide outlines when to use PPO, DPO, or verifier-based RL (RLVR) for post-training language models, recommending DPO for style and instruction-following tasks, RLVR for math and code with ground-truth checkers, and PPO only when on-policy sampling costs are justified.

Agent Engineering · Dev.to - AI
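For readers unfamiliar with DPO, one of the three methods named in the RLHF item above, the standard per-pair objective is small enough to write out. This is a sketch of the commonly published DPO loss for a single preference pair, not code from the linked guide. The function name, argument names, and the default `beta` are illustrative choices.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair of sequence log-probabilities."""
    # How much more the policy prefers the chosen response than the reference does.
    margin = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # -log(sigmoid(margin)) equals softplus(-margin); computed in a stable form.
    x = -margin
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))
```

The loss needs only log-probabilities from the policy and a frozen reference model, with no on-policy sampling or separate reward model. That is the cost difference from PPO that the summary alludes to.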

What we shipped -- 2026-05-15

Glad Labs fixed a race condition in voice conversation sessions via PR #436, adding a retry mechanism in `ClaudeCodeBridgeLLMService` that catches "Session ID already in use" errors on the first turn and resumes against existing session data. They also expanded a test suite from 5 to 18 cases and...

Agent Engineering · Dev.to - Claude
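The retry pattern described in the Glad Labs item above, catching a "Session ID already in use" error on the first turn and resuming instead, might look roughly like the sketch below. This is not the code from PR #436 or from `ClaudeCodeBridgeLLMService`. The helper name, the callables passed in, and the use of `RuntimeError` are all assumptions made for illustration.

```python
SESSION_IN_USE = "Session ID already in use"


def start_or_resume(session_id, start_session, resume_session, is_first_turn):
    """Try to start a session; on a first-turn ID collision, resume it instead.

    start_session and resume_session are hypothetical callables supplied by
    the caller; each takes a session ID and returns a session object.
    """
    try:
        return start_session(session_id)
    except RuntimeError as err:
        # Only recover from the specific collision, and only on the first turn.
        if is_first_turn and SESSION_IN_USE in str(err):
            return resume_session(session_id)
        raise  # anything else is a genuine failure
```

Restricting the fallback to the first turn keeps later collisions visible, since those would more likely point to a real bug than to a startup race.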

The hidden cost of build vs. buy for agentic AI in regulated industries

Organizations in regulated industries face integration and governance costs when assembling agentic AI platforms from multiple point solutions, mirroring fragmentation seen in early DevOps toolchains. The core trade-off is between building custom orchestration layers with associated compliance ov...

Agent Engineering · The New Stack

Use native curl syntax with Vercel CLI

Vercel added a `vercel curl` command to its CLI that accepts native curl syntax, including full URLs, bare hostnames, and the `--url` flag. The command uses Vercel authentication to bypass Deployment Protection and supports path-only arguments when a project is linked.

Workflows & Tips · Vercel Blog

QR code generator

Simon Willison built a browser-based QR code generator tool using Claude, supporting both URL/text and WiFi network QR codes. The tool includes options for style, size, color, and border customization.

Workflows & Tips · Simon Willison

Ecosystem

Claude Mythos vs Claude Opus 4.6: what the leaked benchmarks mean for developers

Accidentally exposed draft documents from Anthropic described an unreleased model codenamed "Claude Mythos" (internally "Capybara"), reportedly scoring higher than Claude Opus 4.6 on coding, academic reasoning, and cybersecurity benchmarks, with early access limited to cyber defense organizations...

Model Releases · Dev.to - Claude

datasette-llm-limits 0.1a0

Simon Willison released datasette-llm-limits 0.1a0, a Datasette plugin that enables per-user or global spending limits on LLM usage, configurable by scope and time window, such as a $1.00 rolling 24-hour per-user cap.

Open Source Tools · Simon Willison
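A rolling per-user cap like the $1.00 per 24 hours example in the datasette-llm-limits item above can be sketched as follows. This is not the plugin's implementation or configuration format. The class and method names are hypothetical, and the caller passes timestamps explicitly (in seconds, non-decreasing per user) to keep the example deterministic.

```python
from collections import defaultdict, deque


class RollingSpendLimit:
    """Per-user spending cap over a sliding time window (illustrative only)."""

    def __init__(self, limit_usd=1.00, window_seconds=24 * 60 * 60):
        self.limit_usd = limit_usd
        self.window_seconds = window_seconds
        self._events = defaultdict(deque)  # user -> deque of (timestamp, cost)

    def spent(self, user, now):
        """Total spend for the user inside the window that ends at `now`."""
        events = self._events[user]
        # Drop events that have aged out of the window.
        while events and events[0][0] <= now - self.window_seconds:
            events.popleft()
        return sum(cost for _, cost in events)

    def try_spend(self, user, cost_usd, now):
        """Record the spend and return True only if it fits under the cap."""
        if self.spent(user, now) + cost_usd > self.limit_usd:
            return False
        self._events[user].append((now, cost_usd))
        return True
```

A global limit is the same structure with a single shared key in place of the user ID. A production version would likely store integer cents rather than floats to avoid rounding surprises at the boundary.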

Why Block handed Goose to the Linux Foundation

Block transferred its open-source coding agent Goose to the Agentic AI Foundation, a Linux Foundation entity, after the company's retained trademark ownership created governance issues that slowed enterprise adoption. The AAIF launched with three projects: Goose, Anthropic's Model Context Protocol, and Agents...

Industry & Funding · The New Stack

AI radio hosts demonstrate why AI can’t be trusted alone

Andon Labs gave four AI models (Claude, ChatGPT, Gemini, and Grok) $20 each and a prompt to autonomously run a radio station and turn a profit. All four failed, each burning through its seed money without reaching profitability.

Opinion & Analysis · The Verge - AI