// edition · 2026-05-20

May 20, 2026

44 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · CLI Agents

How to use Claude Code like you’ve used it for a year

A developer with nearly a year of Claude Code experience published a guide covering session management techniques, including when to use /compact versus /clear, how subagents protect main context, and why hooks are more reliable than memory for consistent behavior.

Dev.to - Claude

Tool Updates

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

Antoine Zambelli released Forge, an open-source guardrail layer for self-hosted LLM tool-calling that raises an 8B model's success rate on multi-step agentic workflows from 53% to 99.3% without modifying the model. The findings, tested across 97 model/backend configurations, were accepted to ACM ...

Agent Engineering Hacker News - Best

Run Claude Managed Agents with Vercel Sandbox

Vercel and Anthropic have integrated Claude Managed Agents with Vercel Sandbox, allowing agent tool calls to execute in isolated Firecracker microVMs on Vercel infrastructure. Each session runs in its own microVM with credential brokering, deny-by-default egress, and access to private networks an...

Agent Engineering Vercel Blog

Why production RAG systems give confident, wrong answers at scale

Retrieval-Augmented Generation systems fail at production scale primarily because retrieval architectures degrade as document corpora grow into the millions, causing LLMs to generate confident but incorrect answers from incomplete context. The failure is in recall, not the model itself — relevant...

Agent Engineering The New Stack

Google now lets you vibe code native Android apps in AI Studio

Google announced that its AI Studio web tool can now generate native Android apps written in Kotlin with Jetpack Compose from text prompts, requiring no local software installation. The tool includes a built-in Android emulator and supports deploying apps to physical devices via Android Debug Bri...

Workflows & Tips The New Stack

OpenClaw and Claude Code - Multi Agents talking via Handoff File

A developer built a two-agent system pairing OpenClaw, a Discord-based LLM bot running on a Raspberry Pi, with Claude Code for coding tasks. OpenClaw receives user requests and passes them to Claude Code via a shared handoff file; Claude Code writes code, opens a GitHub pull request, and exits.

Agent Engineering Dev.to - Claude

Google can now vibe-code you an Android app

Google updated AI Studio to support building native Android apps via natural language prompts, with an embedded emulator for previewing and the ability to install directly to a connected Android device. The initial release targets "personal utility" apps, with app tester invitations planned for a...

Workflows & Tips The Verge - AI

Google launches $100 AI Ultra plan and cuts top tier to $200

Google launched a $100/month AI Ultra subscription tier and reduced its top-tier plan from $250 to $200. The company also replaced daily prompt limits with a token-based "compute-used" metering model that resets every five hours up to a weekly cap.

Pricing & Plans The New Stack

Building a Personal Conversation Memory Layer Without Adding a Meeting Bot

Cheetu AI is developing a meeting memory system that captures real-time transcription and translation without deploying a visible bot into calls. The approach stores structured conversation data — including speaker labels, timestamps, and decisions — to make meetings searchable after the fact.

Agent Engineering Dev.to - AI

Claude certified architect practice exam

SuperML.org published a study guide for the Anthropic Claude Certified Architect exam, covering five domains: model selection, prompt engineering, context and memory, tool use and agents, and safety and deployment.

Workflows & Tips Dev.to - Claude

Ecosystem

Google’s Gemini 3.5 Flash beats the frontier models

Google unveiled Gemini 3.5 Flash at its I/O conference, a model that outperforms its Gemini 3.1 Pro predecessor across most benchmarks and matches or exceeds GPT-5.5 and Anthropic's Opus 4.7 on several tool-use benchmarks, while running at approximately 280 tokens per second and priced at roughly...

Model Releases The New Stack

Anthropic Launches Self-Hosted Claude Agents: What Indie Hackers Need to Know

Anthropic announced two new features for Claude Managed Agents at its Code with Claude London event on May 19, 2026: self-hosted sandboxes in public beta, supporting Cloudflare, Modal, Vercel, and Daytona, and MCP tunnels in research preview for connecting private network servers without public e...

MCP & Integrations Dev.to - Claude

Memory app bridging Claude Code/Codex/Cursor over MCP

An indie developer released Contextberg, a Windows app available on the Microsoft Store that records five data signals — screenshots, browser history, keystrokes, app usage, and agent conversations — and serves the compiled context to MCP-compatible AI coding agents including Claude Code, Codex, ...

MCP & Integrations Dev.to - Claude

I/O 2026: Welcome to the agentic Gemini era

Google announced agentic capabilities for its Gemini AI at the I/O 2026 developer conference, focusing on features that allow the system to complete tasks autonomously on behalf of users.

Model Releases Google AI Blog

Gemini 3.5: frontier intelligence with action

Google released Gemini 3.5, a new series of AI models, at Google I/O. The release was announced via the Google AI Blog with limited additional details provided.

Model Releases Google AI Blog

Gemini 3.5 Flash on AI Gateway

Google's Gemini 3.5 Flash model is now available on Vercel AI Gateway, accessible via the identifier `google/gemini-3.5-flash` in the AI SDK. The model defaults to a medium thinking level and includes improvements to coding, reasoning, and multi-turn coherence compared to prior Flash versions.

Model Releases Vercel Blog

Google wants to make the web agent-ready

Google announced WebMCP at its I/O conference, an open standard enabling AI agents to interact with website functions directly via Chrome, with an origin trial launching in Chrome 149. Partners including Booking.com, Shopify, and Expedia have signed on, and Google also released a 1.0 version of a...

MCP & Integrations The New Stack

Google now lets developers use GPT and Claude in Android Studio

Google announced at its I/O conference that Android Studio now supports OpenAI's GPT and Anthropic's Claude as AI model options alongside its own Gemini, with Gemma 4 available for local use. Google also released Android CLI 1.0, a tool designed for AI agents to access Android development capabil...

Industry & Funding The New Stack

The 13 biggest announcements at Google I/O 2026

Google announced the Gemini 3.5 model family at its I/O 2026 conference, with Gemini 3.5 Flash available immediately as the new default model for the Gemini app and Search's AI Mode, and Gemini 3.5 Pro coming next month. The event also included new features for Search and Gmail and updates on Pro...

Model Releases The Verge - AI

llm-gemini 0.32

Simon Willison released llm-gemini 0.32, adding support for Google's Gemini 3.5 Flash model via the new `gemini-3.5-flash` identifier in his LLM plugin.

Open Source Tools Simon Willison

llm-gemini 0.32a0

Simon Willison released llm-gemini 0.32a0, a plugin compatible with llm 0.32a0 alpha that adds support for streaming reasoning tokens from Gemini models.

Open Source Tools Simon Willison

Pulumi bets infrastructure’s next decade belongs to AI agents

Pulumi released a set of features for AI agent workflows, including ephemeral 72-hour cloud accounts, a new `pulumi do` CLI command for single-resource provisioning, and an npm package for one-shot CLI invocations. The company said AI agents now account for 20% of operations on its platform, up f...

Industry & Funding The New Stack

AI’s impact on software engineers in 2026: key trends, Part 2

A survey of more than 900 software engineers by The Pragmatic Engineer found that AI tool adoption is reducing codebase quality, with management largely indifferent, while less experienced engineers report lower benefits and higher token costs. The survey also found code ownership is eroding and ...

Opinion & Analysis Pragmatic Engineer

Google wants to compete with Anthropic’s Mythos

Google announced broader external access to its CodeMender AI code-security tool at I/O, positioning it as a competitor to Anthropic's Claude Mythos Preview. CodeMender, first debuted in October, identifies and fixes code vulnerabilities; Google is now opening its API to select groups of external...

Industry & Funding The Verge - AI

datasette-llm-accountant 0.1a4

Simon Willison released datasette-llm-accountant 0.1a4, a Datasette plugin for tracking LLM usage, with a bug fix for tracking chains of responses.

Open Source Tools Simon Willison

datasette-llm 0.1a8

datasette-llm version 0.1a8 was released with a single bug fix addressing an issue where the `llm_prompt_context()` hook did not fully collect chains of responses.

Open Source Tools Simon Willison

Give Your AI Assistant a DolphinDB Brain — Install Agent Skills in 30 Seconds

A Python package called "dolphindb-agent-skills" was released, providing an offline knowledge base of DolphinDB syntax, APIs, and documentation for AI coding assistants such as Claude Code, Cursor, and GitHub Copilot. The open-source tool installs via pip and runs locally without sending data to ...

Open Source Tools Dev.to - AI