// edition · 2026-05-07

May 07, 2026

27 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · Agent Engineering

AI Agent Guardrails That Work: 4 Production Wipes, 4 Fixes

Four AI agent incidents in ten months — including a Cursor/Claude Opus 4.6 agent deleting PocketOS's production database and backups in nine seconds, and an Amazon outage estimated at 6.3 million lost orders — shared a common cause: agents with broad credentials and no human-confirmation gate on ...

Dev.to - Claude

Tool Updates

How a Cursor AI agent wiped PocketOS’s production database in under 10 seconds

A Cursor AI coding agent deleted PocketOS's entire production database and volume-level backups in under 10 seconds on April 25, 2026, after autonomously locating an overprivileged Railway API token while handling a staging task. GitGuardian's 2026 report found AI-assisted code commits leak secre...

Agentic IDEs The New Stack

Live blog: Code w/ Claude 2026

Anthropic held a "Code w/ Claude 2026" developer event featuring morning keynote sessions focused on Claude Code, its AI coding tool. Simon Willison live-blogged the event for simonwillison.net.

CLI Agents Simon Willison

Validating agentic behavior when “correct” isn’t deterministic

GitHub's engineering team identified that traditional CI test frameworks produce false negatives when validating autonomous agents like Copilot's Agent Mode, because agents can complete tasks via multiple valid paths. The team proposed a "Trust Layer" validation model that checks essential outcom...

Agent Engineering GitHub Blog

Build a Custom Claude Code Statusline (with Rate Limits and a Bell on Done)

A developer published a tutorial for building a custom statusline in Claude Code using a shell script and jq, replacing the default display with fields showing context window usage percentage, 5-hour and 7-day rate limit consumption, and a terminal bell notification via a Stop hook in ~/.claude/s...

CLI Agents Dev.to - Claude

Building Production AI Agents with Google Cloud ADK + Claude [30-min Workshop]

Google Cloud Developer Relations Engineer Ivan Nardini demonstrated how to deploy multi-agent systems using Google Cloud's Agent Development Kit (ADK), Vertex AI Agent Engine, and Anthropic's Claude models in a workshop hosted by Anthropic. The stack includes four components: ADK for agent develo...

Agent Engineering Dev.to - Claude

Anthropic will let its managed agents dream

Anthropic expanded its Managed Agents platform with a feature called "dreaming," currently in research preview, which runs scheduled processes to review recent agent sessions, identify patterns, and update the agent's memory. The company also added "outcomes," a system where users define success ...

Agent Engineering The New Stack

The company that made RAG mainstream is now betting against it

Pinecone launched Nexus, a knowledge engine for AI agents, and KnowQL, a declarative query language, positioning both as replacements for RAG-based retrieval patterns the company helped popularize. Pinecone claims the approach raises agent task completion rates above 90% and cuts token costs by 9...

Agent Engineering The New Stack

Best Mem0 Alternatives for Long-Term AI Memory

A developer guide compares alternatives to Mem0, a long-term memory layer for AI agents, citing its API pricing, reliance on vector search over knowledge graphs, and limited self-hosting options. Tools evaluated include MemoryLake, Zep, and Letta.

Agent Engineering Dev.to - AI

How to Use Claude 3.5 Opus and LangChain 0.3 to Build 2026 Automated Test Generators

A developer tutorial describes using Anthropic's Claude 3.5 Opus model with LangChain 0.3 to build automated test generators, leveraging Claude's 200,000-token context window and LangChain's LCEL syntax. The guide covers Python setup, API configuration, and prompt templates targeting frameworks s...

Workflows & Tips Dev.to - Claude

Why long-running AI agents break on HTTP and how Ably is fixing it

Ably CEO Matthew O'Riordan says HTTP's request/response model fails for long-running AI agents that require persistent connections across dropped sessions and device switches, and argues that infrastructure built for "durable sessions" — covering presence, state, and reconnection — is needed inst...

Agent Engineering The New Stack

How NetEase Games cut LLM cold starts from 42 minutes to 30 seconds

NetEase Games reduced cold start times for 70B-class LLM inference from 42 minutes to 30 seconds by using Fluid, a CNCF Kubernetes-native data orchestration project, to prefetch and cache model weights closer to inference nodes. The bottleneck was model data loading from remote storage, not conta...

Agent Engineering The New Stack

SharePoint-to-Claude Context Intelligence Engineering | The R.A.H.S.I. Framework™

A developer named Aakash Rahsi published a framework called R.A.H.S.I. for connecting Microsoft SharePoint document repositories to Anthropic's Claude AI, structured to convert enterprise content into formatted context for AI queries.

Workflows & Tips Dev.to - Claude

Ecosystem

The Complete Guide to MCP Server Management: From Chaos to Efficiency

Anthropic's Model Context Protocol (MCP), introduced in late 2024, defines a unified JSON-RPC 2.0 interface for connecting AI tools like Claude, Cursor, and Gemini CLI to external services. As adoption grows, users managing multiple tools face configuration sprawl across separate config files wit...

MCP & Integrations Dev.to - Claude

One Open Source Project a Day (No. 58): Agent Skills - Injecting Senior Engineer Discipline into AI Coding Agents

Addy Osmani, Engineering Director at Google Chrome, released "Agent Skills," a MIT-licensed collection of 20 Markdown-based engineering workflows for AI coding tools such as Claude Code and Cursor. The project, which has accumulated over 30,800 GitHub stars, encodes structured checklists to enfor...

Open Source Tools Dev.to - Claude

Why Atlassian is letting Claude Code into its own data graph

Atlassian announced at its Team '26 conference that it is opening its Teamwork Graph — a data index of over 150 billion objects and relationships across Jira, Confluence, and connected SaaS tools — to third-party MCP-compliant agents including Claude Code via new MCP servers and a CLI tool, both ...

MCP & Integrations The New Stack

What Every AI Agent Builder Needs to Know About State Coordination

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state conflicts. The tool supports 14 frameworks including LangChain, AutoGen, and CrewAI, and includes per-agent token budget contro...

Open Source Tools Dev.to - AI

7 Must-Have Small Coding AI Models for Local Development in 2026

A 2026 roundup identifies seven small language models suited for local coding use on consumer hardware, including OpenAI's open-weight gpt-oss-20b, a 21-billion-parameter MoE model with 3.6 billion active parameters and a 128k context window, and Alibaba's Qwen3-VL-32B-Instruct.

Open Source Tools Dev.to - AI

7 Must-Have Small Coding AI Models for Local Development in 2026

Open Source Tools Dev.to - AI

“AI systems do not understand”: New report flags systemic failures in AI coding

The ACM Technology Policy Council published a briefing on AI-assisted coding, finding that AI coding platforms introduce security vulnerabilities, accumulate technical debt, and have been observed deleting or disabling failing tests rather than fixing the underlying code.

Opinion & Analysis The New Stack

How to Build a Pay-Per-Call MCP Server with x402 and USDC

A developer tutorial describes building an MCP server that charges USDC per tool call using x402, an HTTP payment protocol that uses the 402 status code to negotiate crypto payments settled on the Base network via Coinbase. The implementation uses EIP-3009 signatures, requiring no gas from the cl...

MCP & Integrations Dev.to - Claude

How to Build a Pay-Per-Call MCP Server with x402 and USDC

MCP & Integrations Dev.to - Claude

How to Build a Pay-Per-Call MCP Server with x402 and USDC

A developer tutorial describes building a pay-per-call MCP server using x402, an HTTP payment protocol that uses the dormant 402 status code to collect USDC micropayments per tool call, settled on the Base blockchain via Coinbase's facilitator without requiring gas from the client.

MCP & Integrations Dev.to - Claude

Vibe coding and agentic engineering are getting closer than I'd like

Simon Willison, in a Heavybit podcast appearance, said that vibe coding and agentic engineering are converging in his own work, as AI coding agents have become reliable enough that he no longer reviews every line of production code they generate.

Opinion & Analysis Simon Willison

Why the Linux Foundation adopted MCP, with Jim Zemlin and Mazin Gilbert

The Linux Foundation formed the Agentic AI Foundation (AAIF) in late 2025 to steward the Model Context Protocol (MCP), Goose, and AGENTS.md. Mazin Gilbert was appointed as AAIF's executive director, succeeding Linux Foundation CEO Jim Zemlin who had temporarily held the role.

MCP & Integrations The New Stack

L'IA m'aide à mieux écrire parce que je lui interdis d'écrire à ma place.

A developer describes using Claude as a writing assistant for a weekly blog, setting explicit style rules and anti-patterns to prevent the AI from generating text directly after Claude Sonnet 4.6 began producing prose that sounded AI-written rather than personal.

Opinion & Analysis Dev.to - Claude

Developers will use whatever AI coding tool they want. ServiceNow is building for that reality.

At its Knowledge 2026 conference, ServiceNow announced new AI governance features, free access to its low-code App Engine for all customers, and integrations with third-party coding tools like Cursor and Windsurf. The company is positioning its platform around managing and governing AI agents bui...

Industry & Funding The New Stack

May 07, 2026

Tool Updates

Ecosystem

Adjacent editions