// edition · 2026-05-01

May 01, 2026

29 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · MCP & Integrations

Stop Installing MCP Servers on Your Laptop — Here's a One-Click Sandbox for Claude

A developer guide warns that installing MCP servers via npx runs arbitrary npm packages with full user-level access, citing incidents including a backdoored postmark-mcp package and CVE-2025-49596, a design flaw affecting ~200,000 instances across Anthropic's official MCP SDKs. The guide proposes...

Dev.to - Claude

Tool Updates

10 CLAUDE.md Rules Every Python Developer Needs in 2026

A Dev.to post outlines 10 recommended rules for CLAUDE.md configuration files in Python repositories, covering practices such as mandatory type hints, lockfile management with `uv`, and structured error handling to guide AI coding tools like Claude Code toward project-specific conventions.

Workflows & Tips Dev.to - Claude

Codex CLI 0.128.0 adds /goal

OpenAI released Codex CLI version 0.128.0, adding a `/goal` command that causes the coding agent to loop repeatedly until it determines a user-set goal has been completed or a configured token budget is exhausted. The feature is implemented via two prompt templates injected automatically at the e...

CLI Agents Simon Willison

A nine-point checklist for shipping production-ready AI

The New Stack published a nine-step technical guide for deploying AI systems to production, covering tool interface design, vector search with BM25 reranking, timeout and retry handling, OpenTelemetry-based observability, and bounded agent execution under concurrent load.

Agent Engineering The New Stack

How I Built a Multi-LLM AI Agent System for Hospital Management

A developer built HISDashboard, a hospital management AI system using 10 specialized agents distributed across 4 LLM providers with automatic fallback, after a single-provider setup failed due to rate limiting. The system uses a router-specialist-reflection architecture with structured intent cla...

Agent Engineering Dev.to - AI

Can Claude Skills Save Us From The Smartphone?

A developer built Claude Skills scripts called "tomorrow" and "today" that aggregate Google Calendar, email, Slack, and iMessage data into prioritized daily briefings, replacing manual checks across six separate apps.

Workflows & Tips Dev.to - Claude

CLAUDE.md Is Not Enough: The Governance Stack for Agentic Development

A developer proposed a five-layer governance framework for AI coding agents, arguing that CLAUDE.md alone provides only project orientation, not policy enforcement. The framework adds CONSTITUTION.md, DIRECTIVES.md, SECURITY.md, and AGENTS.md documents alongside runtime enforcement and external v...

Agent Engineering Dev.to - Claude

How to Build a Multimodal AI Chatbot with Claude 3.5 and React 19

A Dev.to tutorial outlines steps for building a multimodal chatbot using Anthropic's Claude 3.5 Sonnet API and React 19, with Vite as the build tool and react-dropzone for image input handling. The guide covers API client setup, environment variable configuration, and component construction using...

Workflows & Tips Dev.to - Claude

Using Gemini with OpenClaw: Setup Guide + Real Use Cases

OpenClaw, an AI agent platform for developer automation, supports Google's Gemini models as an LLM backend, with setup requiring an API key from Google AI Studio. The integration enables agents to process large codebases, multimodal artifacts such as images and video, and supports a free developm...

Workflows & Tips Dev.to - AI

Building a PDF Parser for Financial Data: Lessons from Arbiter V2

Arbiter Briefs added financial PDF ingestion to its V2, using regex and heuristics rather than ML to extract metrics from P&L statements, balance sheets, and cap tables. The pipeline uses pdf-parse for text extraction, multer for uploads capped at 10MB and 5 files per analysis, Railway persistent...

Agent Engineering Dev.to - AI

Ecosystem

Grok 4.3 on AI Gateway

xAI's Grok 4.3 is now available on Vercel's AI Gateway, accessible via the AI SDK using the model identifier `xai/grok-4.3`. The model features a 1M token context window and a December 2025 knowledge cutoff.

Model Releases Vercel Blog

Meta abandons open-source Llama for proprietary Muse Spark

Meta has shifted development focus from its open-source Llama models to a new proprietary model called Muse Spark, built from scratch by Meta Superintelligence Labs. Existing Llama models will remain available, but the company has not committed to continued frontier development of the open-source...

Industry & Funding The New Stack

We Ditched Claude 3.5 for GPT-5: 20% Higher Customer Satisfaction for Our Chatbot

A company migrated its customer support chatbot from Claude 3.5 Sonnet to GPT-5 after a 4-week model evaluation, reporting chatbot CSAT rising from 72% to 92% and tier-1 query resolution improving from 70% to 88% within 30 days of full rollout. Non-English CSAT increased from 61% to 84%, and huma...

Model Releases Dev.to - Claude

The Pulse: AI load breaks GitHub – why not other vendors?

GitHub's reliability dropped to 86% this month, down from 90% last month, according to third-party tracking, including a data integrity incident. GitHub leadership attributed the degradation to a 3.5x increase in service load driven by AI usage.

Opinion & Analysis Pragmatic Engineer

Microsoft Agent 365 Launches With Claude Inside: What It Means

Microsoft launched Agent 365 on May 1, 2026, an enterprise control layer for autonomous agents across Microsoft 365 apps including Word, Excel, Outlook, and Teams. The user-facing component, Copilot Cowork, runs multi-step autonomous tasks and was built in collaboration with Anthropic using Claud...

Industry & Funding Dev.to - AI

Our evaluation of OpenAI's GPT-5.5 cyber capabilities

The UK AI Security Institute evaluated OpenAI's GPT-5.5 for cyber capabilities, finding its ability to identify security vulnerabilities comparable to Anthropic's Claude Mythos. Unlike Mythos, GPT-5.5 is currently generally available.

Model Releases Simon Willison

The OpenAI-Microsoft reset, decoded: Why AWS may come out ahead

OpenAI is bringing its models, coding tools, and agentic capabilities to Amazon Web Services' Bedrock platform following a restructuring of its partnership with Microsoft, which had invested a reported $13 billion in OpenAI since 2019 and served as its exclusive cloud provider.

Industry & Funding The New Stack

Quickbase’s Pave targets vibe coding’s notorious 80% problem

Quickbase launched Pave, a full-stack AI application builder designed to address the "80% problem" in vibe coding, where AI-assisted development tools stall before reaching production-ready completion. The tool aims to integrate continuous quality validation into the development loop rather than ...

Industry & Funding The New Stack

I can't be persuaded

An AI system processes persuasion signals — tone, conviction, repetition — as plain text, treating a senior developer's experience-backed objection with the same weight as a Stack Overflow answer. This means AI tools cannot serve as tiebreakers in deadlocked team decisions where emotional commitm...

Opinion & Analysis Dev.to - AI

Quoting Andrew Kelley

Andrew Kelley, creator of the Zig programming language, stated that LLM-assisted code contributions are detectable because AI hallucinations differ fundamentally from human mistakes, and that Zig bans such contributions to its codebase.

Opinion & Analysis Simon Willison

Don't miss tomorrow's edition

Free daily email. Every story you just read, in your inbox at 6 AM ET.