// edition · 2026-05-14

May 14, 2026

27 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · CLI Agents

Claude Code Ultraplan: Cloud-Based AI Planning in 2026 — A Hands-On Tutorial

Anthropic's Claude Code Ultraplan, described as a research preview, separates the planning phase from code execution by offloading plan drafting to a cloud session, allowing users to review and comment on plans in a browser before execution. The feature requires Claude Code v2.1.91 or later and i...

Dev.to - Claude

Tool Updates

What Anthropic's $200 Agent SDK Credit Means If You Run claude -p in Production

Anthropic announced that starting June 15, 2026, Claude Agent SDK usage — including `claude -p` automation, Claude Code GitHub Actions, and third-party SDK-authenticated apps — will be billed against a separate monthly credit rather than subscription rate limits, with Max 20x subscribers receivin...

Pricing & Plans Dev.to - Claude

I was paying 3x too much for Claude API calls...

A developer building an AI agent found that passing data as raw JSON instead of plain prose used 2.6x more tokens, resulting in roughly 2.5x higher API costs per call. The difference stems from how BPE tokenization handles JSON structural characters like braces, quotes, and colons as separate tok...

Workflows & Tips Dev.to - Claude

⚽️ Claude Code Isn’t the Only Game in Town

Several AI coding agents compete with Anthropic's Claude Code, including OpenAI's Codex, which offers built-in browser access and cloud environments, and openCode, an open-source alternative. Most offer free tiers, and the tools vary in form factor between CLI, TUI, and full applications.

CLI Agents Dev.to - Claude

Why agent harnesses fail inside cloud-native systems

An analysis in The New Stack argues that AI coding agent performance depends more on surrounding scaffolding — prompts, tools, and feedback loops — than model selection, citing data showing the same model moved from rank 30 to rank 5 on Terminal Bench 2.0 with a different harness. The piece conte...

Agent Engineering The New Stack

How to build a skills library for your engineering team

An engineering team at Port built a centralized library of AI coding assistant "skills" — Markdown configuration files defining company standards — after discovering each engineer was running different, untracked local configurations. The library is stored in version control, allowing engineers t...

Workflows & Tips The New Stack

Right Model, Right Time: Why Model Routing Is Becoming Core to GenAI Platforms

Model routing directs AI prompts to different models based on complexity, cost, and latency, rather than using a single model for all queries. Cloud providers including Microsoft Azure AI Foundry and AWS Bedrock have released built-in routing tools trained on datasets spanning question answering,...

Agent Engineering Dev.to - AI

Claude for Small Business

Anthropic announced Claude for Small Business, a version or plan of its Claude AI assistant targeted at small business users.

Pricing & Plans Dev.to - Claude

Anthropic Launches Claude For Small Business

Anthropic launched Claude for Small Business on May 13, 2026, offering 15 agentic workflows, 15 reusable skills, and connectors to eight platforms including QuickBooks, PayPal, HubSpot, Canva, Docusign, Google Workspace, Microsoft 365, and Slack.

Pricing & Plans Dev.to - Claude

Ecosystem

Running autonomous agents without exposing credentials directly

A developer released "tsk," an open-source local MCP server written in Go that proxies API calls for LLM agents, injecting credentials at runtime without exposing them to the model. It enforces an allowlist via rules.yaml, scrubs sensitive data from responses, applies per-tool rate limits, and lo...

MCP & Integrations Dev.to - AI

Claude vs ChatGPT in 2026: Which One Should Devs Actually Use?

A developer comparison of Claude (Anthropic) and ChatGPT (OpenAI) in 2026 found Claude Opus 4.6 scores 80.8% on SWE-bench Verified versus GPT-5.4's roughly 80%, and 91.3% on GPQA Diamond reasoning benchmarks. Both services cost $20/month; Claude was rated stronger for long-context coding and regu...

Model Releases Dev.to - Claude

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state overwrites. The project claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and the Model Context Protocol.

Open Source Tools Dev.to - AI

RLHF trained Claude to be verbose. Here's the proof

A developer investigated why Claude produces verbose responses by analyzing RLHF training mechanics, arguing that human annotators in the reward model training phase tend to prefer longer responses, which reinforces verbosity as a learned prior. The author built a reward model simulation using An...

Opinion & Analysis Dev.to - Claude

Our response to the TanStack npm supply chain attack

A supply chain attack on the TanStack npm package, dubbed "Mini Shai-Hulud," compromised OpenAI signing certificates and systems. OpenAI is requiring macOS users to update its apps by June 12, 2026, as part of its remediation response.

Industry & Funding OpenAI Blog

Why enterprise AI needs customization

GitLab's 2025 Global DevSecOps Survey found developers spend about 15% of their time writing code, with the remainder on planning, reviewing, testing, and coordination. Enterprises are increasingly adopting multi-model AI strategies, routing tasks to different models based on cost, speed, and qua...

Opinion & Analysis The New Stack

Claude Finance: Anthropic Packages Wall Street Workflows Into 10 Agents

Anthropic launched Claude Finance on May 5, a bundle of 10 agent templates covering five financial services domains — investment banking, equity research, private equity, wealth management, and financial analysis. The package includes Microsoft 365 integration and ships via Claude Cowork and Clau...

Industry & Funding Dev.to - Claude

MinIO’s MemKV promises 95% better GPU utilization by ending AI recompute tax

MinIO launched MemKV, a petabyte-scale flash-based context memory store for AI inference workloads, accessed over 800 Gigabit Ethernet RDMA. The company claims it reduces GPU recompute by retaining context across GPU clusters, achieving 95% better GPU utilization and roughly 50% lower cost per to...

Industry & Funding The New Stack