// edition · 2026-05-14

May 14, 2026

27 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · CLI Agents

Claude Code Ultraplan: Cloud-Based AI Planning in 2026 — A Hands-On Tutorial

Anthropic's Claude Code Ultraplan, described as a research preview, separates the planning phase from code execution by offloading plan drafting to a cloud session, allowing users to review and comment on plans in a browser before execution. The feature requires Claude Code v2.1.91 or later and i...

Dev.to - Claude

Tool Updates

What Anthropic's $200 Agent SDK Credit Means If You Run claude -p in Production

Anthropic announced that starting June 15, 2026, Claude Agent SDK usage — including `claude -p` automation, Claude Code GitHub Actions, and third-party SDK-authenticated apps — will be billed against a separate monthly credit rather than subscription rate limits, with Max 20x subscribers receivin...

Pricing & Plans Dev.to - Claude

I was paying 3x too much for Claude API calls...

A developer building an AI agent found that passing data as raw JSON instead of plain prose used 2.6x more tokens, resulting in roughly 2.5x higher API costs per call. The difference stems from how BPE tokenization handles JSON structural characters like braces, quotes, and colons as separate tok...

Workflows & Tips Dev.to - Claude

Anthropic’s Claude Code agent view is a better dashboard. So why aren’t developers convinced?

Anthropic released an "agent view" dashboard for Claude Code that lets developers monitor and manage multiple AI coding sessions from a single CLI interface, showing session status and enabling inline replies. Developer reactions are mixed, with some welcoming the centralized view while others ar...

CLI Agents The New Stack

⚽️ Claude Code Isn’t the Only Game in Town

Several AI coding agents compete with Anthropic's Claude Code, including OpenAI's Codex, which offers built-in browser access and cloud environments, and openCode, an open-source alternative. Most offer free tiers, and the tools vary in form factor between CLI, TUI, and full applications.

CLI Agents Dev.to - Claude

Why agent harnesses fail inside cloud-native systems

An analysis in The New Stack argues that AI coding agent performance depends more on surrounding scaffolding — prompts, tools, and feedback loops — than model selection, citing data showing the same model moved from rank 30 to rank 5 on Terminal Bench 2.0 with a different harness. The piece conte...

Agent Engineering The New Stack

"When 'Control request timeout: initialize' actually means SIGKILL: Claude Code CLI OOM inside Celery"

A Celery worker running Claude Code CLI as a subprocess was intermittently failing with a misleading "Control request timeout: initialize" error, which turned out to be the Linux kernel OOM killer terminating the CLI process mid-startup. The fix was routing the task to a dedicated ECS Fargate que...

Agent Engineering Dev.to - Claude

Building a safe, effective sandbox to enable Codex on Windows

OpenAI built a secure sandbox for its Codex coding agent on Windows, implementing controlled file access and network restrictions to allow safe execution of automated coding tasks.

CLI Agents OpenAI Blog

How to build a skills library for your engineering team

An engineering team at Port built a centralized library of AI coding assistant "skills" — Markdown configuration files defining company standards — after discovering each engineer was running different, untracked local configurations. The library is stored in version control, allowing engineers t...

Workflows & Tips The New Stack

Right Model, Right Time: Why Model Routing Is Becoming Core to GenAI Platforms

Model routing directs AI prompts to different models based on complexity, cost, and latency, rather than using a single model for all queries. Cloud providers including Microsoft Azure AI Foundry and AWS Bedrock have released built-in routing tools trained on datasets spanning question answering,...

Agent Engineering Dev.to - AI

Claude for Small Business

Anthropic announced Claude for Small Business, a version or plan of its Claude AI assistant targeted at small business users.

Pricing & Plans Dev.to - Claude

Agentic Endpoint Remediation at Enterprise Scale | Intune Security Copilot | Rahsi Framework™ Analysis

A technical analysis describes using Microsoft Intune's Security Copilot integration to automate endpoint remediation at enterprise scale, converting endpoint signals into AI-driven, governed remediation actions. The piece applies a proprietary methodology called the Rahsi Framework™ to evaluate ...

Agent Engineering Dev.to - AI

Anthropic Launches Claude For Small Business

Anthropic launched Claude for Small Business on May 13, 2026, offering 15 agentic workflows, 15 reusable skills, and connectors to eight platforms including QuickBooks, PayPal, HubSpot, Canva, Docusign, Google Workspace, Microsoft 365, and Slack.

Pricing & Plans Dev.to - Claude

Ecosystem

Running autonomous agents without exposing credentials directly

A developer released "tsk," an open-source local MCP server written in Go that proxies API calls for LLM agents, injecting credentials at runtime without exposing them to the model. It enforces an allowlist via rules.yaml, scrubs sensitive data from responses, applies per-tool rate limits, and lo...

MCP & Integrations Dev.to - AI

Claude vs ChatGPT in 2026: Which One Should Devs Actually Use?

A developer comparison of Claude (Anthropic) and ChatGPT (OpenAI) in 2026 found Claude Opus 4.6 scores 80.8% on SWE-bench Verified versus GPT-5.4's roughly 80%, and 91.3% on GPQA Diamond reasoning benchmarks. Both services cost $20/month; Claude was rated stronger for long-context coding and regu...

Model Releases Dev.to - Claude

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state overwrites. The project claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and the Model Context Protocol.

Open Source Tools Dev.to - AI

I tested OpenAI’s three claims about GPT-5.5 Instant, and only one fully held up

A journalist tested GPT-5.5 Instant against GPT-5.2 after OpenAI replaced its default ChatGPT model, finding the conciseness claim did not hold up — GPT-5.2 produced shorter answers in all three test cases — while GPT-5.5 showed reduced hallucinations on factual queries.

Model Releases The New Stack

RLHF trained Claude to be verbose. Here's the proof

A developer investigated why Claude produces verbose responses by analyzing RLHF training mechanics, arguing that human annotators in the reward model training phase tend to prefer longer responses, which reinforces verbosity as a learned prior. The author built a reward model simulation using An...

Opinion & Analysis Dev.to - Claude

Notion just turned its workspace into a hub for AI agents

Notion launched a developer platform that allows teams to connect AI agents, external data sources, and custom code directly into their Notion workspace.

Industry & Funding TechCrunch - AI

Our response to the TanStack npm supply chain attack

A supply chain attack on the TanStack npm package, dubbed "Mini Shai-Hulud," compromised OpenAI signing certificates and systems. OpenAI is requiring macOS users to update its apps by June 12, 2026, as part of its remediation response.

Industry & Funding OpenAI Blog

Why enterprise AI needs customization

GitLab's 2025 Global DevSecOps Survey found developers spend about 15% of their time writing code, with the remainder on planning, reviewing, testing, and coordination. Enterprises are increasingly adopting multi-model AI strategies, routing tasks to different models based on cost, speed, and qua...

Opinion & Analysis The New Stack

Claude Finance: Anthropic Packages Wall Street Workflows Into 10 Agents

Anthropic launched Claude Finance on May 5, a bundle of 10 agent templates covering five financial services domains — investment banking, equity research, private equity, wealth management, and financial analysis. The package includes Microsoft 365 integration and ships via Claude Cowork and Clau...

Industry & Funding Dev.to - Claude

Anthropic’s Cat Wu says that, in the future, AI will anticipate your needs before you know what they are

Cat Wu, Anthropic's head of product for Claude Code and Cowork, said the next major step for AI is proactivity — systems that anticipate user needs before users are aware of them.

Opinion & Analysis TechCrunch - AI

Anthropic now has more business customers than OpenAI, according to Ramp data

Anthropic now has more business customers than OpenAI among companies tracked by fintech firm Ramp, with 34.4% of Ramp's clients paying for Anthropic services versus 32.3% for OpenAI.

Industry & Funding TechCrunch - AI

Adaption aims big with AutoScientist, an AI tool that helps models train themselves

Adaption released AutoScientist, a tool that automates the fine-tuning process for AI models, allowing them to adapt to specific capabilities without manual intervention.

Industry & Funding TechCrunch - AI

MinIO’s MemKV promises 95% better GPU utilization by ending AI recompute tax

MinIO launched MemKV, a petabyte-scale flash-based context memory store for AI inference workloads, accessed over 800 Gigabit Ethernet RDMA. The company claims it reduces GPU recompute by retaining context across GPU clusters, achieving 95% better GPU utilization and roughly 50% lower cost per to...

Industry & Funding The New Stack

Red Hat’s skill packs give AI agents something a bigger model never could: 20 years of institutional memory

Red Hat announced a dedicated AI skills repository at its Summit in Atlanta, offering "skill packs" that layer agent capabilities on top of Red Hat Enterprise Linux, OpenShift, and Ansible. The company's Ask Red Hat chatbot, now on its Customer Support Portal, was trained on over 20 years of Red ...

Industry & Funding The New Stack

May 14, 2026

Tool Updates

Ecosystem

Adjacent editions