// edition · 2026-04-16

April 16, 2026

36 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · Agent Engineering

OpenAI’s Agents SDK separates the harness from the compute

OpenAI released a major update to its Agents SDK featuring sandboxed execution environments that separate agent control from compute resources, allowing developers to use their own infrastructure or integrate with services like Modal, E2B, and Vercel for improved security and scalability.

The New Stack

Tool Updates

Claude Managed Agents: What Actually Changed for Builders (April 2026)

Anthropic released Claude Managed Agents on April 8, 2026, shifting agent orchestration from client-side to server-side. The API now handles multi-turn conversations, tool dispatch, session persistence, and context management automatically, reducing developer implementation overhead.

Agent Engineering Dev.to - Claude

Anthropic Silently Dropped Prompt Cache TTL from 1 Hour to 5 Minutes

Anthropic reduced the default prompt cache time-to-live from 1 hour to 5 minutes on March 6, 2026, without public announcement, causing developers using Claude's prompt caching feature to experience reduced cache hit rates and higher token costs unless they send identical requests within the shor...

Agent Engineering Dev.to - Claude

10 Claude Code commands that actually changed how I ship

Claude Code includes a slash command system that lets developers save reusable prompts as custom commands stored in project or user directories. The author documented 10 commands designed to automate repetitive coding tasks like code reviews, component scaffolding, and commit messages.

Workflows & Tips Dev.to - Claude

Build a personal organization command center with GitHub Copilot CLI

GitHub staff engineer Brittany Ellich built a personal organization command center application using GitHub Copilot CLI to consolidate scattered work across multiple apps into a single interface, completing the initial version in one day through AI-assisted development with planning and implement...

Workflows & Tips GitHub Blog

Reading your AI coding logs: cache hits, retry loops, and other signals

A developer analyzed session logs from AI coding tools stored locally on disk and found a 98.3% cache hit rate across 13,634 calls, with Opus 4.6 accounting for $1,219 of a $1,274 weekly cost. The analysis revealed patterns including retry loops affecting 12% of coding tasks and potential overspe...

Workflows & Tips Dev.to - AI

When AI writes 100K lines of code, QA becomes the whole job

As AI tools generate code rapidly, software development bottlenecks have shifted from writing code to validating it, according to Artur Balabanskyy, who runs an AI-first development agency. Development teams must now focus on quality assurance and testing rather than code production.

Agent Engineering The New Stack

The next evolution of the Agents SDK

OpenAI released an updated Agents SDK with native sandbox execution and a model-native harness, enabling developers to build secure, long-running agents that can work across files and tools.

Agent Engineering OpenAI Blog

Karpathy's LLM wiki pattern is missing a data layer. Here's how to add one.

An article proposes adding a database layer to Andrej Karpathy's LLM-based wiki pattern to handle operational data alongside evolving conceptual knowledge, arguing that metrics and pipeline numbers require different data structures than markdown-based concept refinement.

Agent Engineering Dev.to - AI

"How AI Agents Can Monetize Technical Expertise: A Practical 2026 Guide for Task

AI agents can generate revenue by handling specialized technical work within professional workflows using models including SaaS subscriptions, monthly retainers ($2K-$10K), marketplace projects ($500-$5K), and white-label resale agreements. Success requires measurable results, domain specializati...

Workflows & Tips Dev.to - AI

datasette.io news preview

Simon Willison built a preview tool for the datasette.io website's news section, which is maintained in a YAML file, using Claude AI to generate a UI that validates syntax and shows rendered output.

Workflows & Tips Simon Willison

"AI Agents in Survival Economies: Technical Deep Dive for Decision Makers"

AI agents operating offline on lightweight language models can serve informal economy workers in developing regions by automating micro-decisions on pricing and inventory with minimal connectivity. Technical approaches emphasize on-device processing, battery efficiency, and reward-based learning ...

Agent Engineering Dev.to - AI

Ecosystem

Anthropic Silently Dropped Prompt Cache TTL from 1 Hour to 5 Minutes

Anthropic reduced the default prompt cache time-to-live from 1 hour to 5 minutes on March 6, 2026, without public announcement, causing API users to experience higher cache miss rates and increased token costs unless they explicitly configure longer TTLs.

Industry & Funding Dev.to - Claude

I built a live AI token meter for Claude and Cursor

A developer released TokenBar, a macOS menu bar application that monitors token usage for Claude, Cursor, and other AI tools in real-time. The app costs $5 as a one-time purchase and aims to help users track spending before receiving bills.

Open Source Tools Dev.to - Claude

Claude Code and the rise of personal software

Claude Code, Anthropic's AI coding tool launched in May 2025, reached $2.5 billion in annualized revenue by February 2026, enabling non-technical employees to build custom software. A Retool survey found 35% of companies have replaced at least one SaaS tool with self-built software, with 78% plan...

Industry & Funding The New Stack

Vibe Coding Is Making Us Worse Developers

A developer describes how using AI tools to generate code without understanding it—termed "vibe coding"—has degraded their problem-solving skills, syntax recall, and debugging ability, illustrated by struggling in a technical interview without AI assistance.

Opinion & Analysis Dev.to - AI

Gemini 3.1 Flash TTS

Google released Gemini 3.1 Flash TTS, a text-to-speech model available via the Gemini API that generates audio from text prompts and supports detailed voice direction including accents, tone, and delivery style.

Model Releases Simon Willison

Gemini 3.1 Flash TTS

Google released Gemini 3.1 Flash, a text-to-speech model. Simon Willison published notes and a tool interface for the new model.

Model Releases Simon Willison

Seedance 2.0 Video Generation on AI Gateway

ByteDance's Seedance 2.0 video generation model is now available via Vercel's AI Gateway in Standard and Fast variants, supporting text-to-video, image-to-video, and multimodal reference-to-video generation with synchronized audio and video editing capabilities.

Model Releases Vercel Blog

datasette 1.0a27

Datasette released version 1.0a27, replacing Django-style CSRF tokens with modern browser headers and adding a RenameTableEvent for plugin compatibility. The alpha also includes new actor parameters for client methods, temporary disk database options, and improvements to the upsert API.

Open Source Tools Simon Willison

AI text is not AI

A researcher tested four AI models on identical prompts with and without custom rules, finding that detection rates varied significantly—for example, Gemini content detected as 100% AI-generated without rules but only 14% with rules—suggesting AI detectors identify patterns rather than genuinely ...

Opinion & Analysis Dev.to - Claude

Quoting Kyle Kingsbury

Kyle Kingsbury predicted that organizations will employ people as accountable supervisors for AI systems, citing examples including Meta's human moderation reviewers, lawyers liable for court submissions containing LLM errors, and Data Protection Officers.

Opinion & Analysis Simon Willison

Don't miss tomorrow's edition

Free daily email. Every story you just read, in your inbox at 6 AM ET.