36 stories on AI dev tools, agents, and the coding stack — curated from the day's RSS haul by Agentic Dev's pipeline.
Top Signal · Agent Engineering
OpenAI released a major update to its Agents SDK featuring sandboxed execution environments that separate agent control from compute resources, allowing developers to use their own infrastructure or integrate with services like Modal, E2B, and Vercel for improved security and scalability.
The New Stack
Anthropic released Claude Managed Agents on April 8, 2026, shifting agent orchestration from client-side to server-side. The API now handles multi-turn conversations, tool dispatch, session persistence, and context management automatically, reducing developer implementation overhead.
Agent Engineering
Dev.to - Claude
Anthropic reduced the default prompt cache time-to-live from 1 hour to 5 minutes on March 6, 2026, without public announcement, causing developers using Claude's prompt caching feature to experience reduced cache hit rates and higher token costs unless they send identical requests within the shor...
Agent Engineering
Dev.to - Claude
Claude Code includes a slash command system that lets developers save reusable prompts as custom commands stored in project or user directories. The author documented 10 commands designed to automate repetitive coding tasks like code reviews, component scaffolding, and commit messages.
Workflows & Tips
Dev.to - Claude
GitHub staff engineer Brittany Ellich built a personal organization command center application using GitHub Copilot CLI to consolidate scattered work across multiple apps into a single interface, completing the initial version in one day through AI-assisted development with planning and implement...
Workflows & Tips
GitHub Blog
A developer analyzed session logs from AI coding tools stored locally on disk and found a 98.3% cache hit rate across 13,634 calls, with Opus 4.6 accounting for $1,219 of a $1,274 weekly cost. The analysis revealed patterns including retry loops affecting 12% of coding tasks and potential overspe...
Workflows & Tips
Dev.to - AI
A developer described a method for building persistent memory systems for AI agents using a three-component architecture: a local database store, vector embeddings for semantic search, and context injection into agent prompts to enable memory retention across sessions.
Workflows & Tips
Dev.to - AI
Research found organizations adopting AI coding tools at scale in 2025-2026 shipped code 3x faster but saw critical security vulnerabilities increase 4x, driven by volume outpacing review capacity rather than lower code quality per line.
Agent Engineering
Dev.to - Claude
Ultra Lab ran Google Gemini 2.5 Flash, Claude Pro, and Ollama in parallel production for 90 days and documented actual costs and performance: Gemini's free tier (1,500 requests/day) can trigger automatic billing charges up to $128, Claude Pro costs $20/month with dynamic usage caps that vary by d...
Pricing & Plans
Dev.to - Claude
AI agents capable of autonomous actions using credentials pose security risks including hijacking and prompt-injection attacks that traditional security models weren't designed to detect, prompting NIST to study governance frameworks for their development and deployment.
Agent Engineering
The New Stack
As AI tools generate code rapidly, software development bottlenecks have shifted from writing code to validating it, according to Artur Balabanskyy, who runs an AI-first development agency. Development teams must now focus on quality assurance and testing rather than code production.
Agent Engineering
The New Stack
A developer automated changelog generation by connecting Claude API to GitHub Actions; when a pull request merges, the workflow extracts PR metadata and changed files, sends them to Claude, and commits the generated changelog entry in approximately 10 seconds.
Workflows & Tips
Dev.to - Claude
OpenAI released an updated Agents SDK with native sandbox execution and a model-native harness, enabling developers to build secure, long-running agents that can work across files and tools.
Agent Engineering
OpenAI Blog
An article proposes adding a database layer to Andrej Karpathy's LLM-based wiki pattern to handle operational data alongside evolving conceptual knowledge, arguing that metrics and pipeline numbers require different data structures than markdown-based concept refinement.
Agent Engineering
Dev.to - AI
AI agents can generate revenue by handling specialized technical work within professional workflows using models including SaaS subscriptions, monthly retainers ($2K-$10K), marketplace projects ($500-$5K), and white-label resale agreements. Success requires measurable results, domain specializati...
Workflows & Tips
Dev.to - AI
OpenAI updated its Agents SDK to include expanded capabilities for building enterprise agents with improved safety features.
Agent Engineering
TechCrunch - AI
A tutorial describes using Claude with BlackTwist MCP Server to plan and schedule 21 Threads posts in one conversation—three posts daily across a week in specified formats (short morning post, midday thread, evening one-liner).
Workflows & Tips
Dev.to - Claude
Simon Willison built a preview tool for the datasette.io website's news section, which is maintained in a YAML file, using Claude AI to generate a UI that validates syntax and shows rendered output.
Workflows & Tips
Simon Willison
AI agents operating offline on lightweight language models can serve informal economy workers in developing regions by automating micro-decisions on pricing and inventory with minimal connectivity. Technical approaches emphasize on-device processing, battery efficiency, and reward-based learning ...
Agent Engineering
Dev.to - AI
Anthropic reduced the default prompt cache time-to-live from 1 hour to 5 minutes on March 6, 2026, without public announcement, causing API users to experience higher cache miss rates and increased token costs unless they explicitly configure longer TTLs.
Industry & Funding
Dev.to - Claude
Enterprises are increasingly adopting self-hosted open-source image generation models combined with affordable APIs instead of relying solely on proprietary services like DALL-E and Midjourney, with per-image costs dropping from $0.02-$0.25 to $0.0005-$0.001 when self-hosted at scale.
Open Source Tools
Dev.to - AI
A .NET developer with 20 years of experience described Claude Code as functioning as an autonomous agent that can understand project goals and execute multi-step coding tasks, contrasting it with traditional autocomplete tools like GitHub Copilot. The developer reported that a feature requiring 3...
Opinion & Analysis
Dev.to - Claude
A developer released TokenBar, a macOS menu bar application that monitors token usage for Claude, Cursor, and other AI tools in real-time. The app costs $5 as a one-time purchase and aims to help users track spending before receiving bills.
Open Source Tools
Dev.to - Claude
Google's Gemma 4 AI model can run natively on iPhones with full offline inference, eliminating the need for cloud connectivity to use the model.
Open Source Tools
Hacker News - Best
Google released Gemini 3.1 Flash TTS, a text-to-speech system, across its products.
Model Releases
Google AI Blog
Claude Code, Anthropic's AI coding tool launched in May 2025, reached $2.5 billion in annualized revenue by February 2026, enabling non-technical employees to build custom software. A Retool survey found 35% of companies have replaced at least one SaaS tool with self-built software, with 78% plan...
Industry & Funding
The New Stack
A developer describes how using AI tools to generate code without understanding it—termed "vibe coding"—has degraded their problem-solving skills, syntax recall, and debugging ability, illustrated by struggling in a technical interview without AI assistance.
Opinion & Analysis
Dev.to - AI
A developer released a free, browser-based tool that removes visible and invisible watermarks from Google Gemini-generated images using local processing with no server uploads or account requirements.
Open Source Tools
Dev.to - AI
Anthropic's Claude service experienced elevated error rates across Claude.ai, its API, and Claude Code feature.
Industry & Funding
Hacker News - Best
Google released Gemini 3.1 Flash TTS, a text-to-speech model available via the Gemini API that generates audio from text prompts and supports detailed voice direction including accents, tone, and delivery style.
Model Releases
Simon Willison
Google released Gemini 3.1 Flash, a text-to-speech model. Simon Willison published notes and a tool interface for the new model.
Model Releases
Simon Willison
ByteDance's Seedance 2.0 video generation model is now available via Vercel's AI Gateway in Standard and Fast variants, supporting text-to-video, image-to-video, and multimodal reference-to-video generation with synchronized audio and video editing capabilities.
Model Releases
Vercel Blog
Datasette released version 1.0a27, replacing Django-style CSRF tokens with modern browser headers and adding a RenameTableEvent for plugin compatibility. The alpha also includes new actor parameters for client methods, temporary disk database options, and improvements to the upsert API.
Open Source Tools
Simon Willison
A researcher tested four AI models on identical prompts with and without custom rules, finding that detection rates varied significantly—for example, Gemini content detected as 100% AI-generated without rules but only 14% with rules—suggesting AI detectors identify patterns rather than genuinely ...
Opinion & Analysis
Dev.to - Claude
Gitar, a startup using AI agents to review and secure code, emerged from stealth with $9 million in funding. The company focuses on reviewing both human-written and AI-generated code.
Industry & Funding
TechCrunch - AI
Kyle Kingsbury predicted that organizations will employ people as accountable supervisors for AI systems, citing examples including Meta's human moderation reviewers, lawyers liable for court submissions containing LLM errors, and Data Protection Officers.
Opinion & Analysis
Simon Willison