25 stories on AI dev tools, agents, and the coding stack, curated from the day's RSS haul by Agentic Dev's pipeline.
Top Signal · Pricing & Plans
Cursor reduced its Teams plan annual price by 20% to $32 per user per month and introduced a $120/month Premium tier, while adding enterprise spend alerts and an "organizations" governance dashboard for managing budgets and model access across multiple deployments.
The New Stack
A software manager built Kiri, an open-source on-premises proxy that intercepts requests to cloud AI coding tools and replaces sensitive code with placeholders before forwarding them, allowing use of tools like Claude Code without exposing proprietary source code externally.
CLI Agents
Dev.to - Claude
Claude Code is Anthropic's command-line tool that generates HTML, CSS, JavaScript, and backend code from plain-language instructions. The tool can scaffold full websites, integrate with frameworks like React and Next.js, and guide users through deployment to platforms such as Vercel or Netlify.
CLI Agents
Dev.to - Claude
Six independent developers building memory tools for Claude Code collaborated on GitHub issue #47023 to draft a four-hook lifecycle spec (PreCompact, PostCompact, SessionEnd, SessionStart) for external memory layers. The proposal, initiated April 12, addresses the absence of lifecycle events arou...
Agent Engineering
Dev.to - Claude
Carnegie Mellon computer science professor Andy Pavlo told attendees at Percona Live 2026 that databases represent the most difficult challenge for autonomous AI agents, because hallucinated queries or configuration changes in production systems can cause complete data loss, unlike errors in othe...
Agent Engineering
The New Stack
Haystack is an open-source NLP framework developed for building production-grade RAG pipelines and agent workflows, supporting document stores, retrievers, evaluation tools, and Docker deployment.
Workflows & Tips
Dev.to - AI
A developer open-sourced "Claude with Leetcode," a MIT-licensed tool that automatically commits accepted LeetCode solutions to GitHub and uses Claude to generate daily data structure and algorithm analysis of each submission.
Workflows & Tips
Dev.to - Claude
Claude's inline visualization feature has been broken since mid-March 2026 due to the external domain `claudemcpcontent.com` failing to resolve on any DNS server. A workaround exists: requesting output as a PNG file via Python's Pillow library bypasses the broken MCP dependency entirely.
Workflows & Tips
Dev.to - Claude
A missing newline in a shell script caused two `export` lines to merge, preventing `ANTHROPIC_AUTH_TOKEN` from being set when routing Claude CLI through DeepSeek's API. The error "There's an issue with the selected model (deepseek-v4-pro)" was resolved by separating the two environment variable d...
Workflows & Tips
Dev.to - Claude
AI agent workloads on Kubernetes expand the attack surface through unpredictable egress, GPU resource sharing, and dynamic tool invocation, requiring security beyond standard cluster hardening. Azure Kubernetes Service is moving toward network-isolated clusters that restrict outbound internet acc...
Agent Engineering
The New Stack
Transferring ChatGPT conversation history to other AI platforms risks losing context, message order, and formatting due to token limits and export limitations. Recommended methods include exporting in JSON or Markdown format, preserving message sequence, and splitting long conversations into smal...
Workflows & Tips
Dev.to - Claude
LuisCore published a JSON-LD ontology at luiscore.com/ontology providing a shared vocabulary and glossary of terms for use by AI agents and LLMs across different frameworks. The ontology is designed to be machine-readable and citable, alongside APIs for agent registration, cluster telemetry, and ...
Agent Engineering
Dev.to - AI
A GitHub project called "headroom" preprocesses tool outputs, logs, and RAG chunks before they reach an LLM, claiming 60-95% fewer input tokens. Independent testing on 117 PR reviews using Claude found 58.4% input token reduction with an F1 score drop from 0.71 to 0.69.
Open Source Tools
Dev.to - Claude
Anthropic quietly optimized the response speed of Claude Opus, reducing latency to make the model more viable for agentic workflows and multi-step autonomous tasks. The update addresses prior delays that made real-time deployment impractical.
Model Releases
Dev.to - Claude
Google released Gemma 4 12B, a multimodal language model that runs on 16GB of VRAM and benchmarks close to its 26B parameter counterpart. The model supports native audio input without separate encoders and is the first mid-sized Gemma model to do so.
Model Releases
The New Stack
Anthropic released an open-source framework called "defending-code-reference-harness" on GitHub for AI-assisted vulnerability discovery in code. The repository provides a reference harness for using AI models to identify security vulnerabilities.
Open Source Tools
Hacker News - Best
A benchmark of 10 LLMs for code generation tasks found DeepSeek-R1 scored highest at 9.4 with a cost of $2.50 per million output tokens, while DeepSeek V4 Flash offered the best value at $0.25 with a score of 8.7, across a price range of $0.20 to $3.00 per million tokens.
Model Releases
Dev.to - AI
A developer published an open-source set of eight AI agent skill files, installable via npx, designed to give AI coding assistants domain-specific knowledge for Chrome extension development, including Manifest V3 rules and Web Store submission requirements.
Open Source Tools
Dev.to - Claude
OpenAI's API supports text, images, audio, and video natively, while Anthropic's Claude API handles text and image inputs but not generation of other media. Both APIs offer roughly 1 million token context windows, though OpenAI charges a 2x input premium above 272K tokens versus Claude's flat pri...
Model Releases
Dev.to - Claude
Anthropic open-sourced a code security tool called "defending-code-reference-harness," a Claude-powered pipeline that scans repositories for vulnerabilities and suggests patches. The project appeared on GitHub Trending and supports integration into CI pipelines via a `/vuln-scan` command.
Open Source Tools
Dev.to - Claude
Nvidia's Nemotron 3 Ultra, a 550B-parameter Mixture-of-Experts reasoning model with a 1M token context window, is now available on Vercel's AI Gateway. The model delivers up to 350 tokens per second and up to 30% lower cost on agentic tasks compared to other providers.
Model Releases
Vercel Blog
Charity Majors argues that AI enthusiasts and skeptics in software teams both face legitimate existential risks — enthusiasts from competitive disadvantage if they don't adopt AI, skeptics from eroding code quality and institutional knowledge when shipping faster than engineers can review. She id...
Opinion & Analysis
Simon Willison
Mate Security CEO Asaf Wiener restructured the company's AI inference cost management after discovering spending that threatened its runway, breaking a single AI cost line into roughly ten tracked sub-lines. The $15.5 million seed-funded SOC startup now requires backend engineers to estimate toke...
Opinion & Analysis
The New Stack
Snowflake announced new capabilities for CoCo, its AI coding agent, including a desktop app, mobile app, and Slack integration with autonomous task execution. The company also launched Snowflake Datastream, a fully managed Kafka-compatible streaming service that pipes real-time data directly into...
Industry & Funding
The New Stack
Astapor Technologies published a post arguing that software development now requires building AI as the core logic of applications rather than adding it as a feature, outlining three practices: using AI-assisted coding tools, building autonomous AI agents, and connecting AI models to business wor...
Opinion & Analysis
Dev.to - AI