// edition · 2026-06-05

June 05, 2026

25 stories on AI dev tools, agents, and the coding stack, curated from the day's RSS haul by Agentic Dev's pipeline.

Top Signal · Pricing & Plans

Cursor cuts prices and adds enterprise spend controls amid “tokenomics” reckoning

Cursor reduced its Teams plan annual price by 20% to $32 per user per month and introduced a $120/month Premium tier, while adding enterprise spend alerts and an "organizations" governance dashboard for managing budgets and model access across multiple deployments.

The New Stack

Tool Updates

My Company Wouldn't Let Me Use Claude Code. So I Built a Proxy That Redacts Code Locally

A software manager built Kiri, an open-source on-premises proxy that intercepts requests to cloud AI coding tools and replaces sensitive code with placeholders before forwarding them, allowing use of tools like Claude Code without exposing proprietary source code externally.

CLI Agents Dev.to - Claude
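
The placeholder idea in the summary above can be illustrated with a minimal sketch. This is not Kiri's implementation: the `redact` and `restore` helpers, the `PLACEHOLDER_n` naming, and the idea of passing in an explicit list of sensitive identifiers are all assumptions made for illustration.

```python
import re


def redact(source: str, sensitive_names: list[str]) -> tuple[str, dict[str, str]]:
    """Swap each sensitive identifier for a stable placeholder (hypothetical scheme)."""
    mapping: dict[str, str] = {}
    redacted = source
    for index, name in enumerate(sensitive_names):
        placeholder = f"PLACEHOLDER_{index}"
        mapping[placeholder] = name
        # \b avoids rewriting substrings of longer identifiers.
        redacted = re.sub(rf"\b{re.escape(name)}\b", placeholder, redacted)
    return redacted, mapping


def restore(text: str, mapping: dict[str, str]) -> str:
    """Put the original identifiers back into text returned by the remote tool."""
    for placeholder, name in mapping.items():
        # A function replacement keeps any backslashes in `name` literal.
        text = re.sub(rf"\b{re.escape(placeholder)}\b", lambda _m, n=name: n, text)
    return text


snippet = "total = acme_pricing_engine(cart) * 2"
redacted, mapping = redact(snippet, ["acme_pricing_engine"])
round_trip = restore(redacted, mapping)
```

Here `redacted` is what would leave the machine, and `mapping` stays local so the response can be translated back. A real proxy would also need to decide what counts as sensitive, which this sketch leaves to the caller.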

Claude Code for Web Development: How to Build Websites

Claude Code is Anthropic's command-line tool that generates HTML, CSS, JavaScript, and backend code from plain-language instructions. The tool can scaffold full websites, integrate with frameworks like React and Next.js, and guide users through deployment to platforms such as Vercel or Netlify.

CLI Agents Dev.to - Claude

The fifth layer is forming: six memory-tool authors wrote a Claude Code spec

Six independent developers building memory tools for Claude Code collaborated on GitHub issue #47023 to draft a four-hook lifecycle spec (PreCompact, PostCompact, SessionEnd, SessionStart) for external memory layers. The proposal, initiated April 12, addresses the absence of lifecycle events arou...

Agent Engineering Dev.to - Claude

Autonomous agents have met their biggest challenge yet: The database.

Carnegie Mellon computer science professor Andy Pavlo told attendees at Percona Live 2026 that databases represent the most difficult challenge for autonomous AI agents, because hallucinated queries or configuration changes in production systems can cause complete data loss, unlike errors in othe...

Agent Engineering The New Stack

Haystack 2026: The End-to-End NLP Framework for Production RAG & Agent Pipelines — Setup Guide

Haystack is an open-source NLP framework developed for building production-grade RAG pipelines and agent workflows, supporting document stores, retrievers, evaluation tools, and Docker deployment.

Workflows & Tips Dev.to - AI

How I set up Claude to teach me DSA through Leetcode problems

A developer open-sourced "Claude with Leetcode," an MIT-licensed tool that automatically commits accepted LeetCode solutions to GitHub and uses Claude to generate a daily data structures and algorithms analysis of each submission.

Workflows & Tips Dev.to - Claude

Claude's Visualize Feature Is Broken — Here's a One-Line Workaround

Claude's inline visualization feature has been broken since mid-March 2026 due to the external domain `claudemcpcontent.com` failing to resolve on any DNS server. A workaround exists: requesting output as a PNG file via Python's Pillow library bypasses the broken MCP dependency entirely.

Workflows & Tips Dev.to - Claude

Fix: "There's an issue with the selected model (deepseek-v4-pro)" in Claude CLI

A missing newline in a shell script caused two `export` lines to merge, preventing `ANTHROPIC_AUTH_TOKEN` from being set when routing Claude CLI through DeepSeek's API. The error "There's an issue with the selected model (deepseek-v4-pro)" was resolved by separating the two environment variable d...

Workflows & Tips Dev.to - Claude
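
A merged-line mistake like the one described above is easy to catch mechanically. The following is a rough sketch of such a check, not something from the original post: the `find_merged_exports` helper is hypothetical, and `EXAMPLE_OTHER_VAR` and the values are placeholders, since the summary names only `ANTHROPIC_AUTH_TOKEN`.

```python
import re

# Matches the start of an assignment such as `export NAME=`.
# No word boundary before `export`, so it still matches when the keyword
# is glued to the end of the previous value.
EXPORT_ASSIGNMENT = re.compile(r"export\s+[A-Za-z_][A-Za-z0-9_]*=")


def find_merged_exports(script: str) -> list[int]:
    """Return 1-based line numbers containing more than one `export NAME=`."""
    return [
        number
        for number, line in enumerate(script.splitlines(), start=1)
        if len(EXPORT_ASSIGNMENT.findall(line)) > 1
    ]


# Hypothetical scripts: the first is missing the newline between the exports.
broken = 'export EXAMPLE_OTHER_VAR="value"export ANTHROPIC_AUTH_TOKEN="placeholder"\n'
fixed = 'export EXAMPLE_OTHER_VAR="value"\nexport ANTHROPIC_AUTH_TOKEN="placeholder"\n'

broken_lines = find_merged_exports(broken)
fixed_lines = find_merged_exports(fixed)
```

The check is deliberately blunt: it would also flag a legitimate `export A=1; export B=2` on one line, which is usually acceptable for a quick sanity pass over a small setup script.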

How to secure Kubernetes in the age of AI workloads

AI agent workloads on Kubernetes expand the attack surface through unpredictable egress, GPU resource sharing, and dynamic tool invocation, requiring security beyond standard cluster hardening. Azure Kubernetes Service is moving toward network-isolated clusters that restrict outbound internet acc...

Agent Engineering The New Stack

How to Move ChatGPT Conversations Without Losing Context

Transferring ChatGPT conversation history to other AI platforms risks losing context, message order, and formatting due to token limits and export limitations. Recommended methods include exporting in JSON or Markdown format, preserving message sequence, and splitting long conversations into smal...

Workflows & Tips Dev.to - Claude

LuisCore ontology — shared vocabulary for agents and LLMs — daily syndication · 2026-06-05

LuisCore published a JSON-LD ontology at luiscore.com/ontology providing a shared vocabulary and glossary of terms for use by AI agents and LLMs across different frameworks. The ontology is designed to be machine-readable and citable, alongside APIs for agent registration, cluster telemetry, and ...

Agent Engineering Dev.to - AI

Ecosystem

A GitHub project claims 60-95% fewer tokens with the same answers. The number is real. The economics it implies for your agent fleet are uncomfortable.

A GitHub project called "headroom" preprocesses tool outputs, logs, and RAG chunks before they reach an LLM, claiming 60-95% fewer input tokens. Independent testing on 117 PR reviews using Claude found 58.4% input token reduction with an F1 score drop from 0.71 to 0.69.

Open Source Tools Dev.to - Claude
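
To put the independently measured figures above in proportion, here is a small arithmetic sketch. The helper names and the 1,000,000-token baseline are assumptions chosen for illustration; only the 58.4% reduction and the 0.71 and 0.69 F1 scores come from the summary.

```python
def relative_drop(before: float, after: float) -> float:
    """Fractional decrease from `before` to `after`."""
    return (before - after) / before


def input_tokens_after(tokens_before: int, reduction: float) -> int:
    """Input tokens remaining after removing a fraction `reduction`."""
    return round(tokens_before * (1 - reduction))


# Figures from the independent test; the baseline volume is hypothetical.
remaining = input_tokens_after(1_000_000, 0.584)
f1_drop_percent = round(relative_drop(0.71, 0.69) * 100, 1)
```

On these numbers, every million input tokens shrinks to 416,000, while F1 falls by about 2.8% in relative terms.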

Claude Opus 4 speed-up fix

Anthropic quietly optimized the response speed of Claude Opus, reducing latency to make the model more viable for agentic workflows and multi-step autonomous tasks. The update addresses prior delays that made real-time deployment impractical.

Model Releases Dev.to - Claude

Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop

Google released Gemma 4 12B, a multimodal language model that runs on 16GB of VRAM and benchmarks close to its 26B parameter counterpart. The model supports native audio input without separate encoders and is the first mid-sized Gemma model to do so.

Model Releases The New Stack

Anthropic's open-source framework for AI-powered vulnerability discovery

Anthropic released an open-source framework called "defending-code-reference-harness" on GitHub for AI-assisted vulnerability discovery in code. The repository provides a reference harness for using AI models to identify security vulnerabilities.

Open Source Tools Hacker News - Best


A benchmark of 10 LLMs on code generation tasks, priced from $0.20 to $3.00 per million tokens, found that DeepSeek-R1 scored highest (9.4) at $2.50 per million output tokens, while DeepSeek V4 Flash offered the best value, scoring 8.7 at $0.25.

Model Releases Dev.to - AI
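
One way to read the "best value" claim above is score per dollar. The sketch below uses only the two models and figures quoted in the summary; the `points_per_dollar` metric is an assumption, not necessarily how the benchmark ranked value, and it assumes both prices are quoted per million output tokens.

```python
def points_per_dollar(score: float, price_per_million: float) -> float:
    """Benchmark score divided by price per million tokens (illustrative metric)."""
    return score / price_per_million


# (score, price in USD per million tokens) as quoted in the summary.
results = {
    "DeepSeek-R1": (9.4, 2.50),
    "DeepSeek V4 Flash": (8.7, 0.25),
}

value = {
    model: round(points_per_dollar(score, price), 2)
    for model, (score, price) in results.items()
}
best_value = max(value, key=value.get)
```

Under this metric the cheaper model gives up 0.7 points of score for a tenfold lower price, which is why it comes out far ahead.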

I Let My AI Agent Build, Test & Ship a Chrome Extension — These 8 Skills Did 90% of the Work

A developer published an open-source set of eight AI agent skill files, installable via npx, designed to give AI coding assistants domain-specific knowledge for Chrome extension development, including Manifest V3 rules and Web Store submission requirements.

Open Source Tools Dev.to - Claude

OpenAI API vs Anthropic API: Which One Should Developers Choose in 2026?

OpenAI's API supports text, images, audio, and video natively, while Anthropic's Claude API handles text and image inputs but not generation of other media. Both APIs offer roughly 1 million token context windows, though OpenAI charges a 2x input premium above 272K tokens versus Claude's flat pri...

Model Releases Dev.to - Claude

Claude AI Vulnerability Scanner: Anthropic's Open-Source Code-Security Harness (2026)

Anthropic open-sourced a code security tool called "defending-code-reference-harness," a Claude-powered pipeline that scans repositories for vulnerabilities and suggests patches. The project appeared on GitHub Trending and supports integration into CI pipelines via a `/vuln-scan` command.

Open Source Tools Dev.to - Claude

Nemotron 3 Ultra now available on AI Gateway

Nvidia's Nemotron 3 Ultra, a 550B-parameter Mixture-of-Experts reasoning model with a 1M token context window, is now available on Vercel's AI Gateway. The model delivers up to 350 tokens per second and up to 30% lower cost on agentic tasks compared to other providers.

Model Releases Vercel Blog

AI enthusiasts are in a race against time, AI skeptics are in a race against entropy

Charity Majors argues that AI enthusiasts and skeptics in software teams both face legitimate existential risks — enthusiasts from competitive disadvantage if they don't adopt AI, skeptics from eroding code quality and institutional knowledge when shipping faster than engineers can review. She id...

Opinion & Analysis Simon Willison

Mate Security’s Asaf Wiener made every backend engineer a model router. He’s right to.

Mate Security CEO Asaf Wiener restructured the company's AI inference cost management after discovering spending that threatened its runway, breaking a single AI cost line into roughly ten tracked sub-lines. The $15.5 million seed-funded SOC startup now requires backend engineers to estimate toke...

Opinion & Analysis The New Stack

Snowflake thinks it knows what’s really slowing developers down

Snowflake announced new capabilities for CoCo, its AI coding agent, including a desktop app, mobile app, and Slack integration with autonomous task execution. The company also launched Snowflake Datastream, a fully managed Kafka-compatible streaming service that pipes real-time data directly into...

Industry & Funding The New Stack

🚀 Why Writing "Good Code" Isn't Enough Anymore: The Rise of AI-Native Engineering

Astapor Technologies published a post arguing that software development now requires building AI as the core logic of applications rather than adding it as a feature, outlining three practices: using AI-assisted coding tools, building autonomous AI agents, and connecting AI models to business wor...

Opinion & Analysis Dev.to - AI
