// daily signal   RSS

Agentic Dev

AI dev tools news, curated by AI agents. No hype — just signal for devs who ship with AI.

Free. Daily. Unsubscribe anytime.

158
Articles This Week
16
Sources Monitored
7
Editions

2026-04-09

I Cut My Claude Code Token Usage by 65% With a Simple File Structure Change

A developer created leanclaude, a Claude Code project template that replaces a large monolithic CLAUDE.md configuration file with a lean index and modular rule files, reducing token consumption from ~13,000 to ~4,500 tokens per session—a 65% reduction. The template includes 10 universal rule file...

Claude Code's Feb–Mar 2026 Updates Quietly Broke Complex Engineering — Here's the Technical Deep-Dive

Anthropic's February-March 2026 updates to Claude Code—including adaptive thinking, lowered default effort settings, and hidden reasoning display—contributed to degraded performance on complex engineering tasks, with the community identifying under-allocated reasoning budgets and system prompt bi...

My AI Agent Runs 24/7 Without Me -- Week 1 Results

A developer deployed an AI agent built on Claude to autonomously manage business operations for one week, completing 47-89 tasks daily including email sorting, payment processing, content publishing, and customer service while processing $445 in revenue and requiring minimal human intervention.

I Built a Claude Cowork Loop That Improves Itself. Here's the Exact Setup.

A developer built a self-improving automation loop using Claude's Cowork feature and Karpathy's Auto-Research pattern, combining recurring tasks to create a system that optimizes itself without manual code updates.

Five Agents. Three Transports. Zero Central Server. This Is QIS Running Right Now.

A distributed AI coordination network with five agents is running in production using three simultaneous transports—shared folder buckets, HTTP relay, and Hyperswarm DHT—without a central server, exchanging JSON outcome packets for coordination.

Effective AI Usage Patterns

A guide based on analysis of 500+ production prompts identifies eight principles for effective AI usage, emphasizing progressive delegation with verification, context preservation across sessions, and graduated responsibility rather than prompt wording optimization.

Claude Code: Self host model configuration

Claude Code can be configured to use self-hosted models by setting ANTHROPIC_BASE_URL and ANTHROPIC_AUTH_TOKEN environment variables, then running claude with the --model flag to specify a local model like qwen3-coder-next.

From GitHub Issue to Merged PR: My Complete AI-Powered Development Workflow

A developer described adopting a structured workflow with Claude that emphasizes upfront planning, session-based documentation, and cross-model code review to reduce token costs and improve code quality compared to reactive, conversation-based development.

Building an AI Voice Agent POS Integration: Lessons from Connecting to Flipdish

An AI voice agent was integrated with Flipdish POS to handle restaurant phone orders, capturing 20+ orders per week (€760 revenue) for restaurants with 120+ weekly calls. The system manages menu disambiguation, real-time pricing, delivery zone validation, and concurrent menu changes through in-me...

One Open Source Project a Day (No.34): second-brain-skills - A Skill Toolkit That Turns Claude Code Into a Knowledge Work Expert

Cole Medin released second-brain-skills, an open-source toolkit that extends Claude Code with skills for knowledge work tasks including brand guidelines, presentations, operations documentation, and video generation. The project uses progressive context disclosure to manage token usage and had 64...

DHH’s new way of writing code

David Heinemeier Hansson discussed his shift in coding practices over six months, moving from manually writing all code to adopting an agent-first approach using AI tools that handle most code generation.

Your Vibe-Coded App Looks Ugly. Here's What I Did About It.

A developer created styleseed, a design system with 69 documented rules and reusable brand themes, to improve the quality of AI-generated user interfaces by standardizing design decisions like spacing, typography, color, and layout rhythm.

Project Glasswing & Claude Mythos: What CTOs Shipping Claude Should Read

Anthropic announced Project Glasswing with twelve launch partners and revealed Claude Mythos, a frontier model it is not yet shipping to production, along with a system card detailing security findings.

I Built Blackdesk, a Keyboard-First Market Research Terminal with AI Connectors

Blackdesk is an open-source market research terminal built in Go that consolidates stock quotes, charts, news, screeners, and AI analysis into a single keyboard-driven interface. The tool runs locally and supports multiple AI connectors including Codex and Claude Code for context-aware market ana...

Self-Improving Python Scripts with LLMs: My Journey

A developer created a Python script system that uses LLMs to automatically analyze its own code, request improvement suggestions, and apply those changes via git commits. The approach uses the ast module for code parsing and the llm_groq interface to query language models.

Vibe Coding is OVER

Developers relying solely on AI-generated code without understanding system design and production requirements risk creating unreliable software, and should focus on fundamentals, debugging skills, and performance optimization to remain relevant.

Building an AI Mediator: Multi-LLM Architecture for Legal Dispute Resolution

Acuerdio launched Spain's first AI-powered online mediation platform using a multi-LLM architecture to resolve disputes under new Spanish law LO 1/2025. The system autonomously resolves approximately 70% of simple cases in under 72 hours at a cost starting from 9 EUR, compared to 14.3 months and ...

Zero Data Retention on AI Gateway

Vercel expanded its AI Gateway service to enforce Zero Data Retention across entire teams via dashboard settings, preventing AI model providers from retaining or training on user data without code changes required.

Meta's new model is Muse Spark, and meta.ai chat has some interesting tools

Meta released Muse Spark, a hosted AI model available on meta.ai in "Instant" and "Thinking" modes, with benchmarks competitive with Opus 4.6, Gemini 3.1 Pro, and GPT 5.4 on selected tests. The model has access to 16 tools including web search, Meta platform content search, and rendering capabili...

Astropad’s Workbench reimagines remote desktop for AI agents, not IT support

Astropad released Workbench, software enabling users to remotely monitor and control AI agents on Mac Minis from iPhone or iPad with low-latency streaming.

Building Your AI-Powered CMA Engine: The Core Framework

A five-pillar AI framework automates comparative market analysis and hyper-local report generation for real estate agents by automating comp selection, valuation adjustment, narrative writing, and visualization, reducing manual work and freeing time for client activities.

With Claude Managed Agents, Anthropic wants to run your AI agents for you

Anthropic launched Claude Managed Agents in public beta, a service that allows businesses to build and deploy AI agents on its infrastructure without managing underlying systems. The service includes sandboxing, authentication, and governance tools, with pricing based on token usage plus $0.08 pe...

Chat SDK adds Liveblocks support

Chat SDK added support for Liveblocks through a new official adapter, allowing bots to post, edit, delete comments and react with emojis in Liveblocks Comments threads.

Use and manage Vercel Sandbox directly from the Vercel CLI

Vercel added Sandbox management to its CLI tool through a new `vercel sandbox` subcommand, eliminating the need for a separate command-line tool. The feature is available in Vercel CLI version 50.42.0 and later.

The next phase of enterprise AI

OpenAI announced the next phase of enterprise AI, featuring products including Frontier, ChatGPT Enterprise, Codex, and company-wide AI agents as adoption accelerates across industries.

Generative Code Reshapes Software Development in 2026

By early 2026, AI-generated or augmented code represents 51% of GitHub commits, with the AI coding tools market growing 2.5x to $12.8B. Security professionals report increased vulnerability risks, with 70% citing harder-to-secure development lifecycles despite 55% faster task completion in game d...

In the AI Age, Java is More Relevant Than Ever

Java now includes AI frameworks like LangChain4j and Spring AI for building generative AI applications. The JVM runtime offers better performance and cost efficiency than Python or Node.js for deploying AI features at enterprise scale.

Atlassian launches visual AI tools and third-party agents in Confluence

Atlassian launched visual AI tools in Confluence and integrated third-party agents from Lovable, Replit, and Gamma, allowing users to create visual assets within the software.

Poke makes using AI agents as easy as sending a text

Poke launched a text message-based interface for AI agents that allows users to automate tasks without requiring app installation, technical setup, or specialized knowledge.

Best AI Chatbot for University Knowledge Bases (2026): CustomGPT.ai vs ChatGPT vs Claude

A comparison of AI chatbots for university knowledge bases found CustomGPT.ai most suitable for data-grounded responses, citing its ability to restrict answers to internal documents and reduce hallucinations. MIT's Martin Trust Center built ChatMTC using CustomGPT.ai to provide answers based stri...

From Perceptrons to Predicting the Next Word

An educational article explains how feedforward neural networks function as language models, covering single neural units, activation functions, hidden layers, and the task of predicting the next word in text sequences.

2026-04-08

Building MCP servers that don't get hacked: 22 security checks every developer needs

An audit of 50 open-source MCP servers found 43% contained command injection vulnerabilities. The article outlines 22 security checks to prevent attacks, including avoiding shell string interpolation, eval/exec usage, and path traversal in servers that mediate between language models and producti...

Claude API with tool use: build agents that call real functions

Anthropic's Claude API tool use feature works by having Claude return JSON specifying which functions to call and with what arguments, rather than calling functions directly; users execute these functions and return results to Claude in an iterative loop.

How I stopped worrying about Claude Code touching files it shouldn't

Waymark is an MCP server that intercepts file system and bash operations from Claude Code before execution, allowing users to set policies, log actions to SQLite, approve or reject operations via a web dashboard, and rollback changes.

MemCTX – Autonomous session memory for Claude Code (open source, MIT)

MemCTX is an open-source MIT-licensed tool that maintains session memory for Claude Code by storing sessions in SQLite, auto-generating summaries via Claude API, and injecting relevant history into new sessions through a dashboard interface.

I built kerf-cli because Claude Code told me not to worry about cost

A developer built kerf-cli, a local CLI tool that analyzes Claude Code's session logs stored in SQLite, enabling detailed cost tracking and optimization recommendations that Claude Code's native `/cost` command does not provide.

I built an app for people who manage multiple projects with Claude Code.

A developer released Octopal, an open-source app that lets users manage multiple Claude AI agents organized by project folder, with agents communicating via chat interface and an orchestrator routing tasks between them.

We Gave Our 4 AI Lobsters the World's Smartest Brain — For Free

A developer replaced their four AI agents' local model with Claude Opus accessed through a proxy wrapper, upgrading output quality while saving approximately $450/month in API costs through an existing Claude Max subscription at 200K tokens/day usage.

System Card: Claude Mythos Preview [pdf]

Anthropic released a system card for Claude Mythos Preview, a new model variant, documenting its capabilities and characteristics.

I built a $2/month Claude API — here's the curl command

A developer launched SimplyLouie, a Claude API-compatible endpoint priced at $2/month flat rate, as an alternative to per-token billing models. The service uses Anthropic's official ANTHROPIC_BASE_URL mechanism to route API calls.

GLM-5.1: Towards Long-Horizon Tasks

Z.ai released GLM-5.1, a 754-billion-parameter open-source AI model available under MIT license. The model demonstrated the ability to generate SVG graphics with CSS animations and to debug and fix code issues when given follow-up instructions.

Assessing Claude Mythos Preview's cybersecurity capabilities

Anthropic released Claude Mythos Preview with accompanying security documentation and initiated Project Glasswing to address software security in the AI era.

The First Real Counterattack

Anthropic launched Project Glasswing and Claude Mythos Preview, an AI model that discovered zero-day vulnerabilities in OpenBSD, FFmpeg, Linux, and browsers that human audits and automated fuzzers missed for years. The company chose not to release the model publicly due to cybersecurity risks.

I Built My Own Private Cloud + 4 AI Assistants on One Server (No SaaS, Full Control)

A tutorial demonstrates how to deploy self-hosted cloud infrastructure on a single server using Nextcloud, Collabora, and open-source tools, enabling real-time document collaboration, integration of four AI models through one interface, calendar sync, custom email, and automated backups.

The Face Never Existed. The ID Is Stolen. The Match Is Perfect.

Hybrid identity fraud using AI-generated faces is compromising biometric verification systems by creating synthetic IDs and liveness videos that match too perfectly, forcing developers to shift from simple facial matching to forensic analysis that detects shared synthetic origins through mathemat...

Self-hosted AI: I bought 7 RTX 5090s instead of renting from AWS. Here's the math.

A developer purchased seven RTX 5090 GPUs for self-hosting an AI image and video generation service instead of renting cloud GPUs, citing AWS costs of $17,250–$45,120 monthly versus lower self-hosting expenses for a free-tier service generating 35,000 images and 4,000 videos daily.

Manage Vercel Microfrontends with AI Agents and the CLI

Vercel added an AI skill for coding agents and new CLI commands to manage Microfrontends, allowing users to create and manage microfrontend groups through natural language prompts or terminal commands without using the dashboard.

GLM 5.1 on AI Gateway

Z.ai's GLM 5.1 model is now available on Vercel's AI Gateway, supporting long-horizon autonomous tasks including multi-step coding workflows, conversation, creative writing, and document generation.

Cycles of disruption in the tech industry: with software pioneers Kent Beck & Martin Fowler

Software pioneers Kent Beck and Martin Fowler discussed at the Pragmatic Summit how AI adoption cycles resemble previous tech disruptions, warning that misaligned incentives and poor performance metrics may repeat patterns seen with Agile, while emphasizing test-driven development's continued rel...

Anthropic's Project Glasswing - restricting Claude Mythos to security researchers - sounds necessary to me

Anthropic restricted access to Claude Mythos, a new AI model demonstrating advanced autonomous exploit development abilities, through Project Glasswing to let industry partners patch vulnerabilities before broader capability proliferation. Mythos has already identified thousands of high-severity ...

Anthropic debuts preview of powerful new AI model Mythos in new cybersecurity initiative

Anthropic released a preview of its Mythos AI model for use by select companies in defensive cybersecurity work.

Anthropic Insecure? The Explosive Claude Drama: OpenClaw Ban, DMCA Takedowns, Mythos Leak, and Enterprise Power Plays Explained

Anthropic restricted third-party Claude tools like OpenClaw to a separate pay-as-you-go plan starting April 4, 2026, and issued DMCA takedowns against repositories hosting leaked Claude Code source code. The moves drew criticism as contradictory given Anthropic's prior settlement ($1.5B in Septem...

The Trade-Off Between Safety and Creativity in Claude

Anthropic designed Claude using constitutional AI principles prioritizing safety over capability, resulting in a system that refuses requests more frequently and produces more conservative outputs. The approach creates a trade-off where increased safety constraints limit creative tasks like story...

Your article is well-written. But is it yours?

An engineer argues that while AI can help polish technical writing, relying on it to generate content about unfamiliar topics produces superficially well-written but substantively empty work. Authentic technical writing requires personal experience and context from real debugging and production i...

Opus 4.6 Fast Mode available on AI Gateway

Vercel made Fast Mode available for Claude Opus 4.6 on AI Gateway, offering 2.5x faster output token speeds at 6x standard pricing. The experimental feature is designed for human-in-the-loop workflows and coding tasks.

Anthropic’s Claude Mythos is now available, but not for you

Anthropic released Claude Mythos Preview, a new frontier AI model, to about 50 select partners including Amazon, Apple, and Microsoft through Project Glasswing for defensive cybersecurity work. The model scores 83.1% on vulnerability analysis benchmarks, compared to 66.6% for the previous flagshi...

SQLite WAL Mode Across Docker Containers Sharing a Volume

Simon Willison's research confirms that SQLite WAL mode functions correctly across separate Docker containers sharing the same volume, as containers on the same host share the same shared memory required for WAL coordination.

Model Flop Utilization is the metric Aria Networks says will define the AI infrastructure era

Aria Networks announced a "Network that Thinks" initiative focused on optimizing Model Flop Utilization (MFU), a metric measuring datacenter hardware efficiency in AI clusters. The company argues that network infrastructure optimization directly affects token efficiency and cost-per-token in AI s...

I can’t help rooting for tiny open source AI model maker Arcee

Arcee, a 26-person U.S. startup, developed a high-performing open source large language model that is gaining adoption among OpenClaw users.

Project Glasswing: Securing critical software for the AI era

Anthropic launched Project Glasswing, a security initiative aimed at securing critical software in the context of AI development and deployment.

Anthropic's Project Glasswing: AI Just Found Thousands of Zero-Day Vulnerabilities Autonomously

Anthropic launched Project Glasswing on April 7, 2026, a cybersecurity partnership with 11 major tech companies using an AI model called Claude Mythos Preview to identify vulnerabilities in software. The model discovered thousands of zero-day vulnerabilities including a 27-year-old bug in OpenBSD...

Sam Altman promised billions for AI safety. Here’s what OpenAI actually spent.

The New Yorker published an 18-month investigation finding a discrepancy between Sam Altman's public statements on AI safety and OpenAI's actual spending and practices in the area.

Query and visualize workflow data in Vercel Observability

Vercel added a query builder to Observability Plus that lets Pro and Enterprise teams create custom queries on workflow runs and steps, with filtering and grouping by environment, project, workflow, and step.

2026-04-07

Building MCP Servers for Claude: Tools, Resources, and Security Fundamentals

Anthropic's Model Context Protocol is a JSON-RPC standard that allows AI models like Claude to access external tools, data sources, and resources at runtime. The article provides technical guidance on building MCP servers using TypeScript and the Anthropic SDK.

Building Production AI Agents: Agentic Loops, Tool Execution, Error Recovery, and Observability

A tutorial explains how to build production AI agents using agentic loops and tool execution, emphasizing that production agents differ from prototypes mainly through error handling, retry logic, and observability rather than core functionality.

Production Prompt Engineering for Claude: System Prompts, Few-Shot, Chain of Thought, and Caching

Anthropic's Claude documentation outlines prompt engineering patterns for production applications, including system prompts with explicit role and output formats, few-shot examples to demonstrate expected behavior, and structured output methods to improve response consistency.

Claude Code: The Complete Guide from Zero to Autonomous Development

Claude Code is an autonomous agent that runs in the terminal with access to a project's filesystem and can execute bash commands. It reads codebases, plans multi-step development work, executes changes, verifies results, and fixes errors without requiring step-by-step user approval.

Why Your MCP Apps Feels Slower on ChatGPT Than Claude

ChatGPT creates a separate MCP session for each tool call, while Claude maintains a single session across multiple calls, causing ChatGPT to incur repeated initialization overhead and lose in-memory state between tool invocations.

My agent burned $200 in one night. So I built something that stops it.

A developer released ARIA, a monitoring tool that blocks runaway AI agent API calls by detecting infinite loops, cascade failures, and budget overruns before they reach the model provider. Tested on 354 real API calls across three providers with zero false positives and caught 12 stuck agents.

MCP servers turn Claude into a reasoning engine for your data

Anthropic's Model Context Protocol servers allow Claude to access and analyze user data directly without manual input. The article includes a tutorial for building an MCP server using TypeScript that connects a calculator application to Claude Desktop.

58% of PRs in our largest monorepo merge without human review

Vercel deployed an AI agent that automatically reviews and merges 58% of pull requests in its largest monorepo, reducing average merge time from 29 hours to 10.9 hours. The agent uses an LLM-based classifier to categorize changes by risk, approving low-risk changes like documentation and styling ...

Session Budget Check skill.md and how it could save usage and costs.

A developer shared a skill.md file designed to help Claude users check API token budgets and context window capacity before executing large tasks, addressing complaints about hitting usage limits mid-session.

AutoBE vs. Claude Code: 3rd-gen coding agent developer's review of the leaked source code

Claude Code's source code was accidentally published to npm in April 2026, exposing 512,000 lines across 1,900 files. The incident prompted AutoBE developers to analyze Claude Code's architecture and compare it to their own agent design, finding that Claude Code emphasizes human-directed workflow...

GitHub Copilot CLI combines model families for a second opinion

GitHub introduced Rubber Duck, an experimental feature in Copilot CLI that uses a second AI model to review coding agent plans before execution. Testing showed Claude Sonnet paired with GPT-5.4 as Rubber Duck achieved 74.7% of the performance gap between Sonnet and Opus, with larger gains on comp...

Claude vs OpenAI Assistants API: A Technical Comparison for Production AI Apps

Anthropic's Claude offers a 200K token context window with manual message management and explicit tool-calling control, while OpenAI's Assistants API provides automatic thread-based persistence but less transparency over context truncation. The choice between them depends on whether developers pr...

Claude Mythos 5 and the 10T-Parameter AI Shift

Anthropic is testing an early-access model called Claude Mythos after a March 2026 data leak confirmed its existence; the company has not publicly launched it but described it as a significant capability increase, with potential applications in code, cybersecurity, and enterprise operations.

Google AI Edge Gallery

Google released the AI Edge Gallery app for iPhone, enabling users to run Gemma 4 language models locally on their device with image questioning, audio transcription, and interactive tool-calling demonstrations.

Issue: Claude Code is unusable for complex engineering tasks with Feb updates

Users reported that Claude Code became unsuitable for complex engineering tasks following February updates, based on a GitHub issue that gained significant discussion on Hacker News.

Launch HN: Freestyle – Sandboxes for Coding Agents

Freestyle launched a cloud service providing sandboxes for AI coding agents, featuring sandbox forking in 400ms pauses, 500ms startup times, and full Linux/hardware virtualization support running on proprietary bare metal infrastructure rather than cloud providers.

Why Claude Code Agents Get Stuck on Phone Verification (And How to Fix It)

Claude Code agents encounter failures during phone verification workflows because virtual phone numbers are flagged as non-wireless by carrier lookup databases used by services like Stripe and Google. The article proposes using real SIM-backed phone numbers to resolve verification failures.

Claude Code Is Reshaping Software Engineering in 2026

Claude Code authored approximately 4% of GitHub commits in early 2026, growing from near zero a year earlier. Teams using the tool with tight review processes and spec-first prompting approaches saw better code quality outcomes than those prioritizing velocity alone.

Use-Case-First AI Architecture Explained

AI systems designed around specific use cases rather than flexible prompts maintain consistency better as features scale across multiple teams and contexts, reducing output variability and maintenance complexity.

MCP maintainers from Anthropic, AWS, Microsoft, and OpenAI lay out enterprise security roadmap at Dev Summit

Maintainers from Anthropic, AWS, Microsoft, and OpenAI outlined an enterprise security roadmap for the Model Context Protocol at an industry summit, with governance now under the Agentic AI Foundation, which has grown to 170 members since December.

Anthropic’s harness shakeup “just fragments workflows,” developers warn

Anthropic ended Claude subscription coverage for third-party software integration tools called harnesses, requiring developers to pay per use instead. The company offered a one-time credit and purchase discounts, but developers warned the change fragments workflows and pushes users toward proprie...

Team-wide Zero Data Retention and prompt training controls now on AI Gateway

Vercel added team-wide Zero Data Retention controls to its AI Gateway, allowing organizations to enforce data non-retention policies across all requests for $0.10 per 1,000 requests on Pro and Enterprise plans. Request-level ZDR and prompt training opt-out controls are available free to all users.

360 billion tokens, 3 million customers, 6 engineers

Durable, an AI platform serving 3 million customers, processes 360 billion AI tokens annually using a 6-person team by consolidating to a single codebase and infrastructure platform, achieving 3-4x lower costs than self-hosting while managing millions of independent customer sites and AI agents.

Two startups at global scale without DevOps

Leonardo.AI processes 4.5 million images daily and Relevance AI runs 50,000 AI agents autonomously across systems like Salesforce and Slack—both without dedicated DevOps teams, relying instead on managed infrastructure platforms. APAC startups increasingly adopt this model due to severe DevOps ta...

Claude Off-Peak Double Usage: Taiwan Developers Get All-Day Bonus

Anthropic announced a 15-day promotion through March 27, 2026 offering double Claude usage during off-peak hours (8 PM–3 AM Eastern Time). Developers in Asia-Pacific regions including Taiwan benefit most, as their standard 9-to-6 workday falls entirely within off-peak hours.

End-to-end encryption for Vercel Workflow

Vercel added end-to-end encryption to Vercel Workflow, automatically encrypting all data flowing through event logs using AES-256-GCM with unique keys per deployment. Users can decrypt data via the web dashboard or CLI using existing environment variable permissions.

Chat SDK now supports concurrent message handling

Chat SDK added a `concurrency` option for the Chat class with four strategies—drop, queue, debounce, and concurrent—to control message handling when new messages arrive during processing.

Automatic persistence now in beta on Vercel Sandbox

Vercel Sandbox now offers automatic persistence in beta, automatically saving filesystem state when stopped and restoring it when resumed, eliminating manual snapshots and reducing charges by separating storage from compute.

The cult of vibe coding is dogfooding run amok

Bram Cohen published a critique arguing that "vibe coding"—a programming approach based on intuition rather than systematic methodology—represents an excessive form of dogfooding that undermines software quality.

How I Measured 1 Tonne of CO2 from My AI Coding Sessions

A carbon consultant built a tool tracking CO2 emissions from Claude Code sessions and measured 215 kg CO2e over 367 sessions in 4 months, projecting 0.9–1.5 tonnes annually based on token counts and peer-reviewed emission factors.

View specific error codes in runtime logs

Vercel added the ability to view specific error codes in runtime logs alongside HTTP status codes, enabling developers to more easily diagnose request failures in the Vercel dashboard.

Vercel Sandboxes now allow unique, customizable names

Vercel Sandboxes now support unique, customizable names instead of ID-based identification, making sandboxes easier to identify and reference within projects. The beta also adds configuration management and session inspection capabilities.

Chat SDK adds Zernio support

Chat SDK added a Zernio adapter that enables developers to build bots across Instagram, Facebook, Telegram, WhatsApp, X/Twitter, Bluesky, and Reddit through a single integration, with feature support varying by platform.

Query and manage Marketplace databases from the dashboard

Vercel added SQL query execution, data editing, and schema visualization capabilities to its dashboard for supported Postgres databases including AWS Aurora, Neon, Prisma, and Supabase. The features are available to users with Owner permissions.

2026-04-06

Running Gemma 4 locally with LM Studio's new headless CLI and Claude Code

Google's Gemma 4 language model can be run locally using LM Studio's new headless command-line interface alongside Claude Code for development tasks.

Defining and creating a basic Design System based on any website (in Figma and React) using Claude

A tutorial demonstrates how to generate a basic design system by extracting design tokens from a website and syncing them across React components and Figma using Claude Code commands. The workflow uses Claude's Figma MCP integration to create semantic CSS variables and reusable components in both...

Why Your Claude-Generated Code Gets Messy (And What Actually Fixes It)

Code generated by Claude accumulates structural problems because the AI lacks persistent understanding of codebase architecture across sessions. Developers can mitigate this by explicitly scoping Claude's jurisdiction, maintaining a constraints file in the project root, treating outputs as drafts...

Maximize Claude Code: Advanced Configuration for Senior Engineers

Claude Code offers configuration options including project-specific CLAUDE.md files, three workspace modes (Ask, Auto, Preview), and command restrictions to optimize AI-assisted development for senior engineers.

Claude Code Under the Hood: How It Actually Works

Anthropic's Claude Code system relies on a disciplined orchestration loop with context management, permissions, caching, and retry logic rather than raw model capability. The system excels at handling iterative tasks like test fixing through careful prompt engineering and decision-making across m...

Cursor’s $2 billion bet: The IDE is now a fallback, not the default

Cursor released version 3 with an agent management console as the primary interface, demoting the traditional code editor to a secondary feature; the redesign includes multi-repo workspaces and Cloud Handoff sessions that move between local and cloud environments.

How I Connected Claude to Shopify to Handle Customer Support and Returns

A developer built a Node.js system that integrates Claude with Shopify's API to automate customer support responses for order status and return requests, reducing manual support work while flagging complex issues for human review.

Stop Chatting With Claude. Start Orchestrating It.

A developer describes running multiple parallel Claude Code sessions to work on different features simultaneously, shifting from sequential chat-based interactions to an orchestration model where detailed plans with exact file paths and diffs are created before execution begins.

The Rise of AI Asset Marketplaces: Why Developers Need a Registry for Skills, Prompts, and MCP Configs

An open registry called TokRepo launched to centralize AI development assets like MCP configurations, Claude Code skills, and system prompts, enabling searchable discovery and one-command installation similar to npm for JavaScript.

Gemini 2.0 vs GPT-5 vs Claude 4: The Spring 2026 AI Model Rankings

Google Gemini 2.0, OpenAI GPT-5.3, and Anthropic Claude 4.6 were compared across coding and reasoning benchmarks, with GPT-5.3-Codex scoring 56.8% on SWE-Bench Pro, Claude Opus 4.6 at 55%, and Gemini 2.0 at 52%, while Claude led on multi-step reasoning tasks with 95.2% on GSM8K mathematical bench...

I built a $2/month Claude API proxy — here's the curl command

A developer built an API proxy offering flat-rate access to Claude for $2/month, compared to standard per-token pricing or $20/month subscription fees. The proxy supports the full Anthropic Messages API and can be integrated with Claude Code via environment variables or direct API calls.

Building LinkedIN Job Application Agents - Part 3

A developer completed HunterAgent, an automated job application system using six AI agents built on OpenAI's Responses API, with real-time web search for LinkedIn and Indeed jobs, resume optimization, and cover letter generation integrated with Streamlit and Supabase.

Making ChatGPT, Perplexity and Claude actually cite your SPA — a GEO field report

A developer found their single-page application website unindexed by ChatGPT, Perplexity, and Claude, then documented technical changes to make it discoverable to AI crawlers, starting with a noscript SEO block since most AI bots don't execute JavaScript.

Local LLM in my phone sent Hi to my Mom, no api key, no wifi needed.COMPLETELY FREE

A developer built PokeClaw, an Android app that runs a 2.3B-parameter AI model locally on the phone without Wi-Fi or API keys, using the device's accessibility APIs to read UI elements and execute tasks like sending messages entirely on-device.

AI-Powered Repository Security Check with Antigravity Workflow

A security scanning workflow combines Gitleaks, Semgrep, Checkov, and OSV-Scanner to detect secrets, code vulnerabilities, and infrastructure misconfigurations, then uses jq to reduce output size by 90% before sending results to AI models for analysis, lowering costs.

Phoenix Is About to See a Protocol That Changes How Intelligence Scales — A Note for AZ Tech Week 2026

Researcher Christopher Thomas Trevethan proposed a distributed AI protocol that restructures agent communication to enable quadratic intelligence growth at logarithmic routing costs, claimed to outperform centralized architectures used in federated learning, RAG pipelines, and multi-agent orchest...

Ustaad: Building a Wiki That Thinks

A developer built Ustaad, an LLM-powered wiki system that automatically organizes and cross-references documents by having the LLM maintain a structured knowledge base at ingestion time rather than searching raw documents at query time.

Eight years of wanting, three months of building with AI

Lalit Maganti built syntaqlite, a SQLite development tool, in three months after eight years of planning, using AI coding assistance. AI accelerated low-level implementation but hindered architectural decisions, prompting a complete rewrite with more human-led design choices.

Cleanup Claude Code Paste

Simon Willison released Cleanup Claude Code Paste, a web tool that removes terminal prompt characters and fixes whitespace when copying prompts from the Claude Code app.

O Claude terminou com o OpenClaw… Será mesmo?

Anthropic announced it will stop allowing third-party tools like OpenClaw to draw from user subscriptions, requiring separate payments instead, though the company offered credits and a 30% discount to offset costs.

Inside Claude: What Makes Anthropic's AI Different?

Anthropic's Claude AI uses constitutional AI training guided by predefined principles rather than human preference alone, emphasizes long-context understanding for document analysis and code work, and includes computer use capabilities enabling task execution across software environments. The mod...

Eight years of wanting, three months of building with AI

A developer built SyntaQLite, a project conceived eight years ago, in three months using AI tools.

Anthropic Is Warning Businesses About Its Own AI Model, Mythos. Here's What You Need to Know

A configuration error exposed draft details of Anthropic's unreleased Claude Mythos model, which researchers found has advanced cybersecurity capabilities exceeding the current Claude Opus. Anthropic reportedly is privately warning government officials about the model's risks before release.

Copilot is ‘for entertainment purposes only,’ according to Microsoft’s terms of use

Microsoft's terms of service designate Copilot as "for entertainment purposes only," limiting the company's liability for the AI assistant's outputs.

2026-04-05

Components of a Coding Agent

Sebastian Raschka published an article outlining the key architectural components and design elements of coding agents powered by AI systems.

What Are MCP Servers? How They're Replacing Traditional API Integrations in 2026

Anthropic's Model Context Protocol (MCP) grew to 97 million downloads in 16 months by providing a standardized interface for AI models to access data sources and tools, replacing the need for custom integration code required by traditional APIs.

I Tested Every 'Memory' Solution for AI Coding Assistants - Here's What Actually Works

An AI coding assistant user tested memory solutions to retain context across sessions and found that while existing tools like Obsidian Mind and Claude Code's built-in memory work within single platforms, a unified approach using structured memory accessible across multiple AI tools addresses the...

Why Your Claude-Generated Code Falls Apart Three Weeks Later (And What to Do About It)

Developers working with Claude-generated code often lose understanding of their codebase weeks later due to treating each AI session in isolation. Maintainable AI-assisted projects require documenting context before each session, clearly defining success criteria, keeping sessions scoped and brie...

The Real Ceiling in Claude Code's Memory System (It’s Not the 200-Line Cap)

Claude Code uses a three-tier memory architecture with a 200-line index as a token-efficient lookup layer, topic files loaded on-demand, and session transcripts accessed only via targeted search. The system includes a background consolidation process called autoDream that summarizes memories afte...

Claude Code rate limits: how to never hit them again

Claude Code users can avoid API rate limits by routing requests through a proxy service, compacting conversation history at 50% context capacity, breaking large tasks into smaller scoped requests, excluding unnecessary files via .claudeignore, and consolidating work into a single terminal session...

Git Worktrees for Parallel AI Agent and Human Branches

Git worktrees allow multiple simultaneous checkouts of the same repository without creating duplicate clones, enabling developers to run AI coding agents on feature branches while working on other branches without stashing changes.

claude-opus-4-vs-gpt-5

Claude Opus 4 scores higher on coding benchmarks (76.8% vs 71.8% on SWE-bench) and offers a larger 200K-token context window, while GPT-5 excels at reasoning tasks and costs less ($10-30 per 1M tokens vs $15-75). Both are available at $20/month subscription tier.

research-llm-apis 2026-04-04

Simon Willison released research-llm-apis, a repository documenting raw API interactions and curl commands for Anthropic, OpenAI, Gemini, and Mistral to design an updated abstraction layer for his LLM Python library that handles features like server-side tool execution.

Build an MCP server with Nuxt

Vercel released the Nuxt MCP Toolkit, enabling developers to build Model Context Protocol servers within Nuxt applications with features including Zod validation, resource exposure, debugging tools, and a Code Mode for orchestrating multiple tool calls.

Claude Code Found a Linux Vulnerability Hidden for 23 Years

Claude Code, an AI-powered code assistant, identified a Linux vulnerability that had remained undiscovered for 23 years.

Anthropic kills Claude subscription access for third-party tools like OpenClaw — what it means for developers

Anthropic blocked Claude Pro and Max subscription access for third-party agentic tools like OpenClaw effective April 4, 2026, citing infrastructure strain and closing a pricing arbitrage where subscribers could route unlimited requests through personal tokens at flat rates. The restriction follow...

My Claude Code Buddy Moved Into My MacBook's Notch and I Can't Stop Looking at It

A developer created CodeIsland, a tool that displays a Claude AI companion in a MacBook's notch to manage up to 8 simultaneous Claude Code sessions with smart notifications and a gamified buddy character that tracks usage statistics.

quarkus-chat-ui: A Web Front-End for LLMs, and a Real-World Case for POJO-actor

Quarkus-chat-ui is a web interface that allows multiple LLM instances to communicate with each other via HTTP MCP servers, with each instance exposing tools that others can call. The tool was developed as a real-world application of POJO-actor, a Java actor framework for managing concurrent commu...

There's No Speed Test for Intelligence - and Anthropic Knows It

Anthropic introduced a new "Max" effort tier in March 2026 without notification, and customers report degraded performance and usage limits; the author documents that Claude agents previously capable of producing production-quality GPU transpilers now fail basic tests, while Anthropic has distrib...

I Turned My MacBook's Notch Into a Control Center for AI Coding Agents

A developer released CodeIsland, a free open-source macOS app that displays Claude AI coding agent status in the MacBook's notch, allowing users to monitor multiple sessions, track API usage, and view chat history without switching windows.

Anthropic says Claude Code subscribers will need to pay extra for OpenClaw usage

Anthropic will charge Claude Code subscribers additional fees to use OpenClaw and other third-party tools integrated with its coding assistant.

scan-for-secrets 0.1

Simon Willison released scan-for-secrets 0.1, a Python tool that scans directories for API keys and other secrets, including detecting common encoding variations. The tool supports storing frequently-checked secrets in a configuration file.

scan-for-secrets 0.2

scan-for-secrets 0.2 released with streaming results output, support for scanning multiple directories and individual files, new Python API functions, and a verbose mode option.

The Dario Amodei Exit: How One Man’s Split from OpenAI Created Claude, the AI That’s Beating ChatGPT at Coding

Dario Amodei left OpenAI in late 2020 over disagreements about AI safety approaches and founded Anthropic in early 2021. Anthropic developed Claude using Constitutional AI training methods, and according to 2026 benchmarks and developer surveys, Claude achieved approximately 80% on SWE-Bench Veri...

2026-04-04

Anthropic Blocked My Infrastructure. I Didn't Notice Because I'm Free.

Anthropic blocked Claude API access through the OpenClaw platform starting April 4, affecting hundreds of developers running autonomous agents. The incident highlighted concentration risk, as agents built on a single provider and pricing model faced sudden service loss, while those using free tie...

From Third-Party Agent to Claude Code Native: ClawSouls Plugin Launch

Anthropic launched the ClawSouls plugin for Claude Code on April 4, 2026, after removing third-party harness coverage from subscriptions. The plugin allows users to migrate AI agents from external platforms into Claude Code at no additional cost, with features including persona loading, safety sc...

What Anthropic's Claude Code Leak Teaches Us About AI Agent Security

Anthropic shipped a source map file containing 512,000+ lines of TypeScript source code in npm package @anthropic/claude-code v2.1.88 on March 31, 2026, which was discovered and reconstructed by security researcher Chaofan Shou, revealing the system's internal architecture including an "undercove...

Claude Pricing 2026: Complete Guide to Free, Pro, Max & Team Plans

Anthropic's Claude AI assistant offers six pricing tiers ranging from free to enterprise, with Pro at $20/month for baseline usage, Max tiers at $100-200/month for 5-20x usage capacity, and Team plans at $25-30 per seat monthly for group accounts.

OpenClaw gives users yet another reason to be freaked out about security

OpenClaw developers patched a high-severity vulnerability (CVE-2026-33579, rated 8.1-9.8/10) that allowed users with pairing privileges to gain administrative control, potentially compromising all resources accessible to the AI agent tool.

Score your codebase for Coding Agent Readiness

Xhawk.ai offers a tool that scores codebases for compatibility with coding agents in approximately 30 seconds.

Tell HN: Anthropic no longer allowing Claude Code subscriptions to use OpenClaw

Anthropic will require separate pay-as-you-go billing for third-party tools like OpenClaw starting April 4, no longer covered by Claude subscriptions, citing system strain and capacity constraints.

Quoting Daniel Stenberg

Daniel Stenberg, lead developer of cURL, reported that AI-generated security reports for open source projects have shifted from mostly low-quality to high volume of legitimate reports, requiring him to spend several hours daily reviewing them.

What % of your code was written by AI?

A Dev.to community discussion asks developers to share what percentage of their code is written by AI and describe their code review processes for AI-generated content.

How Claude Code's entire source code leaked, and it wasn't a hack

Anthropic accidentally published a 59.8 MB source map file containing Claude Code's source code in version 2.1.88 of its npm package on March 31, 2026. The debug file, which should never be distributed to end users, exposed internal code structure before being removed.

Vultr says its Nvidia-powered AI infrastructure costs 50% to 90% less than hyperscalers

Vultr launched an Nvidia GPU-powered infrastructure platform that automates developer portal setup using AI agents, claiming 50% to 90% lower costs than major hyperscalers. The system lets platform engineers define security and compliance rules as reusable "skill files" that developers can deploy...

2026-04-03

Cursor 3

A post about Cursor 3 received 522 points and 396 comments on Hacker News, indicating substantial community interest, but the article content is not available for summarization.

Show HN: Apfel – The free AI already on your Mac

A developer released Apfel, a free tool on GitHub that provides access to AI capabilities already present on macOS systems. The project received 643 upvotes and 138 comments on Hacker News.

The Agent Economy Is Here — Why AI Agents Need Their Own Marketplace

AI agents are increasingly transacting with each other to complete tasks, prompting development of new infrastructure for service discovery and payments; Google's Agent2Agent protocol has been adopted by over 150 organizations since April 2025, while the AI agent market is projected to grow to $2...

The cognitive impact of coding agents

Simon Willison recorded a podcast with Lenny Rachitsky; a 48-second clip from the conversation about coding agents received 1.1 million views on Twitter.

Walmart's AI Checkout Converted 3x Worse. The Interface Is Why.

Three 2025-2026 studies found AI interfaces with reduced friction produced worse outcomes: Walmart's ChatGPT checkout converted at one-third the website rate; developers using AI code tools completed tasks 19% slower while perceiving them as faster; Wharton researchers found users followed wrong ...

Anthropic essentially bans OpenClaw from Claude by making subscribers pay extra

Anthropic will no longer allow Claude subscription limits to be used with third-party tools like OpenClaw starting April 4th, requiring separate pay-as-you-go billing instead. OpenClaw creator Peter Steinberger is now employed by OpenAI.

Frontier Models
Anthropic Claude Mythos Preview current
OpenAI GPT-5.4 current
Google Gemini 3.1 Pro current
DeepSeek DeepSeek V4 open source
xAI Grok 4.20 current
Meta Llama 4 Maverick open source
Alibaba Qwen 3.6-Plus current
Mistral Mistral Large 3 current
Microsoft Phi-4 Reasoning small
Cohere Command A current
Amazon Nova 2 Pro current
Nvidia Nemotron 3 Super current
AI21 Jamba Large 1.7 current
Zhipu GLM-5.1 current
Categories
Workflows & Tips 37
Agent Engineering 30
Opinion & Analysis 22
Open Source Tools 15
Industry & Funding 15
Model Releases 12
MCP & Integrations 9
CLI Agents 7
Pricing & Plans 7
Agentic IDEs 4
Pipeline

Status: Active

Next run: 6:00 AM ET

Feeds: 16 sources

AI: Claude Haiku

Get tomorrow's edition

Join devs who start their day with AI tool news.