Open Source Tools — Agentic Dev

Prompting Is Not Enough: Code-Enforced Research Workflows for AI Agents

A developer released Alpha Insights, an open-source research workflow tool for Claude Code and Codex Desktop that uses code-enforced stage gates and validators instead of prompts to control AI agent behavior. It includes 19 business frameworks, 9 thinking methods, evidence grading by source confi...

Dev.to - Claude · 2026-06-01

an open-source version Dynamic Workflows

A developer reported spending over 300 RMB in one morning using Claude Code's Dynamic Workflows feature, which runs multiple agents in parallel for validation and solution selection. They identified an open-source alternative called OpenWorkflows, available on GitHub, that supports lower-cost mod...

Dev.to - Claude · 2026-06-01

Spec-Driven Development con Superpowers: Como evoluciono mi vibecoding

Superpowers is an open-source plugin for AI coding agents including Claude Code, Cursor, and Gemini CLI that injects structured behavioral skills into the model. The plugin, available at github.com/obra/superpowers, guides agents through a defined workflow including brainstorming, spec writing, a...

Dev.to - Claude · 2026-05-30

llm-anthropic 0.25.1

Simon Willison released llm-anthropic 0.25.1, adding support for Anthropic's Claude Opus 4.8 model, a new fast mode option for eligible organizations, and changing the default max_tokens to each model's maximum output instead of 8,192.

Simon Willison · 2026-05-29

Genesis AI SDK — A Universal Flutter SDK for AI Agents

Genesis AI SDK is a Flutter package that provides a single API for building AI agents across seven providers, including Gemini, OpenAI, Anthropic, HuggingFace, Ollama, and on-device Gemma and GGUF models. The SDK includes built-in tool calling via a ReAct loop, persistent memory, and safety guard...

Dev.to - AI · 2026-05-29

markdown-svg-renderer

Simon Willison released markdown-svg-renderer, a web tool that renders Markdown with special handling for fenced SVG code blocks, displaying both the rendered image and a code view tab. It accepts pasted Markdown or URLs pointing to CORS-enabled Markdown files or GitHub Gists.

Simon Willison · 2026-05-29

Why Your AI Agent Keeps Making the Same Mistakes (It's Not the Model)

A developer released Rein, an open-source tool for Claude Code that monitors AI agent sessions to detect patterns indicating missing scaffolding — such as repeated bugs, context loss, or cost spikes — based on a framework the author calls "Harness Engineering."

Dev.to - Claude · 2026-05-28

Building a Runtime Continuity Layer for AI Coding Agents

Contorium Labs released Contorium, an open-source runtime continuity layer for AI coding agents that tracks workspace state, git state, and session context across sessions. It is compatible with Cursor, VS Code, Claude Code, Codex, and MCP-based agents, and does not use chat history for state per...

Dev.to - AI · 2026-05-28

One Open Source Project a Day (No. 78): stop-slop - A Skill File That Teaches AI to Eliminate Its Own Writing Tells

Product designer Hardik Pandya released "stop-slop," an MIT-licensed Markdown skill file for Claude and other AI tools that identifies and removes common AI writing patterns across 8 categories, using a 5-dimension scoring rubric. The project has accumulated over 5,800 GitHub stars and 435 forks.

Dev.to - Claude · 2026-05-28

Stop Paying for Noise: Trim LLM Tokens from Both Ends of the Pipe

RTK, an open-source CLI proxy, claims to reduce LLM input tokens by up to 89% by filtering noise from developer command output, while caveman, a Claude Code skill, claims 65% output token reduction by constraining model response verbosity. Both tools are MIT-licensed and available on GitHub.

Dev.to - Claude · 2026-05-27

LLM API Tokens burning your Bank even on testing ? Not anymore, cuesheet is here to help with that.

A developer released cuesheet v0.2.0, a Python testing tool that records LLM API responses to YAML files on first run and replays them locally on subsequent runs, eliminating API token costs during testing. It supports Anthropic, OpenAI, Gemini, Mistral, and DeepSeek via any httpx-based SDK, and ...

Dev.to - Claude · 2026-05-27

One Open Source Project a Day (No. 76): Knowledge Work Plugins - Anthropic's Official Role-Specialist Plugin Library

Anthropic released an official open-source plugin library called Knowledge Work Plugins, offering 11 role-specific configurations for Claude covering domains such as sales, legal, finance, and data analysis. The library uses MCP connectors to integrate with tools like HubSpot, Snowflake, and Figm...

Dev.to - Claude · 2026-05-26

My AI corrections used to be tribal knowledge. I shipped the fix into my own OSS.

A developer released Hypomnema, an open-source personal wiki tool that integrates with Claude Code via 14 lifecycle hooks, available via `npm install -g hypomnema` under the MIT license. It stores notes as plain markdown with git, requires no vector database or API keys, and auto-manages session ...

Dev.to - AI · 2026-05-26

Voicebox: The Open-Source AI Voice Studio That Just Hit 28K Stars

Voicebox, an open-source desktop voice synthesis application, has reached 28,500 GitHub stars. Released under the MIT license, it runs locally, supports 7 text-to-speech engines, voice cloning from short audio clips, 23 languages, and includes a Model Context Protocol server for AI agent integrat...

Dev.to - AI · 2026-05-26

AiFinPay: Autonomous Payments for ruvnet/ruflo

AiFinPay released a Python library called `aifinpay-agent` that provides payment infrastructure for AI agents. The package is installable via pip and the source code is available on GitHub under the AiFinPay organization.

Dev.to - AI · 2026-05-26

One Open Source Project a Day (No. 75): Understand Anything - The AI Engine That Turns Any Codebase Into an Explorable Knowledge Graph

"Understand Anything," an open source Claude Code plugin by developer Lum1104, converts codebases into interactive knowledge graphs using Tree-sitter for structural parsing combined with LLM agents for semantic analysis. The project has accumulated 26,500 GitHub stars and 2,300 forks, and support...

Dev.to - Claude · 2026-05-25

datasette-agent 0.1a4

Datasette-agent 0.1a4 was released, adding a "Start a new agent chat" interface to the Jump to menu (triggered by pressing "/"), using the makeJumpSections() JavaScript plugin hook introduced in Datasette 1.0a30.

Simon Willison · 2026-05-25

datasette 1.0a30

Datasette 1.0a30 alpha was released, adding a "Jump to..." navigation menu triggered by pressing `/` that filters databases, tables, and debug options. A new `jump_items_sql()` plugin hook allows plugins to add custom items to the menu.

Simon Willison · 2026-05-25

codegraph: The Missing Knowledge Graph for 5 Coding Agents

Colbymchenry released codegraph, a local code knowledge graph that indexes codebases into SQLite via Tree-sitter and exposes nine tools over an MCP server interface for use with Claude Code, Codex CLI, Cursor, OpenCode, and Hermes Agent. Benchmarks across seven codebases showed 59% fewer tokens, ...

Dev.to - Claude · 2026-05-24

AiFinPay: The AiFinPay SDK offers a seamless and secure paym

AiFinPay released a Python SDK for enabling payment processing in AI agent applications, available via pip as "aifinpay-agent" with source code published on GitHub.

Dev.to - AI · 2026-05-24

AiFinPay: The AiFinPay SDK provides a seamless and secure pa

AiFinPay released a Python SDK for enabling payment processing by AI agents, installable via pip as "aifinpay-agent" and available on GitHub at github.com/AiFinPay/sdk.

Dev.to - AI · 2026-05-24

AiFinPay: Autonomous Payments for ruvnet/ruflo

AiFinPay released a Python SDK for enabling autonomous payment capabilities in AI agents, installable via pip as `aifinpay-agent`. The project is associated with the ruflo repository by developer ruvnet.

Dev.to - AI · 2026-05-24

What 9,830 Claude conversations reveal about your own habits

A developer tool called skill-tree analyzes Claude Code session history, scoring users across 11 AI collaboration behaviors drawn from a taxonomy based on 9,830 Claude conversations, then assigns one of seven archetype profiles and suggests unused behaviors to practice.

Dev.to - Claude · 2026-05-23

Open Source Project (No.73): Sub2API - All-in-One Claude/OpenAI/Gemini Subscription-to-API Relay

Sub2API is an open-source API gateway written in Go that aggregates multiple Claude, OpenAI, and Gemini subscription accounts into a single unified API endpoint. The project, maintained by Wei-Shaw on GitHub under LGPL-3.0, includes multi-account management, token-level billing, and load balancin...

Dev.to - Claude · 2026-05-23

AiFinPay: The AiFinPay SDK offers a seamless and secure paym

AiFinPay released a Python SDK for enabling payment processing in AI agent applications, available via pip as "aifinpay-agent" with source code published on GitHub.

Dev.to - AI · 2026-05-23

AiFinPay: The AiFinPay SDK offers a seamless and efficient s

AiFinPay released an SDK for integrating payment processing and payouts into AI agent workflows, available via pip as `aifinpay-agent`. The project is hosted on GitHub at github.com/AiFinPay/sdk.

Dev.to - AI · 2026-05-23

AiFinPay: The AiFinPay SDK offers a seamless and secure paym

AiFinPay released an open-source Python SDK for processing payments between humans and AI agents, available via pip as "aifinpay-agent" with source code published on GitHub.

Dev.to - AI · 2026-05-23

AiFinPay: The AiFinPay SDK offers a seamless and efficient w

AiFinPay released an open-source Python SDK (`pip install aifinpay-agent`) designed to integrate payment processing functionality for AI agents into applications. The source code is available on GitHub at github.com/AiFinPay/sdk.

Dev.to - AI · 2026-05-23

Datasette Agent

Simon Willison released Datasette Agent, a conversational AI assistant for the Datasette data tool that generates SQL queries from natural language questions. It uses a plugin architecture and runs on Gemini 3.1 Flash-Lite in the live demo at agent.datasette.io.

Simon Willison · 2026-05-22

claudio: system-wide hold-to-talk dictation using Deepgram Nova 3 via your Claude subscription

Claudio is an open-source macOS menu bar app that enables system-wide hold-to-talk dictation by routing audio through Anthropic's speech-to-text API (Deepgram Nova 3), reusing the OAuth token stored by Claude Code. It requires macOS 13 or later and an active Claude Pro, Max, Team, or Enterprise s...

Dev.to - Claude · 2026-05-22

datasette-agent 0.1a3

Datasette Agent version 0.1a3 was released, adding "View SQL query" buttons for tables and SQL result tool calls, suppressing empty reasoning chunks, and improving handling of truncated responses so tables remain visible when SQL results are cut off.

Simon Willison · 2026-05-22

datasette-agent-charts 0.1a2

Simon Willison released datasette-agent-charts 0.1a2, adding "View SQL query" buttons below rendered charts in the Datasette agent plugin.

Simon Willison · 2026-05-22

Counting tokens is dumb. So we built a free metric for AI proficiency.

A company launched AIQ Rank, a free tool that scores AI coding proficiency from 0-1000 across 11 dimensions by reading local session activity from tools like Claude Code, Codex, and Cursor. The tool runs locally without transmitting data, and measures factors like customization, parallel agent us...

Dev.to - Claude · 2026-05-22

datasette-agent-sprites 0.1a0

Simon Willison released datasette-agent-sprites 0.1a0, an alpha-stage Datasette Agent plugin that enables running commands inside a Fly Sprites sandbox environment.

Simon Willison · 2026-05-22

One Open Source Project a Day (No. 71): CodeGraph — Pre-Index Your Codebase for AI Agents, Save 35% Cost and 70% Tool Calls

CodeGraph is an open source tool that pre-indexes codebases into a local SQLite semantic graph using tree-sitter, exposing 8 query tools to AI coding agents via MCP. Benchmarks across 7 projects show average reductions of 35% in token cost and 70% in tool calls; the project has 9,600+ GitHub stars.

Dev.to - Claude · 2026-05-21

The Future Is Going to B(e) MAD

A December 2025 analysis of 470 open-source GitHub pull requests found AI co-authored code had 1.7x more major issues and 2.74x higher security vulnerability rates than human-written code. BMAD is an open-source framework that applies traditional software development lifecycle phases to AI-assist...

Dev.to - Claude · 2026-05-21

Understanding AI Code Fast: A 60-Second Habit for Institutional Memory

Developer Shrijith Venkatramana released git-lrc, an open-source AI code reviewer that runs on each Git commit and generates summaries to help teams maintain institutional knowledge of AI-generated code. The tool implements an "awareness loop" — converting commits into daily and weekly reports to...

Dev.to - AI · 2026-05-21

datasette-llm 0.1a8

datasette-llm version 0.1a8 was released with a single bug fix addressing an issue where the `llm_prompt_context()` hook did not fully collect chains of responses.

Simon Willison · 2026-05-20

datasette-llm-accountant 0.1a4

Simon Willison released datasette-llm-accountant 0.1a4, a Datasette plugin for tracking LLM usage, with a bug fix for tracking chains of responses.

Simon Willison · 2026-05-20

Give Your AI Assistant a DolphinDB Brain — Install Agent Skills in 30 Seconds

A Python package called "dolphindb-agent-skills" was released, providing an offline knowledge base of DolphinDB syntax, APIs, and documentation for AI coding assistants such as Claude Code, Cursor, and GitHub Copilot. The open-source tool installs via pip and runs locally without sending data to ...

Dev.to - AI · 2026-05-20

Why your Anthropic prompt caching probably isn't working (and the npm package I built to fix it)

A developer published an npm package called `prompt-cache-optimizer`, a TypeScript wrapper for Anthropic's SDK that monitors prompt cache hit rates and warns about common failure modes, including prefix drift and TTL expiration. Anthropic's prompt caching offers a 90% token cost discount but requ...

Dev.to - Claude · 2026-05-20

llm-gemini 0.32a0

Simon Willison released llm-gemini 0.32a0, a plugin compatible with llm 0.32a0 alpha that adds support for streaming reasoning tokens from Gemini models.

Simon Willison · 2026-05-20

llm-gemini 0.32

Simon Willison released llm-gemini 0.32, adding support for Google's Gemini 3.5 Flash model via the new `gemini-3.5-flash` identifier in his LLM plugin.

Simon Willison · 2026-05-20

Steve Yegge’s AI agent orchestration project Gas Town comes to the cloud — and brings the Wasteland with it

Steve Yegge's open-source multi-agent orchestration project Gas Town, launched in early 2026, has expanded to include cloud components Wasteland and Gas City through a partnership with Kilo, an agentic coding platform co-founded by GitLab's former CEO Sid Sijbrandij. Gas Town splits software deve...

The New Stack · 2026-05-20

One Open Source Project a Day (No. 68): CLI-Anything - Making Every Piece of Software Agent-Native

The HKUDS lab at the University of Hong Kong released CLI-Anything, an open-source framework that wraps GUI-only desktop applications into structured command-line interfaces with JSON output, enabling AI agents to control software without APIs. The project supports 80+ applications, has over 35,7...

Dev.to - Claude · 2026-05-18

I Built an API That Parses Any Contract into Structured JSON

A developer released Clausify, an API that accepts contract documents in PDF, Word, or image formats and returns structured JSON containing fields such as parties, dates, duration, and governing law. The API is available on RapidAPI with a free tier allowing 20 requests per month.

Dev.to - AI · 2026-05-18

Zerostack – A Unix-inspired coding agent written in pure Rust

Zerostack is a coding agent written in Rust, released at version 1.0.0 on crates.io. It follows Unix design principles and is available as an open-source Rust crate.

Hacker News - Best · 2026-05-17

datasette-llm-limits 0.1a0

Simon Willison released datasette-llm-limits 0.1a0, a Datasette plugin that enables per-user or global spending limits on LLM usage, configurable by scope and time window, such as a $1.00 rolling 24-hour per-user cap.

Simon Willison · 2026-05-16

Osaurus brings both local and cloud AI models to your Mac

Osaurus is a Mac app that integrates both local and cloud AI models while storing user memory, files, and tools on the user's own hardware.

TechCrunch - AI · 2026-05-16

Clawdmeter turns your Claude Code usage stats into a tiny desktop dashboard

Clawdmeter is an open source desktop dashboard that displays usage statistics for Claude Code, Anthropic's AI coding tool.

TechCrunch - AI · 2026-05-15

I Tried TencentDB Agent Memory — Here's What the Token Reduction Looks Like

Tencent Cloud released TencentDB Agent Memory under MIT license in May 2026, a four-tier memory system for AI agents that offloads verbose tool output to local files while maintaining a compressed graph in context. Self-reported benchmarks show token reductions of 33–61% and task success improvem...

Dev.to - Claude · 2026-05-15

datasette-ip-rate-limit 0.1a0

Simon Willison released datasette-ip-rate-limit 0.1a0, a Datasette plugin that blocks IPs exceeding configurable request thresholds, built using OpenAI's Codex to address aggressive crawler traffic on datasette.io. The production configuration limits demo database paths to 60 requests per 60 seco...

Simon Willison · 2026-05-15

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state overwrites. The project claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and the Model Context Protocol.

Dev.to - AI · 2026-05-14

llm 0.32a2

Simon Willison released llm 0.32a2, an alpha version of his LLM command-line tool. The update switches reasoning-capable OpenAI models to the `/v1/responses` API endpoint, allowing summarized reasoning tokens to be displayed during prompts, with a `--hide-reasoning` flag to suppress them.

Simon Willison · 2026-05-13

One Open Source Project a Day (No. 64): Easy-Vibe - Datawhale's AI-Era Programming Curriculum

Datawhale, a Chinese AI learning community, published Easy-Vibe, an open-source programming curriculum built around AI-assisted "vibe coding," accumulating over 10,300 GitHub stars. The three-stage course targets non-programmers and covers tools such as Cursor and Claude Code, with support for 10...

Dev.to - Claude · 2026-05-13

Jensen Huang and Bill McDermott bet on OpenShell to secure enterprise AI agents

Nvidia released OpenShell, an Apache 2.0 open source secure runtime for autonomous AI agents, built over six months by senior director Ali Golshan's team. The system isolates each agent in a sandbox with an external gateway handling credentials, preventing agents from directly accessing host infr...

The New Stack · 2026-05-13

Why Your Multi-Agent AI System Needs Governance (Not Just Orchestration)

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit state updates to prevent silent write conflicts. The tool supports 14 frameworks including LangChain, AutoGen, and CrewAI, and includes per-agent token budget con...

Dev.to - AI · 2026-05-12

I made my .NET travel AI library work with OpenAI, Anthropic, Ollama, and Azure. Not just one.

A developer released TravelAI.Core v2.0.0, a .NET library for generating travel itineraries, adding support for OpenAI, Anthropic, and Ollama backends alongside the existing Azure OpenAI integration. The update also introduced a mock provider requiring no credentials for offline testing.

Dev.to - Claude · 2026-05-12

I Built an IDE Where Claude Runs in Every Project Tab — Inside Hermes 1.1.5

A developer released Hermes IDE version 1.1.5, an open-source IDE that runs Claude AI in isolated sessions per project tab, each with its own working directory, permission mode, model selection, and MCP plugin configuration. The tool is available at hermes-ide.com and on GitHub, and was built to ...

Dev.to - Claude · 2026-05-10

Visualizing the Invisible: Why LLM Math is Hard and How to Fix It | 0510-0947

A developer released Neural Viz Lab, a browser-based tool for visualizing neural network internals including weight interactions, tensor flow through layers, and attention head mechanics. The tool runs without a GPU or Python environment and is available at a Cloudflare Pages URL.

Dev.to - AI · 2026-05-10

Chat SDK adds Messenger adapter support

Vercel's Chat SDK added a Messenger adapter, enabling support for messages, reactions, multimedia downloads, postback buttons, and automatic display name retrieval from user profiles. The adapter was based on a community contribution via GitHub PR #461.

Vercel Blog · 2026-05-09

I built ARIA (Autonomous Reasoning intelligent agent) for hard code developers – an open-source CLI coding agent with AST validation and RAG web search

A developer released ARIA (Autonomous Reasoning and Intelligent Agent), a free open-source CLI coding agent installable via `pip install aria-x`. It supports any Ollama model and includes AST pre-validation for Python, RAG-based Stack Overflow search, and automatic approach-switching after three ...

Dev.to - AI · 2026-05-08

Veritas: Give Your AI Agent the Ability to Know What It Knows

A developer released Veritas, an open-source Python library that tracks confidence levels for AI agent beliefs using a four-component vector covering estimated certainty, source fragility, staleness, and source diversity. The system applies temporal decay to evidence based on source type, with em...

Dev.to - AI · 2026-05-08

With the launch of Meko, Yugabyte targets the data layer that’s breaking multi-agent AI systems

Yugabyte launched Meko, an open source data infrastructure tool designed to manage state in multi-agent AI systems. The company cites research finding that 37% of multi-agent system failures stem from state inconsistencies rather than model reasoning errors.

The New Stack · 2026-05-08

llm-gemini 0.31

Simon Willison released llm-gemini 0.31, updating the plugin to reflect that Google's Gemini 3.1 Flash-Lite model is now generally available, having exited preview status.

Simon Willison · 2026-05-08

Kanban in Hermes Agent for Self Hosted LLM Workflows

Hermes Agent includes a Kanban-style task board backed by SQLite that dispatches jobs to self-hosted LLMs via a daemon process. Without parallelism limits, the dispatcher can flood single-GPU or CPU-bound hosts; the current workaround is using `hermes kanban dispatch --max` to cap concurrent prom...

Dev.to - AI · 2026-05-08

AI Skills for Project Management

A developer built two open-source AI workflow tools — `project-complexity-mapper` and `project-complexity-action-planner` — based on TU Delft's TOE framework, which categorizes project complexity into Technical, Organizational, and External dimensions to guide management responses.

Dev.to - Claude · 2026-05-08

One Open Source Project a Day (No. 58): Agent Skills - Injecting Senior Engineer Discipline into AI Coding Agents

Addy Osmani, Engineering Director at Google Chrome, released "Agent Skills," a MIT-licensed collection of 20 Markdown-based engineering workflows for AI coding tools such as Claude Code and Cursor. The project, which has accumulated over 30,800 GitHub stars, encodes structured checklists to enfor...

Dev.to - Claude · 2026-05-07

7 Must-Have Small Coding AI Models for Local Development in 2026

A 2026 roundup identifies seven small language models suited for local coding use on consumer hardware, including OpenAI's open-weight gpt-oss-20b, a 21-billion-parameter MoE model with 3.6 billion active parameters and a 128k context window, and Alibaba's Qwen3-VL-32B-Instruct.

Dev.to - AI · 2026-05-07

7 Must-Have Small Coding AI Models for Local Development in 2026

A 2026 roundup identifies seven small language models suited for local coding use on consumer hardware, including OpenAI's open-weight gpt-oss-20b, a 21-billion-parameter MoE model with 3.6 billion active parameters and a 128k context window, and Alibaba's Qwen3-VL-32B-Instruct.

Dev.to - AI · 2026-05-07

What Every AI Agent Builder Needs to Know About State Coordination

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state conflicts. The tool supports 14 frameworks including LangChain, AutoGen, and CrewAI, and includes per-agent token budget contro...

Dev.to - AI · 2026-05-07

datasette-llm 0.1a7

datasette-llm 0.1a7 adds a configuration mechanism for setting default options on specific LLM models, allowing users to define defaults such as model selection and temperature for enrichment operations within Datasette.

Simon Willison · 2026-05-06

llm-echo 0.5a0

Simon Willison released llm-echo 0.5a0, a plugin for the LLM tool that provides a fake "echo" model for automated testing. The update adds a `-o thinking 1` option that simulates a reasoning block, compatible with LLM 0.32a0 and higher.

Simon Willison · 2026-05-06

Welcome to Maintainer Month: Celebrating the people behind the code

GitHub launched its sixth annual Maintainer Month, announcing new tools including granular pull request limits for unknown contributors and pull request archiving to remove spam. The releases follow GitHub data showing merged pull requests have nearly doubled year over year, with AI-generated con...

GitHub Blog · 2026-05-06

The perfect agent orchestration tool for your friendly ADHD developer

A developer and their partner released Apra Fleet, an open-source MCP server on GitHub that allows a single Claude Code session to dispatch tasks to agents running on multiple machines or in isolated local workspaces. The tool supports doer-reviewer loops within a single machine using separate fo...

Dev.to - Claude · 2026-05-05

Introducing deepsec: The security harness for finding vulnerabilities in your codebase

Vercel open sourced "deepsec," a security scanning tool that uses AI coding agents (Claude and Codex) to identify vulnerabilities in codebases, running locally or scaling to 1,000+ concurrent Vercel Sandboxes. The tool reports a 10-20% false positive rate and follows a five-step pipeline: scan, i...

Vercel Blog · 2026-05-05

WinkTerm: AI That Shares Your Terminal Session (Not Just Command Suggestions)

WinkTerm is an open-source AI terminal tool that writes commands directly into the shell input line within a shared PTY session, triggered by typing a `#`-prefixed comment. Built on Python, FastAPI, LangGraph, and Next.js, it is MIT licensed and available via Docker or as a desktop app.

Dev.to - AI · 2026-05-04

Spine v1: Stop Making Claude Rediscover Your Codebase Every Time You Open a Repo

A developer released Spine v1, an open-source tool that scans code repositories and generates architecture maps, prioritized reading orders, and a persistent context file at `.claude/REPO_CONTEXT.md` for use with Claude Code sessions.

Dev.to - Claude · 2026-05-03

Where Did My Claude Code Money Go? I Built a Tool to Find Out

A developer released Claude Code Cost Explorer, an open-source local dashboard that reads Claude Code's session files to show API costs broken down by individual turn, prompt, and tool call. It installs via pip and requires no API keys or external data uploads.

Dev.to - AI · 2026-05-03

I always worried my READMEs disagreed with my own repos. So I built README Clew.

A developer built README Clew, a tool that audits public GitHub repositories for discrepancies between README documentation and actual code, using Claude Sonnet 4.5 to extract claims and five deterministic verifiers to check dependencies, scripts, environment variables, and file references. The t...

Dev.to - Claude · 2026-05-03

Inside OpenSearch’s bid to become the default AI data layer

OpenSearch released versions 3.5 and 3.6 in February and April 2026, adding Better Binary Quantization (BBQ) that compresses high-dimensional vectors by 32x and the SEISMIC algorithm for sparse neural approximate nearest neighbor search. The project is positioning the open-source search engine to...

The New Stack · 2026-05-03

llm 0.32a0

Simon Willison released version 0.32a0, an alpha build, of his open-source `llm` command-line tool for interacting with large language models.

Simon Willison · 2026-04-30

LLM 0.32a0 is a major backwards-compatible refactor

Simon Willison released LLM 0.32a0, an alpha version of his Python library and CLI tool for accessing LLMs. The update refactors the core abstraction so model inputs can be represented as a sequence of messages and responses can include multiple typed parts, replacing the previous single-prompt, ...

Simon Willison · 2026-04-30

llm 0.32a1

Simon Willison released llm 0.32a1, a bug fix for the prior 0.32a0 release, correcting an issue where tool-calling conversations were not correctly restored from SQLite storage.

Simon Willison · 2026-04-30

Warp’s gamble: Going open source to take on closed-source rivals

Warp, maker of a Rust-based agentic development environment, released its client as open source under the AGPL license, with OpenAI named as founding sponsor of the repository. The agent workflows powering the platform are built on GPT models, and the company cited faster community-driven develop...

The New Stack · 2026-04-30

Complex UIs, Cross-App Workflows, Long Tasks: What GUI Agents Actually Unlock

Mininglamp Technology released Mano-P, an open-source GUI agent model under Apache 2.0 that operates graphical interfaces using only screenshots, without DOM parsing or system APIs. The model uses three-stage progressive training to enable AI agents to interact with software that lacks programmat...

Dev.to - AI · 2026-04-29

VibeVoice: Open-source frontier voice AI

Microsoft released VibeVoice, an open-source voice AI system, publishing the code on GitHub. The project appears to be a speech/voice model or toolkit made publicly available by Microsoft.

Hacker News - Best · 2026-04-29

microsoft/VibeVoice

Microsoft released VibeVoice, an MIT-licensed speech-to-text model with built-in speaker diarization, on January 21, 2026. The 17.3GB model (available in a 5.71GB 4-bit version) handles up to one hour of audio and transcribed 60 minutes of speech in 8 minutes 45 seconds on an M5 Max MacBook Pro.

Simon Willison · 2026-04-28

Open source package with 1 million monthly downloads stole user credentials

Attackers compromised elementary-data, an open source Python CLI tool with over 1 million monthly downloads, by exploiting a vulnerability in the developers' account workflow to push malicious version 0.23.3 to PyPI and Docker. The package collected warehouse credentials, cloud provider keys, API...

Ars Technica - AI · 2026-04-28

I built a self-hosted dashboard for running Claude Code across 8 machines

A four-person engineering team built and open-sourced a self-hosted web dashboard for managing Claude Code sessions across eight machines, replacing manual SSH workflows they said cost ~30 minutes daily. The MIT-licensed tool uses inotify file watching, PostgreSQL storage, and WebSocket streaming...

Dev.to - Claude · 2026-04-28

Your AI is swinging at nothing — 6 cognitive firewalls that cut my prod bugs by 87%

A developer released "Swing" (thestack-ai/swing-skills), an open-source set of six Claude Code skills designed to counter AI failure modes including confirmation bias and hallucination. The author reports agent-induced production bugs fell from 28% to 4% over 200 sessions across six weeks of pers...

Dev.to - Claude · 2026-04-28

An open-source spec for orchestration: Symphony

OpenAI published Symphony, an open-source specification for orchestrating Codex agents that connects issue trackers to automated agent systems, aiming to reduce manual context switching in engineering workflows.

OpenAI Blog · 2026-04-28

Boost Your Claude / AI Dev Workflow with These 4 Tools

Four open-source tools for Claude-based development workflows include Superpowers (structured prompt orchestration), UI UX Pro Max Skill (UI/UX prompt sets for React/Tailwind), Awesome Claude Code (curated resource list), and Claude Mem (persistent memory across sessions).

Dev.to - Claude · 2026-04-28

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that addresses race conditions in shared state access. The tool uses a propose-validate-commit cycle and claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and MCP.

Dev.to - AI · 2026-04-28

One Open Source Project a Day (No.49): free-claude-code - Run Claude Code for Free with One Environment Variable

Ali Khokhar released free-claude-code, a FastAPI-based local proxy that redirects Anthropic API requests from Claude Code to free backends such as NVIDIA NIM, OpenRouter, or Ollama by changing one environment variable. The project accumulated over 14,300 GitHub stars and 2,000 forks, topping GitH...

Dev.to - Claude · 2026-04-27

Why I built an AI-native terminal for network engineers and DevOps

A network engineer released NetCopilot, a beta SSH, Telnet, and Serial terminal client that embeds AI assistance to help diagnose network issues such as BGP failures and routing misconfigurations. The tool supports connections to Cisco, Juniper, and Linux devices and analyzes command outputs in c...

Dev.to - AI · 2026-04-27

My financial advisor doesn't know I bought NVDA yesterday. My AI agent does.

A developer published an open-source personal finance agent system built on Claude, using two specialized agents routed by text prefixes and lazy-loaded from local JSON files. The project, available at github.com/jasonsieg22/claude-finance-agents under MIT license, claims an approximately 80% tok...

Dev.to - Claude · 2026-04-26

llm 0.31

Simon Willison released version 0.31 of his open-source `llm` CLI tool, adding support for OpenAI's GPT-5.5 model, a verbosity level option for GPT-5+ models, and an image detail level parameter for image attachments.

Simon Willison · 2026-04-25

How I got my AI agents to communicate across repos — and shipped SAMP doing it

A developer released SAMP (Simple Agent Message Protocol) and a reference implementation called "agent-message," enabling AI coding agents to pass messages across separate repository sessions using append-only JSONL log files with no servers or daemons. The system uses content-addressed message I...

Dev.to - Claude · 2026-04-25

Mistral’s Leanstral wants to kill off human-in-the-loop code checks, but is it blowing in the wind?

Mistral AI launched Leanstral in March, an open-source code agent that uses formal verification via the Lean 4 programming language to mathematically prove code correctness. The model uses a Mixture-of-Experts architecture with 119 billion total parameters and 6.5 billion active parameters, relea...

The New Stack · 2026-04-25

Doby: How I Cut Claude Code's Navigation Tokens by 95% with a Spec-First Workflow

Developer changmyoungkim released Doby, an open-source tool that uses pre-compiled indexing and a spec-first workflow to reduce token usage in Claude Code navigation from 2,000–5,000 tokens per query to approximately 100 tokens, a claimed 95% reduction.

Dev.to - Claude · 2026-04-24

A pelican for GPT-5.5 via the semi-official Codex backdoor API

OpenAI released GPT-5.5 in its Codex product and for paid ChatGPT subscribers, but withheld API access pending safety reviews. Developer Simon Willison built an open-source LLM plugin, llm-openai-via-codex, that accesses GPT-5.5 through the Codex CLI's authentication mechanism, which OpenAI has s...

Simon Willison · 2026-04-24

Run Claude Code in Any Sandbox with One API: AgentBox SDK

AgentBox SDK is a new TypeScript library that provides a single API for running coding agents such as Claude Code across different sandbox providers, including local Docker. It launches agents as server processes communicating via WebSocket or HTTP, preserving approval flows and streaming events.

Dev.to - AI · 2026-04-24

llm-openai-via-codex 0.1a0

Simon Willison released llm-openai-via-codex 0.1a0, a plugin that reuses Codex CLI credentials to make OpenAI API calls via the LLM command-line tool, bypassing the need for separate API key configuration.

Simon Willison · 2026-04-24

I Lost a 3-Hour Debugging Session With Claude. So I Built a Fix.

A developer built a Chrome extension called Claude Exporter that allows users to save Claude AI conversations to PDF, Word (.docx), Google Docs, and Notion, with preserved code block formatting. The tool also supports exporting partial conversations rather than entire sessions.

Dev.to - Claude · 2026-04-24

Extract PDF text in your browser with LiteParse for the web

Simon Willison built a browser-based version of LlamaIndex's LiteParse PDF text extraction tool, available at simonw.github.io/liteparse/. The tool uses PDF.js and Tesseract.js to extract and spatially order text from PDFs entirely client-side, with no data leaving the user's machine.

Simon Willison · 2026-04-24

Introducing OpenAI Privacy Filter

OpenAI released an open-weight model called OpenAI Privacy Filter designed to detect and redact personally identifiable information (PII) in text. The company describes it as achieving state-of-the-art accuracy for PII detection tasks.

OpenAI Blog · 2026-04-23

My memory tool said "no session history." The session had 2,526 lines.

A developer found two bugs in Longhand, a local session-memory tool for Claude Code: a 2,526-line work session was never ingested due to a missing SessionEnd hook, and two other sessions were stored with NULL project IDs linked to the home directory instead of the correct project path.

Dev.to - Claude · 2026-04-23

Pinocchio: a skill to [try to] detect hallucinations

A developer published an AI skill called "Pinocchio" that appends a confidence percentage to responses from Claude and GitHub Copilot in VS Code, intended to flag potential hallucinations. The source code is available on GitHub at luizwbr/pinocchio-skill.

Dev.to - Claude · 2026-04-23

ccusage-web – web dashboard to track Claude Code token costs

ccusage-web is an open-source web dashboard for ccusage that displays Claude Code token costs broken down by day, month, session, and 5-hour billing block, with interactive charts and per-model breakdowns.

Dev.to - Claude · 2026-04-22

n8n Workflow Automation for Qatar Enterprises

n8n is an open-source, self-hostable workflow automation platform that connects apps and APIs without extensive coding. DigiMateAI published a guide promoting its use for Qatar enterprises, citing local data residency compliance and fixed-cost pricing as advantages over cloud-based alternatives l...

Dev.to - AI · 2026-04-22

Eclipse Foundation offers enterprise-grade open source alternative to Microsoft’s VS Code Marketplace

The Eclipse Foundation announced the Open VSX Managed Registry, a foundation-operated managed service for the Open VSX extension registry, an open source alternative to Microsoft's VS Code Marketplace. Commercial adopters receive a 99.95% uptime SLA, support tiers, and service credits; users incl...

The New Stack · 2026-04-22

Claude Desktop for Linux (unofficial, CLI-powered, local-first)

A developer released an unofficial Linux desktop client for Claude AI, built as a wrapper around Anthropic's official Claude CLI. The app stores chats locally in SQLite, runs file-access sessions inside a bubblewrap sandbox, and is currently in alpha.

Dev.to - Claude · 2026-04-21

OpenClaw Plugins — Ecosystem Guide and Practical Picks

OpenClaw's plugin ecosystem supports native gateway packages for channels, model providers, memory, voice, and web search, installable via CLI with validated manifests. The most mature plugins cover browser access, memory, and observability, while business-automation categories such as CRM and in...

Dev.to - AI · 2026-04-20

llm-anthropic 0.25

Simon Willison released llm-anthropic 0.25, a plugin providing LLM access to Anthropic's Claude models. The update adds support for claude-opus-4.7 with a thinking_effort option, new thinking_display and thinking_adaptive boolean flags, and increased default max_tokens limits per model.

Simon Willison · 2026-04-17

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

Alibaba's Qwen3.6-35B-A3B, run locally as a 20.9GB quantized model on a MacBook Pro M5, produced higher-quality SVG illustrations than Anthropic's Claude Opus 4.7 in informal tests conducted by Simon Willison on April 16, 2026.

Simon Willison · 2026-04-17

Hugging Face pushes into “computer use” with HoloTab agent that works through your browser

Hugging Face released HoloTab, a browser-based AI agent designed for "computer use" tasks — allowing the agent to interact with web interfaces autonomously. The project is open-source and operates through the browser to automate computer interactions.

The New Stack · 2026-04-17

Google Gemma 4 Runs Natively on iPhone with Full Offline AI Inference

Google's Gemma 4 AI model can run natively on iPhones with full offline inference, eliminating the need for cloud connectivity to use the model.

Hacker News - Best · 2026-04-16

I built a live AI token meter for Claude and Cursor

A developer released TokenBar, a macOS menu bar application that monitors token usage for Claude, Cursor, and other AI tools in real-time. The app costs $5 as a one-time purchase and aims to help users track spending before receiving bills.

Dev.to - Claude · 2026-04-16

Why Enterprises Are Ditching Expensive APIs for Open-Source Image Generation in 2026

Enterprises are increasingly adopting self-hosted open-source image generation models combined with affordable APIs instead of relying solely on proprietary services like DALL-E and Midjourney, with per-image costs dropping from $0.02-$0.25 to $0.0005-$0.001 when self-hosted at scale.

Dev.to - AI · 2026-04-16

I Built a Free Gemini AI Watermark Remover (No Signup, Local Processing)

A developer released a free, browser-based tool that removes visible and invisible watermarks from Google Gemini-generated images using local processing with no server uploads or account requirements.

Dev.to - AI · 2026-04-16

datasette 1.0a27

Datasette released version 1.0a27, replacing Django-style CSRF tokens with modern browser headers and adding a RenameTableEvent for plugin compatibility. The alpha also includes new actor parameters for client methods, temporary disk database options, and improvements to the upsert API.

Simon Willison · 2026-04-16

Spring creator wants Java’s type system to tame agentic AI

Rod Johnson, creator of the Spring Framework, launched Embabel, an Apache-licensed agentic AI framework for Java built on Spring Boot, at Microsoft's JDConf conference to address enterprise predictability challenges in large language model applications.

The New Stack · 2026-04-15

How exposed is your code? Find out in minutes—for free

GitHub introduced Code Security Risk Assessment, a free tool that scans up to 20 repositories using CodeQL to identify vulnerabilities by severity and language, available to organization admins and security managers at no cost.

GitHub Blog · 2026-04-15

datasette PR #2689: Replace token-based CSRF with Sec-Fetch-Site header protection

Datasette pull request #2689 replaces token-based CSRF protection with Sec-Fetch-Site header protection, removing the need for hidden CSRF token form inputs and simplifying the security implementation based on research by Filippo Valsorda and Go 1.25.

Simon Willison · 2026-04-15

I built an MCP server that lets Claude debug failed cron jobs

A developer released an MCP server for CronSignal that allows Claude and other AI tools to diagnose failed cron jobs, retrieve error logs, and manage monitors through terminal commands. The server exposes tools including list_checks, diagnose_check, get_check_output, pause_check, and resume_check.

Dev.to - Claude · 2026-04-14

Quick Codex: a lightweight workflow layer for Codex CLI

Quick Codex is an open-source workflow layer for Codex CLI that stores task state in files to address context loss during multi-turn coding sessions. The tool provides two modes—qc-flow for exploratory work and qc-lock for strict execution—along with utilities for resuming, verifying, and checkin...

Dev.to - AI · 2026-04-14

I built a $10/month Claude API — here's the curl command

A developer launched SimplyLouie, a Claude API gateway offering $10/month flat-rate access with unlimited calls, as an alternative to Anthropic's $15 per million token pricing model.

Dev.to - Claude · 2026-04-14

Exploring the new `servo` crate

The Servo browser engine was released as an embeddable Rust crate on crates.io. A CLI tool was built to demonstrate its ability to render web pages as screenshots, though compiling Servo itself to WebAssembly proved infeasible due to threading and dependency constraints.

Simon Willison · 2026-04-14

One Open Source Project a Day (No.37): everything-claude-code - The Most Systematic Claude Code Enhancement Framework

Everything-claude-code is an open-source enhancement framework for Claude Code that includes 181 skills, 47 sub-agents, and 34 rules designed to improve productivity and code quality. The project, created by Affaan Mustafa, reportedly has over 150,000 GitHub stars and supports multiple AI coding ...

Dev.to - Claude · 2026-04-13

How I prevent state drift across long-running AI-assisted projects

A developer published Sessioncraft, an open-source governance system for managing state and context drift across long-running AI-assisted projects using Claude, after identifying recurring problems across 180+ sessions including stale information and forgotten constraints.

Dev.to - Claude · 2026-04-12

How I Track AI Coding Costs Across 4 Platforms with One Tool

A developer created cc-statistics, an open-source tool that aggregates AI coding costs from Claude Code, Gemini CLI, Codex, and Cursor into a unified view via CLI, web dashboard, and macOS menu-bar app.

Dev.to - Claude · 2026-04-12

SQLite Query Result Formatter Demo

Simon Willison released a web-based demo tool for SQLite's Query Result Formatter library, which allows users to test various rendering options for SQL result tables using WebAssembly.

Simon Willison · 2026-04-12

OCP v3.5.0: Zero-Config LAN Sharing — Your Family Can Use Claude in 30 Seconds

Open Claude Proxy v3.5.0 allows multiple devices on a local network to share a single Claude subscription by routing requests through a central server; new clients can connect by running a single curl command instead of manually configuring API keys.

Dev.to - Claude · 2026-04-11

One Open Source Project a Day (No. 35): claude-code-best-practice - Moving from Vibe Coding to AI-Native Development

Claude-code-best-practice is an open-source reference guide by shanraisshan with over 1,000 GitHub stars that provides strategies for using Anthropic's Claude Code CLI tool, including guidance on configuration files, prompt engineering, and coordinating multi-step development tasks.

Dev.to - Claude · 2026-04-10

Paid AI Subcriptions Are Dead

Google's Gemma 4 is an open-source AI model designed to run locally on personal devices without API costs or cloud dependencies. The model supports 256K context windows, multimodal inputs, and function calling, enabling uses such as local coding assistants, document analysis, and AI agents.

Dev.to - Claude · 2026-04-10

I Built Routiform After Hitting Every Limit with 9router and OmniRoute

A developer created Routiform, an AI gateway tool, after encountering bugs in 9router and OmniRoute including Docker build failures, authentication issues, and cost tracking problems. Routiform addresses these issues with improved Docker support, fixed login flows after backups, and better OAuth ...

Dev.to - AI · 2026-04-10

PyTorch Foundation Expands AI Stack with Safetensors, ExecuTorch, and Helion

The PyTorch Foundation announced three new projects joining its portfolio at PyTorch Conference EU: Safetensors for secure model distribution, ExecuTorch for inference capabilities, and Helion. The additions expand the foundation's vendor-neutral infrastructure for the AI lifecycle.

The New Stack · 2026-04-10

GitHub Repo Size

Simon Willison created a tool that displays GitHub repository sizes by querying GitHub's API, which GitHub's web interface does not expose. Users can input a repository name to see its total size.

Simon Willison · 2026-04-10

I Built Blackdesk, a Keyboard-First Market Research Terminal with AI Connectors

Blackdesk is an open-source market research terminal built in Go that consolidates stock quotes, charts, news, screeners, and AI analysis into a single keyboard-driven interface. The tool runs locally and supports multiple AI connectors including Codex and Claude Code for context-aware market ana...

Dev.to - Claude · 2026-04-09

One Open Source Project a Day (No.34): second-brain-skills - A Skill Toolkit That Turns Claude Code Into a Knowledge Work Expert

Cole Medin released second-brain-skills, an open-source toolkit that extends Claude Code with skills for knowledge work tasks including brand guidelines, presentations, operations documentation, and video generation. The project uses progressive context disclosure to manage token usage and had 64...

Dev.to - Claude · 2026-04-09

I built kerf-cli because Claude Code told me not to worry about cost

A developer built kerf-cli, a local CLI tool that analyzes Claude Code's session logs stored in SQLite, enabling detailed cost tracking and optimization recommendations that Claude Code's native `/cost` command does not provide.

Dev.to - Claude · 2026-04-08

I built an app for people who manage multiple projects with Claude Code.

A developer released Octopal, an open-source app that lets users manage multiple Claude AI agents organized by project folder, with agents communicating via chat interface and an orchestrator routing tasks between them.

Dev.to - Claude · 2026-04-08

SQLite WAL Mode Across Docker Containers Sharing a Volume

Simon Willison's research confirms that SQLite WAL mode functions correctly across separate Docker containers sharing the same volume, as containers on the same host share the same shared memory required for WAL coordination.

Simon Willison · 2026-04-08

Google AI Edge Gallery

Google released the AI Edge Gallery app for iPhone, enabling users to run Gemma 4 language models locally on their device with image questioning, audio transcription, and interactive tool-calling demonstrations.

Simon Willison · 2026-04-07

The Rise of AI Asset Marketplaces: Why Developers Need a Registry for Skills, Prompts, and MCP Configs

An open registry called TokRepo launched to centralize AI development assets like MCP configurations, Claude Code skills, and system prompts, enabling searchable discovery and one-command installation similar to npm for JavaScript.

Dev.to - AI · 2026-04-06

Local LLM in my phone sent Hi to my Mom, no api key, no wifi needed.COMPLETELY FREE

A developer built PokeClaw, an Android app that runs a 2.3B-parameter AI model locally on the phone without Wi-Fi or API keys, using the device's accessibility APIs to read UI elements and execute tasks like sending messages entirely on-device.

Dev.to - AI · 2026-04-06

Ustaad: Building a Wiki That Thinks

A developer built Ustaad, an LLM-powered wiki system that automatically organizes and cross-references documents by having the LLM maintain a structured knowledge base at ingestion time rather than searching raw documents at query time.

Dev.to - AI · 2026-04-06

My Claude Code Buddy Moved Into My MacBook's Notch and I Can't Stop Looking at It

A developer created CodeIsland, a tool that displays a Claude AI companion in a MacBook's notch to manage up to 8 simultaneous Claude Code sessions with smart notifications and a gamified buddy character that tracks usage statistics.

Dev.to - Claude · 2026-04-05

quarkus-chat-ui: A Web Front-End for LLMs, and a Real-World Case for POJO-actor

Quarkus-chat-ui is a web interface that allows multiple LLM instances to communicate with each other via HTTP MCP servers, with each instance exposing tools that others can call. The tool was developed as a real-world application of POJO-actor, a Java actor framework for managing concurrent commu...

Dev.to - Claude · 2026-04-05