Open Source Tools

Open-source models, local inference, and community-built AI dev tooling.

Open-source AI tooling: local model runners (Ollama, LM Studio), OSS coding agents, open-weights frontier models (Llama, DeepSeek, Qwen), and community projects pushing the AI dev stack forward without vendor lock-in.

156 stories · last 90 days

An open-source version of Dynamic Workflows

A developer reported spending over 300 RMB in one morning using Claude Code's Dynamic Workflows feature, which runs multiple agents in parallel for validation and solution selection. They identified an open-source alternative called OpenWorkflows, available on GitHub, that supports lower-cost mod...

Dev.to - Claude · 2026-06-01

Spec-Driven Development with Superpowers: How My Vibecoding Evolved

Superpowers is an open-source plugin for AI coding agents including Claude Code, Cursor, and Gemini CLI that injects structured behavioral skills into the model. The plugin, available at github.com/obra/superpowers, guides agents through a defined workflow including brainstorming, spec writing, a...

Dev.to - Claude · 2026-05-30

llm-anthropic 0.25.1

Simon Willison released llm-anthropic 0.25.1, adding support for Anthropic's Claude Opus 4.8 model, a new fast mode option for eligible organizations, and changing the default max_tokens to each model's maximum output instead of 8,192.

Simon Willison · 2026-05-29

Genesis AI SDK — A Universal Flutter SDK for AI Agents

Genesis AI SDK is a Flutter package that provides a single API for building AI agents across seven providers, including Gemini, OpenAI, Anthropic, HuggingFace, Ollama, and on-device Gemma and GGUF models. The SDK includes built-in tool calling via a ReAct loop, persistent memory, and safety guard...

Dev.to - AI · 2026-05-29

markdown-svg-renderer

Simon Willison released markdown-svg-renderer, a web tool that renders Markdown with special handling for fenced SVG code blocks, displaying both the rendered image and a code view tab. It accepts pasted Markdown or URLs pointing to CORS-enabled Markdown files or GitHub Gists.

Simon Willison · 2026-05-29

Building a Runtime Continuity Layer for AI Coding Agents

Contorium Labs released Contorium, an open-source runtime continuity layer for AI coding agents that tracks workspace state, git state, and session context across sessions. It is compatible with Cursor, VS Code, Claude Code, Codex, and MCP-based agents, and does not use chat history for state per...

Dev.to - AI · 2026-05-28

Stop Paying for Noise: Trim LLM Tokens from Both Ends of the Pipe

RTK, an open-source CLI proxy, claims to reduce LLM input tokens by up to 89% by filtering noise from developer command output, while caveman, a Claude Code skill, claims 65% output token reduction by constraining model response verbosity. Both tools are MIT-licensed and available on GitHub.

Dev.to - Claude · 2026-05-27

Voicebox: The Open-Source AI Voice Studio That Just Hit 28K Stars

Voicebox, an open-source desktop voice synthesis application, has reached 28,500 GitHub stars. Released under the MIT license, it runs locally, supports 7 text-to-speech engines, voice cloning from short audio clips, 23 languages, and includes a Model Context Protocol server for AI agent integrat...

Dev.to - AI · 2026-05-26

AiFinPay: Autonomous Payments for ruvnet/ruflo

AiFinPay released a Python library called `aifinpay-agent` that provides payment infrastructure for AI agents. The package is installable via pip and the source code is available on GitHub under the AiFinPay organization.

Dev.to - AI · 2026-05-26

datasette-agent 0.1a4

Datasette-agent 0.1a4 was released, adding a "Start a new agent chat" interface to the Jump to menu (triggered by pressing "/"), using the makeJumpSections() JavaScript plugin hook introduced in Datasette 1.0a30.

Simon Willison · 2026-05-25

datasette 1.0a30

Datasette 1.0a30 alpha was released, adding a "Jump to..." navigation menu triggered by pressing `/` that filters databases, tables, and debug options. A new `jump_items_sql()` plugin hook allows plugins to add custom items to the menu.

Simon Willison · 2026-05-25

codegraph: The Missing Knowledge Graph for 5 Coding Agents

Colbymchenry released codegraph, a local code knowledge graph that indexes codebases into SQLite via Tree-sitter and exposes nine tools over an MCP server interface for use with Claude Code, Codex CLI, Cursor, OpenCode, and Hermes Agent. Benchmarks across seven codebases showed 59% fewer tokens, ...

Dev.to - Claude · 2026-05-24

What 9,830 Claude conversations reveal about your own habits

A developer tool called skill-tree analyzes Claude Code session history, scoring users across 11 AI collaboration behaviors drawn from a taxonomy based on 9,830 Claude conversations, then assigns one of seven archetype profiles and suggests unused behaviors to practice.

Dev.to - Claude · 2026-05-23

Datasette Agent

Simon Willison released Datasette Agent, a conversational AI assistant for the Datasette data tool that generates SQL queries from natural language questions. It uses a plugin architecture and runs on Gemini 3.1 Flash-Lite in the live demo at agent.datasette.io.

Simon Willison · 2026-05-22

datasette-agent 0.1a3

Datasette Agent version 0.1a3 was released, adding "View SQL query" buttons for tables and SQL result tool calls, suppressing empty reasoning chunks, and improving handling of truncated responses so tables remain visible when SQL results are cut off.

Simon Willison · 2026-05-22

datasette-agent-charts 0.1a2

Simon Willison released datasette-agent-charts 0.1a2, adding "View SQL query" buttons below rendered charts in the Datasette agent plugin.

Simon Willison · 2026-05-22

Counting tokens is dumb. So we built a free metric for AI proficiency.

A company launched AIQ Rank, a free tool that scores AI coding proficiency from 0-1000 across 11 dimensions by reading local session activity from tools like Claude Code, Codex, and Cursor. The tool runs locally without transmitting data, and measures factors like customization, parallel agent us...

Dev.to - Claude · 2026-05-22

datasette-agent-sprites 0.1a0

Simon Willison released datasette-agent-sprites 0.1a0, an alpha-stage Datasette Agent plugin that enables running commands inside a Fly Sprites sandbox environment.

Simon Willison · 2026-05-22

The Future Is Going to B(e) MAD

A December 2025 analysis of 470 open-source GitHub pull requests found AI co-authored code had 1.7x more major issues and 2.74x higher security vulnerability rates than human-written code. BMAD is an open-source framework that applies traditional software development lifecycle phases to AI-assist...

Dev.to - Claude · 2026-05-21

Understanding AI Code Fast: A 60-Second Habit for Institutional Memory

Developer Shrijith Venkatramana released git-lrc, an open-source AI code reviewer that runs on each Git commit and generates summaries to help teams maintain institutional knowledge of AI-generated code. The tool implements an "awareness loop" — converting commits into daily and weekly reports to...

Dev.to - AI · 2026-05-21

datasette-llm 0.1a8

datasette-llm version 0.1a8 was released with a single bug fix addressing an issue where the `llm_prompt_context()` hook did not fully collect chains of responses.

Simon Willison · 2026-05-20

datasette-llm-accountant 0.1a4

Simon Willison released datasette-llm-accountant 0.1a4, a Datasette plugin for tracking LLM usage, with a bug fix for tracking chains of responses.

Simon Willison · 2026-05-20

llm-gemini 0.32a0

Simon Willison released llm-gemini 0.32a0, a plugin compatible with llm 0.32a0 alpha that adds support for streaming reasoning tokens from Gemini models.

Simon Willison · 2026-05-20

llm-gemini 0.32

Simon Willison released llm-gemini 0.32, adding support for Google's Gemini 3.5 Flash model via the new `gemini-3.5-flash` identifier in his LLM plugin.

Simon Willison · 2026-05-20

I Built an API That Parses Any Contract into Structured JSON

A developer released Clausify, an API that accepts contract documents in PDF, Word, or image formats and returns structured JSON containing fields such as parties, dates, duration, and governing law. The API is available on RapidAPI with a free tier allowing 20 requests per month.

Dev.to - AI · 2026-05-18

datasette-llm-limits 0.1a0

Simon Willison released datasette-llm-limits 0.1a0, a Datasette plugin that enables per-user or global spending limits on LLM usage, configurable by scope and time window, such as a $1.00 rolling 24-hour per-user cap.

Simon Willison · 2026-05-16
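
A rolling per-user cap like the $1.00 per 24 hours example can be sketched as follows. This is only an illustration of the idea, not the plugin's code: the class name `RollingSpendLimit`, its methods, and the choice to track spend in integer cents are all assumptions made here.

```python
from collections import defaultdict, deque


class RollingSpendLimit:
    """Hypothetical per-user spending cap over a rolling time window."""

    def __init__(self, cap_usd=1.00, window_seconds=24 * 60 * 60):
        # Track integer cents so repeated additions do not drift.
        self.cap_cents = round(cap_usd * 100)
        self.window_seconds = window_seconds
        self._spend = defaultdict(deque)  # user -> deque of (timestamp, cents)

    def _prune(self, user, now):
        # Drop charges that have aged out of the rolling window.
        entries = self._spend[user]
        while entries and entries[0][0] <= now - self.window_seconds:
            entries.popleft()

    def spent_cents(self, user, now):
        self._prune(user, now)
        return sum(cents for _, cents in self._spend[user])

    def try_charge(self, user, cents, now):
        """Record the charge and return True only if it fits under the cap."""
        if self.spent_cents(user, now) + cents > self.cap_cents:
            return False
        self._spend[user].append((now, cents))
        return True
```

The sketch assumes timestamps are passed in seconds and never go backwards; a global cap would use one shared key instead of a per-user key.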

datasette-ip-rate-limit 0.1a0

Simon Willison released datasette-ip-rate-limit 0.1a0, a Datasette plugin that blocks IPs exceeding configurable request thresholds, built using OpenAI's Codex to address aggressive crawler traffic on datasette.io. The production configuration limits demo database paths to 60 requests per 60 seco...

Simon Willison · 2026-05-15

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state overwrites. The project claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and the Model Context Protocol.

Dev.to - AI · 2026-05-14
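
One common way to realize a propose-validate-commit cycle is optimistic concurrency with a version counter. The sketch below shows that general pattern only; `SharedState` and its methods are hypothetical names and do not reflect Network-AI's actual API.

```python
import threading


class SharedState:
    """Hypothetical versioned store; illustrative only."""

    def __init__(self):
        self._lock = threading.Lock()
        self._data = {}
        self._version = 0

    def read(self):
        # Return a snapshot together with the version it was read at.
        with self._lock:
            return dict(self._data), self._version

    def propose(self, base_version, changes):
        # A proposal remembers which version the agent saw before deciding.
        return {"base_version": base_version, "changes": dict(changes)}

    def validate_and_commit(self, proposal):
        """Atomically apply the proposal only if nothing changed since it was read."""
        with self._lock:
            if proposal["base_version"] != self._version:
                return False  # stale: another agent committed first
            self._data.update(proposal["changes"])
            self._version += 1
            return True
```

An agent whose commit is rejected would re-read the state and propose again, so a concurrent write is never silently overwritten.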

llm 0.32a2

Simon Willison released llm 0.32a2, an alpha version of his LLM command-line tool. The update switches reasoning-capable OpenAI models to the `/v1/responses` API endpoint, allowing summarized reasoning tokens to be displayed during prompts, with a `--hide-reasoning` flag to suppress them.

Simon Willison · 2026-05-13

Chat SDK adds Messenger adapter support

Vercel's Chat SDK added a Messenger adapter, enabling support for messages, reactions, multimedia downloads, postback buttons, and automatic display name retrieval from user profiles. The adapter was based on a community contribution via GitHub PR #461.

Vercel Blog · 2026-05-09

Veritas: Give Your AI Agent the Ability to Know What It Knows

A developer released Veritas, an open-source Python library that tracks confidence levels for AI agent beliefs using a four-component vector covering estimated certainty, source fragility, staleness, and source diversity. The system applies temporal decay to evidence based on source type, with em...

Dev.to - AI · 2026-05-08

llm-gemini 0.31

Simon Willison released llm-gemini 0.31, updating the plugin to reflect that Google's Gemini 3.1 Flash-Lite model is now generally available, having exited preview status.

Simon Willison · 2026-05-08

Kanban in Hermes Agent for Self Hosted LLM Workflows

Hermes Agent includes a Kanban-style task board backed by SQLite that dispatches jobs to self-hosted LLMs via a daemon process. Without parallelism limits, the dispatcher can flood single-GPU or CPU-bound hosts; the current workaround is using `hermes kanban dispatch --max` to cap concurrent prom...

Dev.to - AI · 2026-05-08
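
The effect of capping concurrent dispatches can be illustrated with a semaphore. This is a generic sketch, not Hermes Agent's dispatcher: `dispatch_all`, `run_job`, and `max_concurrent` are hypothetical names chosen for the example.

```python
import asyncio


async def dispatch_all(jobs, run_job, max_concurrent=2):
    """Run every job, never letting more than max_concurrent run at once."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(job):
        # Jobs beyond the cap wait here until a slot frees up.
        async with semaphore:
            return await run_job(job)

    # gather preserves the order of the input jobs in its results.
    return await asyncio.gather(*(guarded(job) for job in jobs))
```

On a single-GPU host, a cap of 1 or 2 would keep queued jobs waiting instead of sending them all to the model at once.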

AI Skills for Project Management

A developer built two open-source AI workflow tools — `project-complexity-mapper` and `project-complexity-action-planner` — based on TU Delft's TOE framework, which categorizes project complexity into Technical, Organizational, and External dimensions to guide management responses.

Dev.to - Claude · 2026-05-08

7 Must-Have Small Coding AI Models for Local Development in 2026

A 2026 roundup identifies seven small language models suited for local coding use on consumer hardware, including OpenAI's open-weight gpt-oss-20b, a 21-billion-parameter MoE model with 3.6 billion active parameters and a 128k context window, and Alibaba's Qwen3-VL-32B-Instruct.

Dev.to - AI · 2026-05-07

What Every AI Agent Builder Needs to Know About State Coordination

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state conflicts. The tool supports 14 frameworks including LangChain, AutoGen, and CrewAI, and includes per-agent token budget contro...

Dev.to - AI · 2026-05-07

datasette-llm 0.1a7

datasette-llm 0.1a7 adds a configuration mechanism for setting default options on specific LLM models, allowing users to define defaults such as model selection and temperature for enrichment operations within Datasette.

Simon Willison · 2026-05-06

llm-echo 0.5a0

Simon Willison released llm-echo 0.5a0, a plugin for the LLM tool that provides a fake "echo" model for automated testing. The update adds a `-o thinking 1` option that simulates a reasoning block, compatible with LLM 0.32a0 and higher.

Simon Willison · 2026-05-06

Welcome to Maintainer Month: Celebrating the people behind the code

GitHub launched its sixth annual Maintainer Month, announcing new tools including granular pull request limits for unknown contributors and pull request archiving to remove spam. The releases follow GitHub data showing merged pull requests have nearly doubled year over year, with AI-generated con...

GitHub Blog · 2026-05-06

The perfect agent orchestration tool for your friendly ADHD developer

A developer and their partner released Apra Fleet, an open-source MCP server on GitHub that allows a single Claude Code session to dispatch tasks to agents running on multiple machines or in isolated local workspaces. The tool supports doer-reviewer loops within a single machine using separate fo...

Dev.to - Claude · 2026-05-05

Inside OpenSearch’s bid to become the default AI data layer

OpenSearch released versions 3.5 and 3.6 in February and April 2026, adding Better Binary Quantization (BBQ) that compresses high-dimensional vectors by 32x and the SEISMIC algorithm for sparse neural approximate nearest neighbor search. The project is positioning the open-source search engine to...

The New Stack · 2026-05-03
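
A 32x figure is what one would expect when each 32-bit float dimension is reduced to a single bit. The sketch below assumes float32 source vectors and uses plain sign quantization to show the arithmetic; it is not OpenSearch's BBQ algorithm, and the function names are hypothetical.

```python
def binarize(vector):
    """Pack the sign of each float into one bit (plain sign quantization)."""
    packed = bytearray((len(vector) + 7) // 8)
    for i, value in enumerate(vector):
        if value > 0:
            packed[i // 8] |= 1 << (i % 8)
    return bytes(packed)


def hamming_distance(a, b):
    """Count differing bits, a cheap dissimilarity proxy for binarized vectors."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


dims = 1024
float32_bytes = dims * 4                    # 32 bits per dimension
binary_bytes = len(binarize([0.5] * dims))  # 1 bit per dimension
compression_ratio = float32_bytes / binary_bytes  # 4096 / 128
```

Production quantizers typically keep extra correction terms to recover accuracy, so real storage savings may differ somewhat from this idealized ratio.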

llm 0.32a0

Simon Willison released version 0.32a0, an alpha build, of his open-source `llm` command-line tool for interacting with large language models.

Simon Willison · 2026-04-30

LLM 0.32a0 is a major backwards-compatible refactor

Simon Willison released LLM 0.32a0, an alpha version of his Python library and CLI tool for accessing LLMs. The update refactors the core abstraction so model inputs can be represented as a sequence of messages and responses can include multiple typed parts, replacing the previous single-prompt, ...

Simon Willison · 2026-04-30

llm 0.32a1

Simon Willison released llm 0.32a1, a bug fix for the prior 0.32a0 release, correcting an issue where tool-calling conversations were not correctly restored from SQLite storage.

Simon Willison · 2026-04-30

Warp’s gamble: Going open source to take on closed-source rivals

Warp, maker of a Rust-based agentic development environment, released its client as open source under the AGPL license, with OpenAI named as founding sponsor of the repository. The agent workflows powering the platform are built on GPT models, and the company cited faster community-driven develop...

The New Stack · 2026-04-30

VibeVoice: Open-source frontier voice AI

Microsoft released VibeVoice, an open-source voice AI project, and published its code on GitHub.

Hacker News - Best · 2026-04-29

microsoft/VibeVoice

Microsoft released VibeVoice, an MIT-licensed speech-to-text model with built-in speaker diarization, on January 21, 2026. The 17.3GB model (available in a 5.71GB 4-bit version) handles up to one hour of audio and transcribed 60 minutes of speech in 8 minutes 45 seconds on an M5 Max MacBook Pro.

Simon Willison · 2026-04-28

An open-source spec for orchestration: Symphony

OpenAI published Symphony, an open-source specification for orchestrating Codex agents that connects issue trackers to automated agent systems, aiming to reduce manual context switching in engineering workflows.

OpenAI Blog · 2026-04-28

Boost Your Claude / AI Dev Workflow with These 4 Tools

Four open-source tools for Claude-based development workflows include Superpowers (structured prompt orchestration), UI UX Pro Max Skill (UI/UX prompt sets for React/Tailwind), Awesome Claude Code (curated resource list), and Claude Mem (persistent memory across sessions).

Dev.to - Claude · 2026-04-28

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that addresses race conditions in shared state access. The tool uses a propose-validate-commit cycle and claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and MCP.

Dev.to - AI · 2026-04-28

Why I built an AI-native terminal for network engineers and DevOps

A network engineer released NetCopilot, a beta SSH, Telnet, and Serial terminal client that embeds AI assistance to help diagnose network issues such as BGP failures and routing misconfigurations. The tool supports connections to Cisco, Juniper, and Linux devices and analyzes command outputs in c...

Dev.to - AI · 2026-04-27

llm 0.31

Simon Willison released version 0.31 of his open-source `llm` CLI tool, adding support for OpenAI's GPT-5.5 model, a verbosity level option for GPT-5+ models, and an image detail level parameter for image attachments.

Simon Willison · 2026-04-25

A pelican for GPT-5.5 via the semi-official Codex backdoor API

OpenAI released GPT-5.5 in its Codex product and for paid ChatGPT subscribers, but withheld API access pending safety reviews. Developer Simon Willison built an open-source LLM plugin, llm-openai-via-codex, that accesses GPT-5.5 through the Codex CLI's authentication mechanism, which OpenAI has s...

Simon Willison · 2026-04-24

Run Claude Code in Any Sandbox with One API: AgentBox SDK

AgentBox SDK is a new TypeScript library that provides a single API for running coding agents such as Claude Code across different sandbox providers, including local Docker. It launches agents as server processes communicating via WebSocket or HTTP, preserving approval flows and streaming events.

Dev.to - AI · 2026-04-24

llm-openai-via-codex 0.1a0

Simon Willison released llm-openai-via-codex 0.1a0, a plugin that reuses Codex CLI credentials to make OpenAI API calls via the LLM command-line tool, bypassing the need for separate API key configuration.

Simon Willison · 2026-04-24

I Lost a 3-Hour Debugging Session With Claude. So I Built a Fix.

A developer built a Chrome extension called Claude Exporter that allows users to save Claude AI conversations to PDF, Word (.docx), Google Docs, and Notion, with preserved code block formatting. The tool also supports exporting partial conversations rather than entire sessions.

Dev.to - Claude · 2026-04-24

Extract PDF text in your browser with LiteParse for the web

Simon Willison built a browser-based version of LlamaIndex's LiteParse PDF text extraction tool, available at simonw.github.io/liteparse/. The tool uses PDF.js and Tesseract.js to extract and spatially order text from PDFs entirely client-side, with no data leaving the user's machine.

Simon Willison · 2026-04-24

Introducing OpenAI Privacy Filter

OpenAI released an open-weight model called OpenAI Privacy Filter designed to detect and redact personally identifiable information (PII) in text. The company describes it as achieving state-of-the-art accuracy for PII detection tasks.

OpenAI Blog · 2026-04-23

Pinocchio: a skill to [try to] detect hallucinations

A developer published an AI skill called "Pinocchio" that appends a confidence percentage to responses from Claude and GitHub Copilot in VS Code, intended to flag potential hallucinations. The source code is available on GitHub at luizwbr/pinocchio-skill.

Dev.to - Claude · 2026-04-23

n8n Workflow Automation for Qatar Enterprises

n8n is an open-source, self-hostable workflow automation platform that connects apps and APIs without extensive coding. DigiMateAI published a guide promoting its use for Qatar enterprises, citing local data residency compliance and fixed-cost pricing as advantages over cloud-based alternatives l...

Dev.to - AI · 2026-04-22

OpenClaw Plugins — Ecosystem Guide and Practical Picks

OpenClaw's plugin ecosystem supports native gateway packages for channels, model providers, memory, voice, and web search, installable via CLI with validated manifests. The most mature plugins cover browser access, memory, and observability, while business-automation categories such as CRM and in...

Dev.to - AI · 2026-04-20

llm-anthropic 0.25

Simon Willison released llm-anthropic 0.25, a plugin providing LLM access to Anthropic's Claude models. The update adds support for claude-opus-4.7 with a thinking_effort option, new thinking_display and thinking_adaptive boolean flags, and increased default max_tokens limits per model.

Simon Willison · 2026-04-17

I built a live AI token meter for Claude and Cursor

A developer released TokenBar, a macOS menu bar application that monitors token usage for Claude, Cursor, and other AI tools in real-time. The app costs $5 as a one-time purchase and aims to help users track spending before receiving bills.

Dev.to - Claude · 2026-04-16

datasette 1.0a27

Datasette released version 1.0a27, replacing Django-style CSRF tokens with modern browser headers and adding a RenameTableEvent for plugin compatibility. The alpha also includes new actor parameters for client methods, temporary disk database options, and improvements to the upsert API.

Simon Willison · 2026-04-16

I built an MCP server that lets Claude debug failed cron jobs

A developer released an MCP server for CronSignal that allows Claude and other AI tools to diagnose failed cron jobs, retrieve error logs, and manage monitors through terminal commands. The server exposes tools including list_checks, diagnose_check, get_check_output, pause_check, and resume_check.

Dev.to - Claude · 2026-04-14

Quick Codex: a lightweight workflow layer for Codex CLI

Quick Codex is an open-source workflow layer for Codex CLI that stores task state in files to address context loss during multi-turn coding sessions. The tool provides two modes—qc-flow for exploratory work and qc-lock for strict execution—along with utilities for resuming, verifying, and checkin...

Dev.to - AI · 2026-04-14

Exploring the new `servo` crate

The Servo browser engine was released as an embeddable Rust crate on crates.io. A CLI tool was built to demonstrate its ability to render web pages as screenshots, though compiling Servo itself to WebAssembly proved infeasible due to threading and dependency constraints.

Simon Willison · 2026-04-14

SQLite Query Result Formatter Demo

Simon Willison released a web-based demo tool for SQLite's Query Result Formatter library, which allows users to test various rendering options for SQL result tables using WebAssembly.

Simon Willison · 2026-04-12

Paid AI Subscriptions Are Dead

Google's Gemma 4 is an open-source AI model designed to run locally on personal devices without API costs or cloud dependencies. The model supports 256K context windows, multimodal inputs, and function calling, enabling uses such as local coding assistants, document analysis, and AI agents.

Dev.to - Claude · 2026-04-10

GitHub Repo Size

Simon Willison created a tool that displays GitHub repository sizes by querying GitHub's API, which GitHub's web interface does not expose. Users can input a repository name to see its total size.

Simon Willison · 2026-04-10

Google AI Edge Gallery

Google released the AI Edge Gallery app for iPhone, enabling users to run Gemma 4 language models locally on their device with image questioning, audio transcription, and interactive tool-calling demonstrations.

Simon Willison · 2026-04-07

Ustaad: Building a Wiki That Thinks

A developer built Ustaad, an LLM-powered wiki system that automatically organizes and cross-references documents by having the LLM maintain a structured knowledge base at ingestion time rather than searching raw documents at query time.

Dev.to - AI · 2026-04-06

scan-for-secrets 0.1

Simon Willison released scan-for-secrets 0.1, a Python tool that scans directories for API keys and other secrets, including detecting common encoding variations. The tool supports storing frequently-checked secrets in a configuration file.

Simon Willison · 2026-04-05
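
Checking for encoding variations could look roughly like this. Which encodings scan-for-secrets actually covers is not stated here, so the set below (Base64, hex, URL encoding, JSON escaping) and the function names are assumptions for illustration.

```python
import base64
import json
import urllib.parse


def encoded_variants(secret):
    """Return the secret plus a few common encodings of it (not exhaustive)."""
    raw = secret.encode("utf-8")
    return {
        "plain": secret,
        "base64": base64.b64encode(raw).decode("ascii"),
        "hex": raw.hex(),
        "url": urllib.parse.quote(secret, safe=""),
        "json": json.dumps(secret)[1:-1],  # JSON string escaping without the quotes
    }


def find_secret(text, secret):
    """Return the sorted names of the variants that occur in text."""
    variants = encoded_variants(secret)
    return sorted(name for name, variant in variants.items() if variant in text)
```

A plain substring search would miss a key that was URL-encoded in a log line or Base64-encoded in a config file, which is the gap this kind of check is meant to close. The Base64 check only matches a secret that was encoded on its own, not one embedded in a longer encoded string.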

scan-for-secrets 0.2

scan-for-secrets 0.2 released with streaming results output, support for scanning multiple directories and individual files, new Python API functions, and a verbose mode option.

Simon Willison · 2026-04-05

Sandbox SDK adds file permission control

Vercel Sandbox SDK 1.9.0 added the ability to set file permissions directly when writing files via a new `mode` property in the writeFiles API, eliminating the need for separate chmod operations.

Vercel Blog · 2026-04-03