// category

Open Source Tools

Open-source models, local inference, and community-built AI dev tooling.

Open-source AI tooling — local model runners (Ollama, LM Studio), OSS coding agents, open-weights frontier models (Llama, DeepSeek, Qwen), and community projects pushing the AI dev stack forward without a vendor lock.

98 stories · last 90 days

Chat SDK adds Messenger adapter support

Vercel's Chat SDK added a Messenger adapter, enabling support for messages, reactions, multimedia downloads, postback buttons, and automatic display name retrieval from user profiles. The adapter was based on a community contribution via GitHub PR #461.

Vercel Blog · 2026-05-09

Veritas: Give Your AI Agent the Ability to Know What It Knows

A developer released Veritas, an open-source Python library that tracks confidence levels for AI agent beliefs using a four-component vector covering estimated certainty, source fragility, staleness, and source diversity. The system applies temporal decay to evidence based on source type, with em...

Dev.to - AI · 2026-05-08

llm-gemini 0.31

Simon Willison released llm-gemini 0.31, updating the plugin to reflect that Google's Gemini 3.1 Flash-Lite model is now generally available, having exited preview status.

Simon Willison · 2026-05-08

Kanban in Hermes Agent for Self Hosted LLM Workflows

Hermes Agent includes a Kanban-style task board backed by SQLite that dispatches jobs to self-hosted LLMs via a daemon process. Without parallelism limits, the dispatcher can flood single-GPU or CPU-bound hosts; the current workaround is using `hermes kanban dispatch --max` to cap concurrent prom...

Dev.to - AI · 2026-05-08

AI Skills for Project Management

A developer built two open-source AI workflow tools — `project-complexity-mapper` and `project-complexity-action-planner` — based on TU Delft's TOE framework, which categorizes project complexity into Technical, Organizational, and External dimensions to guide management responses.

Dev.to - Claude · 2026-05-08

7 Must-Have Small Coding AI Models for Local Development in 2026

A 2026 roundup identifies seven small language models suited for local coding use on consumer hardware, including OpenAI's open-weight gpt-oss-20b, a 21-billion-parameter MoE model with 3.6 billion active parameters and a 128k context window, and Alibaba's Qwen3-VL-32B-Instruct.

Dev.to - AI · 2026-05-07

7 Must-Have Small Coding AI Models for Local Development in 2026

A 2026 roundup identifies seven small language models suited for local coding use on consumer hardware, including OpenAI's open-weight gpt-oss-20b, a 21-billion-parameter MoE model with 3.6 billion active parameters and a 128k context window, and Alibaba's Qwen3-VL-32B-Instruct.

Dev.to - AI · 2026-05-07

What Every AI Agent Builder Needs to Know About State Coordination

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that uses atomic propose-validate-commit cycles to prevent concurrent state conflicts. The tool supports 14 frameworks including LangChain, AutoGen, and CrewAI, and includes per-agent token budget contro...

Dev.to - AI · 2026-05-07

datasette-llm 0.1a7

datasette-llm 0.1a7 adds a configuration mechanism for setting default options on specific LLM models, allowing users to define defaults such as model selection and temperature for enrichment operations within Datasette.

Simon Willison · 2026-05-06

llm-echo 0.5a0

Simon Willison released llm-echo 0.5a0, a plugin for the LLM tool that provides a fake "echo" model for automated testing. The update adds a `-o thinking 1` option that simulates a reasoning block, compatible with LLM 0.32a0 and higher.

Simon Willison · 2026-05-06

Welcome to Maintainer Month: Celebrating the people behind the code

GitHub launched its sixth annual Maintainer Month, announcing new tools including granular pull request limits for unknown contributors and pull request archiving to remove spam. The releases follow GitHub data showing merged pull requests have nearly doubled year over year, with AI-generated con...

GitHub Blog · 2026-05-06

The perfect agent orchestration tool for your friendly ADHD developer

A developer and their partner released Apra Fleet, an open-source MCP server on GitHub that allows a single Claude Code session to dispatch tasks to agents running on multiple machines or in isolated local workspaces. The tool supports doer-reviewer loops within a single machine using separate fo...

Dev.to - Claude · 2026-05-05

Inside OpenSearch’s bid to become the default AI data layer

OpenSearch released versions 3.5 and 3.6 in February and April 2026, adding Better Binary Quantization (BBQ) that compresses high-dimensional vectors by 32x and the SEISMIC algorithm for sparse neural approximate nearest neighbor search. The project is positioning the open-source search engine to...

The New Stack · 2026-05-03

llm 0.32a0

Simon Willison released version 0.32a0, an alpha build, of his open-source `llm` command-line tool for interacting with large language models.

Simon Willison · 2026-04-30

LLM 0.32a0 is a major backwards-compatible refactor

Simon Willison released LLM 0.32a0, an alpha version of his Python library and CLI tool for accessing LLMs. The update refactors the core abstraction so model inputs can be represented as a sequence of messages and responses can include multiple typed parts, replacing the previous single-prompt, ...

Simon Willison · 2026-04-30

llm 0.32a1

Simon Willison released llm 0.32a1, a bug fix for the prior 0.32a0 release, correcting an issue where tool-calling conversations were not correctly restored from SQLite storage.

Simon Willison · 2026-04-30

Warp’s gamble: Going open source to take on closed-source rivals

Warp, maker of a Rust-based agentic development environment, released its client as open source under the AGPL license, with OpenAI named as founding sponsor of the repository. The agent workflows powering the platform are built on GPT models, and the company cited faster community-driven develop...

The New Stack · 2026-04-30

VibeVoice: Open-source frontier voice AI

Microsoft released VibeVoice, an open-source voice AI system, publishing the code on GitHub. The project appears to be a speech/voice model or toolkit made publicly available by Microsoft.

Hacker News - Best · 2026-04-29

microsoft/VibeVoice

Microsoft released VibeVoice, an MIT-licensed speech-to-text model with built-in speaker diarization, on January 21, 2026. The 17.3GB model (available in a 5.71GB 4-bit version) handles up to one hour of audio and transcribed 60 minutes of speech in 8 minutes 45 seconds on an M5 Max MacBook Pro.

Simon Willison · 2026-04-28

An open-source spec for orchestration: Symphony

OpenAI published Symphony, an open-source specification for orchestrating Codex agents that connects issue trackers to automated agent systems, aiming to reduce manual context switching in engineering workflows.

OpenAI Blog · 2026-04-28

Boost Your Claude / AI Dev Workflow with These 4 Tools

Four open-source tools for Claude-based development workflows include Superpowers (structured prompt orchestration), UI UX Pro Max Skill (UI/UX prompt sets for React/Tailwind), Awesome Claude Code (curated resource list), and Claude Mem (persistent memory across sessions).

Dev.to - Claude · 2026-04-28

MCP Is a Great Start — But Multi-Agent Production Needs More

A developer released Network-AI, an open-source coordination layer for multi-agent AI systems that addresses race conditions in shared state access. The tool uses a propose-validate-commit cycle and claims support for 14 frameworks including LangChain, AutoGen, CrewAI, and MCP.

Dev.to - AI · 2026-04-28

Why I built an AI-native terminal for network engineers and DevOps

A network engineer released NetCopilot, a beta SSH, Telnet, and Serial terminal client that embeds AI assistance to help diagnose network issues such as BGP failures and routing misconfigurations. The tool supports connections to Cisco, Juniper, and Linux devices and analyzes command outputs in c...

Dev.to - AI · 2026-04-27

llm 0.31

Simon Willison released version 0.31 of his open-source `llm` CLI tool, adding support for OpenAI's GPT-5.5 model, a verbosity level option for GPT-5+ models, and an image detail level parameter for image attachments.

Simon Willison · 2026-04-25

A pelican for GPT-5.5 via the semi-official Codex backdoor API

OpenAI released GPT-5.5 in its Codex product and for paid ChatGPT subscribers, but withheld API access pending safety reviews. Developer Simon Willison built an open-source LLM plugin, llm-openai-via-codex, that accesses GPT-5.5 through the Codex CLI's authentication mechanism, which OpenAI has s...

Simon Willison · 2026-04-24

Run Claude Code in Any Sandbox with One API: AgentBox SDK

AgentBox SDK is a new TypeScript library that provides a single API for running coding agents such as Claude Code across different sandbox providers, including local Docker. It launches agents as server processes communicating via WebSocket or HTTP, preserving approval flows and streaming events.

Dev.to - AI · 2026-04-24

llm-openai-via-codex 0.1a0

Simon Willison released llm-openai-via-codex 0.1a0, a plugin that reuses Codex CLI credentials to make OpenAI API calls via the LLM command-line tool, bypassing the need for separate API key configuration.

Simon Willison · 2026-04-24

I Lost a 3-Hour Debugging Session With Claude. So I Built a Fix.

A developer built a Chrome extension called Claude Exporter that allows users to save Claude AI conversations to PDF, Word (.docx), Google Docs, and Notion, with preserved code block formatting. The tool also supports exporting partial conversations rather than entire sessions.

Dev.to - Claude · 2026-04-24

Extract PDF text in your browser with LiteParse for the web

Simon Willison built a browser-based version of LlamaIndex's LiteParse PDF text extraction tool, available at simonw.github.io/liteparse/. The tool uses PDF.js and Tesseract.js to extract and spatially order text from PDFs entirely client-side, with no data leaving the user's machine.

Simon Willison · 2026-04-24

Introducing OpenAI Privacy Filter

OpenAI released an open-weight model called OpenAI Privacy Filter designed to detect and redact personally identifiable information (PII) in text. The company describes it as achieving state-of-the-art accuracy for PII detection tasks.

OpenAI Blog · 2026-04-23

Pinocchio: a skill to [try to] detect hallucinations

A developer published an AI skill called "Pinocchio" that appends a confidence percentage to responses from Claude and GitHub Copilot in VS Code, intended to flag potential hallucinations. The source code is available on GitHub at luizwbr/pinocchio-skill.

Dev.to - Claude · 2026-04-23

n8n Workflow Automation for Qatar Enterprises

n8n is an open-source, self-hostable workflow automation platform that connects apps and APIs without extensive coding. DigiMateAI published a guide promoting its use for Qatar enterprises, citing local data residency compliance and fixed-cost pricing as advantages over cloud-based alternatives l...

Dev.to - AI · 2026-04-22

OpenClaw Plugins — Ecosystem Guide and Practical Picks

OpenClaw's plugin ecosystem supports native gateway packages for channels, model providers, memory, voice, and web search, installable via CLI with validated manifests. The most mature plugins cover browser access, memory, and observability, while business-automation categories such as CRM and in...

Dev.to - AI · 2026-04-20

llm-anthropic 0.25

Simon Willison released llm-anthropic 0.25, a plugin providing LLM access to Anthropic's Claude models. The update adds support for claude-opus-4.7 with a thinking_effort option, new thinking_display and thinking_adaptive boolean flags, and increased default max_tokens limits per model.

Simon Willison · 2026-04-17

I built a live AI token meter for Claude and Cursor

A developer released TokenBar, a macOS menu bar application that monitors token usage for Claude, Cursor, and other AI tools in real-time. The app costs $5 as a one-time purchase and aims to help users track spending before receiving bills.

Dev.to - Claude · 2026-04-16

datasette 1.0a27

Datasette released version 1.0a27, replacing Django-style CSRF tokens with modern browser headers and adding a RenameTableEvent for plugin compatibility. The alpha also includes new actor parameters for client methods, temporary disk database options, and improvements to the upsert API.

Simon Willison · 2026-04-16

I built an MCP server that lets Claude debug failed cron jobs

A developer released an MCP server for CronSignal that allows Claude and other AI tools to diagnose failed cron jobs, retrieve error logs, and manage monitors through terminal commands. The server exposes tools including list_checks, diagnose_check, get_check_output, pause_check, and resume_check.

Dev.to - Claude · 2026-04-14

Quick Codex: a lightweight workflow layer for Codex CLI

Quick Codex is an open-source workflow layer for Codex CLI that stores task state in files to address context loss during multi-turn coding sessions. The tool provides two modes—qc-flow for exploratory work and qc-lock for strict execution—along with utilities for resuming, verifying, and checkin...

Dev.to - AI · 2026-04-14

Exploring the new `servo` crate

The Servo browser engine was released as an embeddable Rust crate on crates.io. A CLI tool was built to demonstrate its ability to render web pages as screenshots, though compiling Servo itself to WebAssembly proved infeasible due to threading and dependency constraints.

Simon Willison · 2026-04-14

SQLite Query Result Formatter Demo

Simon Willison released a web-based demo tool for SQLite's Query Result Formatter library, which allows users to test various rendering options for SQL result tables using WebAssembly.

Simon Willison · 2026-04-12

Paid AI Subcriptions Are Dead

Google's Gemma 4 is an open-source AI model designed to run locally on personal devices without API costs or cloud dependencies. The model supports 256K context windows, multimodal inputs, and function calling, enabling uses such as local coding assistants, document analysis, and AI agents.

Dev.to - Claude · 2026-04-10

GitHub Repo Size

Simon Willison created a tool that displays GitHub repository sizes by querying GitHub's API, which GitHub's web interface does not expose. Users can input a repository name to see its total size.

Simon Willison · 2026-04-10

Google AI Edge Gallery

Google released the AI Edge Gallery app for iPhone, enabling users to run Gemma 4 language models locally on their device with image questioning, audio transcription, and interactive tool-calling demonstrations.

Simon Willison · 2026-04-07

Ustaad: Building a Wiki That Thinks

A developer built Ustaad, an LLM-powered wiki system that automatically organizes and cross-references documents by having the LLM maintain a structured knowledge base at ingestion time rather than searching raw documents at query time.

Dev.to - AI · 2026-04-06

scan-for-secrets 0.1

Simon Willison released scan-for-secrets 0.1, a Python tool that scans directories for API keys and other secrets, including detecting common encoding variations. The tool supports storing frequently-checked secrets in a configuration file.

Simon Willison · 2026-04-05

scan-for-secrets 0.2

scan-for-secrets 0.2 released with streaming results output, support for scanning multiple directories and individual files, new Python API functions, and a verbose mode option.

Simon Willison · 2026-04-05

Sandbox SDK adds file permission control

Vercel Sandbox SDK 1.9.0 added the ability to set file permissions directly when writing files via a new `mode` property in the writeFiles API, eliminating the need for separate chmod operations.

Vercel Blog · 2026-04-03