Open-source models, local inference, and community-built AI dev tooling.
Simon Willison released llm-anthropic 0.25, a plugin providing LLM access to Anthropic's Claude models. The update adds support for claude-opus-4.7 with a thinking_effort option, new thinking_display and thinking_adaptive boolean flags, and increased default max_tokens limits per model.
Alibaba's Qwen3.6-35B-A3B, run locally as a 20.9GB quantized model on a MacBook Pro M5, produced higher-quality SVG illustrations than Anthropic's Claude Opus 4.7 in informal tests conducted by Simon Willison on April 16, 2026.
Hugging Face released HoloTab, a browser-based AI agent designed for "computer use" tasks — allowing the agent to interact with web interfaces autonomously. The project is open-source and operates through the browser to automate computer interactions.
Google's Gemma 4 AI model can run natively on iPhones with full offline inference, eliminating the need for cloud connectivity to use the model.
A developer released TokenBar, a macOS menu bar application that monitors token usage for Claude, Cursor, and other AI tools in real-time. The app costs $5 as a one-time purchase and aims to help users track spending before receiving bills.
Enterprises are increasingly adopting self-hosted open-source image generation models combined with affordable APIs instead of relying solely on proprietary services like DALL-E and Midjourney, with per-image costs dropping from $0.02-$0.25 to $0.0005-$0.001 when self-hosted at scale.
A developer released a free, browser-based tool that removes visible and invisible watermarks from Google Gemini-generated images using local processing with no server uploads or account requirements.
Datasette released version 1.0a27, replacing Django-style CSRF tokens with modern browser headers and adding a RenameTableEvent for plugin compatibility. The alpha also includes new actor parameters for client methods, temporary disk database options, and improvements to the upsert API.
Rod Johnson, creator of the Spring Framework, launched Embabel, an Apache-licensed agentic AI framework for Java built on Spring Boot, at Microsoft's JDConf conference to address enterprise predictability challenges in large language model applications.
GitHub introduced Code Security Risk Assessment, a free tool that scans up to 20 repositories using CodeQL to identify vulnerabilities by severity and language, available to organization admins and security managers at no cost.
Datasette pull request #2689 replaces token-based CSRF protection with Sec-Fetch-Site header protection, removing the need for hidden CSRF token form inputs and simplifying the security implementation based on research by Filippo Valsorda and Go 1.25.
A developer released an MCP server for CronSignal that allows Claude and other AI tools to diagnose failed cron jobs, retrieve error logs, and manage monitors through terminal commands. The server exposes tools including list_checks, diagnose_check, get_check_output, pause_check, and resume_check.
Quick Codex is an open-source workflow layer for Codex CLI that stores task state in files to address context loss during multi-turn coding sessions. The tool provides two modes—qc-flow for exploratory work and qc-lock for strict execution—along with utilities for resuming, verifying, and checkin...
A developer launched SimplyLouie, a Claude API gateway offering $10/month flat-rate access with unlimited calls, as an alternative to Anthropic's $15 per million token pricing model.
The Servo browser engine was released as an embeddable Rust crate on crates.io. A CLI tool was built to demonstrate its ability to render web pages as screenshots, though compiling Servo itself to WebAssembly proved infeasible due to threading and dependency constraints.
Everything-claude-code is an open-source enhancement framework for Claude Code that includes 181 skills, 47 sub-agents, and 34 rules designed to improve productivity and code quality. The project, created by Affaan Mustafa, reportedly has over 150,000 GitHub stars and supports multiple AI coding ...
A developer published Sessioncraft, an open-source governance system for managing state and context drift across long-running AI-assisted projects using Claude, after identifying recurring problems across 180+ sessions including stale information and forgotten constraints.
A developer created cc-statistics, an open-source tool that aggregates AI coding costs from Claude Code, Gemini CLI, Codex, and Cursor into a unified view via CLI, web dashboard, and macOS menu-bar app.
Simon Willison released a web-based demo tool for SQLite's Query Result Formatter library, which allows users to test various rendering options for SQL result tables using WebAssembly.
Open Claude Proxy v3.5.0 allows multiple devices on a local network to share a single Claude subscription by routing requests through a central server; new clients can connect by running a single curl command instead of manually configuring API keys.
Claude-code-best-practice is an open-source reference guide by shanraisshan with over 1,000 GitHub stars that provides strategies for using Anthropic's Claude Code CLI tool, including guidance on configuration files, prompt engineering, and coordinating multi-step development tasks.
Google's Gemma 4 is an open-source AI model designed to run locally on personal devices without API costs or cloud dependencies. The model supports 256K context windows, multimodal inputs, and function calling, enabling uses such as local coding assistants, document analysis, and AI agents.
A developer created Routiform, an AI gateway tool, after encountering bugs in 9router and OmniRoute including Docker build failures, authentication issues, and cost tracking problems. Routiform addresses these issues with improved Docker support, fixed login flows after backups, and better OAuth ...
The PyTorch Foundation announced three new projects joining its portfolio at PyTorch Conference EU: Safetensors for secure model distribution, ExecuTorch for inference capabilities, and Helion. The additions expand the foundation's vendor-neutral infrastructure for the AI lifecycle.
Simon Willison created a tool that displays GitHub repository sizes by querying GitHub's API, which GitHub's web interface does not expose. Users can input a repository name to see its total size.
Blackdesk is an open-source market research terminal built in Go that consolidates stock quotes, charts, news, screeners, and AI analysis into a single keyboard-driven interface. The tool runs locally and supports multiple AI connectors including Codex and Claude Code for context-aware market ana...
Cole Medin released second-brain-skills, an open-source toolkit that extends Claude Code with skills for knowledge work tasks including brand guidelines, presentations, operations documentation, and video generation. The project uses progressive context disclosure to manage token usage and had 64...
A developer built kerf-cli, a local CLI tool that analyzes Claude Code's session logs stored in SQLite, enabling detailed cost tracking and optimization recommendations that Claude Code's native `/cost` command does not provide.
A developer released Octopal, an open-source app that lets users manage multiple Claude AI agents organized by project folder, with agents communicating via chat interface and an orchestrator routing tasks between them.
Simon Willison's research confirms that SQLite WAL mode functions correctly across separate Docker containers sharing the same volume, as containers on the same host share the same shared memory required for WAL coordination.
Google released the AI Edge Gallery app for iPhone, enabling users to run Gemma 4 language models locally on their device with image questioning, audio transcription, and interactive tool-calling demonstrations.
An open registry called TokRepo launched to centralize AI development assets like MCP configurations, Claude Code skills, and system prompts, enabling searchable discovery and one-command installation similar to npm for JavaScript.
A developer built PokeClaw, an Android app that runs a 2.3B-parameter AI model locally on the phone without Wi-Fi or API keys, using the device's accessibility APIs to read UI elements and execute tasks like sending messages entirely on-device.
A developer built Ustaad, an LLM-powered wiki system that automatically organizes and cross-references documents by having the LLM maintain a structured knowledge base at ingestion time rather than searching raw documents at query time.
A developer created CodeIsland, a tool that displays a Claude AI companion in a MacBook's notch to manage up to 8 simultaneous Claude Code sessions with smart notifications and a gamified buddy character that tracks usage statistics.
Quarkus-chat-ui is a web interface that allows multiple LLM instances to communicate with each other via HTTP MCP servers, with each instance exposing tools that others can call. The tool was developed as a real-world application of POJO-actor, a Java actor framework for managing concurrent commu...
A developer released CodeIsland, a free open-source macOS app that displays Claude AI coding agent status in the MacBook's notch, allowing users to monitor multiple sessions, track API usage, and view chat history without switching windows.
Simon Willison released scan-for-secrets 0.1, a Python tool that scans directories for API keys and other secrets, including detecting common encoding variations. The tool supports storing frequently-checked secrets in a configuration file.
scan-for-secrets 0.2 released with streaming results output, support for scanning multiple directories and individual files, new Python API functions, and a verbose mode option.
A developer released Apfel, a free tool on GitHub that provides access to AI capabilities already present on macOS systems. The project received 643 upvotes and 138 comments on Hacker News.
Rotato is an open-source Node.js proxy that automatically rotates between multiple API keys to bypass rate limits on services like OpenAI and Gemini. It requires no external dependencies and acts as a drop-in replacement for direct API calls.
Vercel Sandbox SDK 1.9.0 added the ability to set file permissions directly when writing files via a new `mode` property in the writeFiles API, eliminating the need for separate chmod operations.