
Pricing & Plans

Plan changes, quota tweaks, and the cost-of-agent conversations no one can ignore.

The cost-of-agent conversation is moving fast. This feed tracks new usage-based tiers, quota changes, rate-limit adjustments, and the pricing math behind running AI tools at scale. If your team's weekly Claude or Cursor bill matters, this is the place to watch.

42 stories · last 90 days

Claude API Pricing Explained: What Does It Really Cost in 2025?

Anthropic's Claude API uses token-based pricing with three tiers as of 2025: Haiku at $0.80/$4.00 per million input/output tokens, Sonnet at $3.00/$15.00, and Opus at $15.00/$75.00. A prompt caching feature reduces repeated input costs by 90%, with cached tokens billed at 10% of standard input ra...

Dev.to - Claude · 2026-05-08
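As a rough illustration of the arithmetic in the summary above, the sketch below estimates the cost of a single request from the quoted per-million-token prices and the 10% cached-input rate. The function name `request_cost` and its parameters are hypothetical, and any cache-write surcharge is deliberately left out because the summary does not state one.

```python
# (input, output) prices in USD per million tokens, as quoted in the summary.
PRICES = {
    "haiku": (0.80, 4.00),
    "sonnet": (3.00, 15.00),
    "opus": (15.00, 75.00),
}

# Cached input tokens are billed at 10% of the standard input rate.
CACHED_INPUT_FACTOR = 0.10


def request_cost(model, input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    cached_input_tokens is the part of input_tokens served from the cache.
    Cache-write surcharges, if any, are not modelled.
    """
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    in_price, out_price = PRICES[model]
    fresh_tokens = input_tokens - cached_input_tokens
    total = (
        fresh_tokens * in_price
        + cached_input_tokens * in_price * CACHED_INPUT_FACTOR
        + output_tokens * out_price
    )
    return total / 1_000_000


# Example: a Sonnet call with 200k input tokens (150k of them cached)
# and 4k output tokens.
print(round(request_cost("sonnet", 200_000, 4_000, cached_input_tokens=150_000), 4))
```

Under these assumptions, the cached portion of a long, repeated prompt costs one tenth of what it would uncached, which is where most of the savings on repeated context come from.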

Why AI Builders Need a Unified LLM API Layer

Vector Engine API launched a unified LLM API layer that provides access to GPT, Claude, Gemini, Llama, and DeepSeek models through a single API key with usage-based pricing. New users receive $5 in credits after email verification and an additional $10 after their first successful API call.

Dev.to - AI · 2026-05-08

Hobby projects now default to 30-day deployment retention

Vercel will cap deployment retention for Hobby plan users at 30 days starting April 29th, with older deployments automatically deleted. Exceptions apply to the 10 most recent production deployments and any aliased deployments; Pro and Enterprise plans are unaffected.

Vercel Blog · 2026-04-28

GitHub Copilot is moving to usage-based billing

GitHub announced that Copilot will switch to usage-based billing on June 1, 2026, replacing premium request units with "GitHub AI Credits" calculated on token consumption. Base plan prices remain unchanged, with Pro at $10/month and Pro+ at $39/month each receiving equivalent monthly credit allot...

GitHub Blog · 2026-04-28

Changes to GitHub Copilot Individual plans

GitHub announced changes to its Copilot Individual plans, including tightened usage limits, paused signups for individual tiers, and restriction of Claude Opus 4.7 access to the $39/month Pro+ plan. The company cited increased compute demands from agentic workflows as the reason, and moved from p...

Simon Willison · 2026-04-22

Claude Opus 4.7 seems to use way more tokens than expected

Claude Opus 4.7's updated tokenizer produces approximately 1.4–1.47× more tokens than previous models in real-world developer workloads, exceeding Anthropic's stated estimate of 1.0–1.35×. The higher token count accelerates context consumption and increases effective cost per workflow.

Dev.to - Claude · 2026-04-22
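To see what the multipliers in the summary above mean for a bill, the sketch below scales a baseline workload by a range of tokenizer multipliers. It assumes the per-token price is unchanged, so cost grows in proportion to token count; the helper name `cost_range` and the price used in the example are illustrative, not taken from the article.

```python
def cost_range(baseline_tokens, price_per_million, low, high):
    """Return (min_cost, max_cost) in USD after scaling the token count.

    baseline_tokens: tokens the workload used under the old tokenizer.
    price_per_million: USD per million tokens (assumed unchanged).
    low, high: tokenizer multiplier range, e.g. 1.4 and 1.47.
    """
    base_cost = baseline_tokens * price_per_million / 1_000_000
    return base_cost * low, base_cost * high


# Illustrative workload: 10M tokens at a placeholder price of $10 per million.
stated = cost_range(10_000_000, 10.0, 1.0, 1.35)     # Anthropic's stated estimate
reported = cost_range(10_000_000, 10.0, 1.4, 1.47)   # range reported in the article
print("stated:  ", [round(c, 2) for c in stated])
print("reported:", [round(c, 2) for c in reported])
```

The same multiplier also applies to context consumption: a prompt that filled a given share of the context window before would fill roughly that share times the multiplier afterwards.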

GitHub pauses Copilot sign-ups as AI coding drives up compute demand

GitHub paused new sign-ups for Copilot individual plans and tightened usage limits for existing users, citing increased compute demands from agentic workflows. Microsoft VP Joe Binder said long-running, parallelized sessions now consume more resources than the original plan structure was built to...

The New Stack · 2026-04-22

Changes to GitHub Copilot Individual plans

GitHub paused new sign-ups for Copilot Pro, Pro+, and Student plans, tightened token-based usage limits, and removed Opus models from Pro-tier plans. The company cited increased compute demands from agentic workflows as the cause, and offered refunds for April usage to customers who cancel by May...

GitHub Blog · 2026-04-21

The Real Cost of Running an AI Agent 24/7 — 30 Days of Data

A developer running an autonomous AI agent 24/7 for 30 days reported total monthly API costs of $60–120, with Anthropic's Claude accounting for $45–90 of that. The pipeline uses Claude Sonnet as the primary model, Claude Opus sparingly for strategic tasks, and Mistral Voxtral for text-to-speech, ...

Dev.to - Claude · 2026-04-20

Claude Token Counter, now with model comparisons

Claude Opus 4.7 uses an updated tokenizer that produces approximately 1.46x more tokens than Opus 4.6 for raw text, and up to 3x more for high-resolution images, according to testing with Anthropic's token counting API. Since Opus 4.7 retains Opus 4.6's pricing of $5 per million input tokens, the...

Simon Willison · 2026-04-20

Anthropic Silently Dropped Prompt Cache TTL from 1 Hour to 5 Minutes

Anthropic reduced the default prompt cache TTL for its Claude API from 1 hour to 5 minutes on March 6, 2026, without a public announcement. Developers using cache_control with the "ephemeral" type who make API calls more than 5 minutes apart are now experiencing cache misses and paying full input...

Dev.to - Claude · 2026-04-18

Measuring Claude 4.7's tokenizer costs

An analysis of Claude 4.7's tokenizer found measurable differences in how it encodes text compared to prior versions, with implications for API usage costs. The piece quantified token counts across various input types to assess cost changes for users.

Hacker News - Best · 2026-04-18

Why I stopped paying $20/month for AI and what I use instead

A developer reduced AI tool spending from $40–60 monthly to $2/month by switching from ChatGPT Plus and Claude Pro subscriptions to a flat-rate API proxy, finding their typical usage (code review, debugging, writing) costs only $1.50–3/month in actual API fees.

Dev.to - AI · 2026-04-12

ChatGPT has a new $100 per month Pro subscription

OpenAI launched a $100-per-month ChatGPT Pro subscription offering 5x more Codex coding tool usage than its $20 monthly Plus tier, positioning it to compete directly with Anthropic's similarly priced Claude Max subscription.

The Verge - AI · 2026-04-10

Claude Off-Peak Double Usage: Taiwan Developers Get All-Day Bonus

Anthropic announced a 15-day promotion, running through March 27, 2026, that offers double Claude usage during off-peak hours (8 PM–3 AM Eastern Time). Developers in Asia-Pacific regions including Taiwan benefit most, as their standard 9-to-6 workday falls entirely within off-peak hours.

Dev.to - Claude · 2026-04-07

Claude Is Done with OpenClaw… Or Is It?

Anthropic announced it will stop allowing third-party tools like OpenClaw to draw from user subscriptions, requiring separate payments instead, though the company offered credits and a 30% discount to offset costs.

Dev.to - Claude · 2026-04-06

Custom reporting now available on AI Gateway

Vercel released custom reporting for AI Gateway in beta, allowing Pro and Enterprise users to track AI spending and usage across models, providers, and custom tags through a single API. The service charges $0.075 per 1,000 unique tag or user ID values and $5 per 1,000 queries.

Vercel Blog · 2026-04-03
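The two reporting charges quoted in the summary above can be combined into a quick estimate. The sketch below assumes both charges are prorated linearly rather than rounded up to whole blocks of 1,000, which the summary does not specify; `reporting_cost` is a hypothetical helper, not part of any Vercel SDK.

```python
# Rates quoted in the summary, in USD per 1,000 units.
PRICE_PER_1K_UNIQUE_VALUES = 0.075  # unique tag or user ID values
PRICE_PER_1K_QUERIES = 5.0          # reporting queries


def reporting_cost(unique_values, queries):
    """Estimate the custom-reporting charge in USD (assumes linear proration)."""
    if unique_values < 0 or queries < 0:
        raise ValueError("counts must be non-negative")
    return (
        unique_values / 1000 * PRICE_PER_1K_UNIQUE_VALUES
        + queries / 1000 * PRICE_PER_1K_QUERIES
    )


# Example: 20,000 unique user IDs and 3,000 reporting queries.
print(round(reporting_cost(20_000, 3_000), 2))
```

At these rates, query volume is likely to dominate the charge unless the number of distinct tags or user IDs is very large.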