DeepSeek V4 is trending because it is close enough to frontier coding models at a much lower token price. The real question for developers is where cheap reasoning belongs in an agent stack.
How to ship Claude's Batch API in production. 50% cost savings, TypeScript SDK code, JSONL request format, and the async architecture gotchas that bite at 100k requests.
A production guide to Claude's extended thinking mode. Real cost math, TypeScript SDK code, and the tasks where reasoning tokens are worth 3x the spend.
Cut Claude API spend by up to 90% with prompt caching. Real numbers, TypeScript SDK code, and the gotchas Anthropic's docs gloss over.
