AI INFRASTRUCTURE

8 posts

Blog · Jun 23, 2026

Cerebras Stock Is a Public Test of AI Inference Demand

Google Trends put CBRS stock on the board after Cerebras' first public-company earnings. The developer takeaway is not a trade. It is that AI inference demand is now being priced, questioned, and audited in public.

AI Infrastructure, Cerebras, Markets, AI Chips, Inference

Blog · Jun 22, 2026

Sakana Fugu and the Case for Not Betting Everything on One Proprietary Model

Sakana Fugu makes a timely argument for model routing: frontier performance should come from swappable systems, not a hard dependency on one proprietary API.

model-routing ai-infrastructure ai-models vendor-lock-in

Blog · Jun 17, 2026

Factory Router, Explained: How Automatic Model Routing Cuts Coding-Agent Spend 20-25%

Factory.ai shipped a router that automatically picks the model for each Droid session and fails over across providers. The vendor claims 20-25% lower token spend and 99.9%+ request reliability. Here is what the product actually does, which of those numbers are unverified vendor claims, and whether a managed router beats DIY routing for your team.

factory-ai model-routing orchestration coding-agents cost-optimization ai-infrastructure

Blog · Jun 15, 2026

OpenRouter Fusion Makes Model Panels Real. Use Them Like Escalation, Not Autopilot

OpenRouter Fusion turns multi-model panels into an API feature. The useful lesson is not to run every prompt through more models. It is to define when a task deserves an expensive second opinion.

OpenRouter, AI Models, Model Routing, Developer Tools, AI Infrastructure

Blog · Jun 10, 2026

Claude Managed Agents: Dreaming, Outcomes, and Multi-Agent Orchestration Explained

Anthropic added three new primitives to Claude Managed Agents in spring 2026: dreaming, outcomes, and multi-agent orchestration. Here is how each one works and when to combine them.

Claude Managed Agents, Multi-Agent, Anthropic, AI Infrastructure

Blog · Jun 10, 2026

Factory AI and the Model Routing Era: How Coding Agents Are Learning to Spend Your Tokens Wisely

Factory AI's Droid agent surfaces a new competitive front in coding tools: cost-per-completed-task. Here's what their architecture reveals about where the whole industry is heading.

factory-ai coding-agents model-routing droid developer-tools cost-optimization ai-infrastructure

Blog · Jun 7, 2026

LLM Routers Compared: LiteLLM vs Portkey vs OpenRouter in 2026

A practical comparison of three LLM routing tools (LiteLLM, Portkey, and OpenRouter), covering cost management, fallbacks, caching, and when to use each in production AI applications.

AI Infrastructure, LLM, Developer Tools, Pricing, Production

Blog · Apr 29, 2026

Cloudflare Agent Memory: A Developer's Guide to the New Primitive

A guide to Cloudflare's Agent Memory primitive: what it stores, its latency profile, how it compares to mem0, and how to wire it into your stack.

Cloudflare, Agents, Memory, AI Infrastructure, Durable Objects
