AI Agents Deep Dive
7 partsTL;DR
Coding changed more in the past two years than in the previous decade. We moved from manual typing to autocomplete, then to multi-file edits.
Coding changed more in the past two years than in the previous decade. We moved from manual typing to autocomplete, then to multi-file edits. Now we have agentic systems that run for minutes - or hours - handling complex tasks autonomously.
The inflection point came roughly a year ago with Sonnet 3.5. That release marked the moment when applications could dynamically build other applications. Tools like Lovable, Bolt, and Cursor's multi-file editing capabilities emerged shortly after. Since then, the focus has shifted from tab completion and function generation to agentic reasoning.
Cloud Code was among the first truly capable agentic systems, particularly when paired with the Claude 4 series. Codex followed, expanding from web apps to IDE integrations. These agent harnesses share one critical trait: you can give them increasingly complex tasks and trust the output, even when they run for extended periods.
Current models from major labs focus squarely on agentic reasoning. Instead of manually writing code, tabbing through suggestions, or managing multi-file edits, you can now provide a prompt - simple or detailed - and let the system generate the solution.

Cursor's Composer is not the most powerful model available. It does not outperform Sonnet 4.5 or the latest Anthropic and OpenAI state-of-the-art models. What it offers instead is velocity.
The faster feedback loops matter when you are building with ambiguous requirements. You iterate quicker, test assumptions sooner, and course-correct without waiting for lengthy completions. For exploratory development, this trade-off often wins.
When building the demonstration application, the stack choices reflected a core principle: do not rebuild what specialized services already do well. The combination of Next.js, Clerk, and Convex provides a foundation that handles deployment, authentication, and data without custom infrastructure.
Next.js with Vercel handles the frontend and deployment. The free tier covers early development, and the $20 tier handles significant traffic. You avoid DevOps complexity while maintaining the option to migrate specific services to GCP or AWS as you scale.
Clerk manages authentication, but it extends beyond basic login. Organizations support comes built-in - no custom tables for invites, role management, or password resets. Their new billing functionality removes the need to wire up Stripe webhooks manually.
Convex provides the database layer with type safety, real-time updates, server functions, and cron jobs. The schema definition is straightforward, and changes reflect immediately in the dashboard.
The key advantage is reducing complexity for both you and your agent. When the underlying services handle authentication, real-time sync, and scaling concerns, your prompts stay focused on business logic rather than infrastructure.

Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
The demonstration started with npx create convex, selecting Next.js and Clerk as providers. The installation includes Cursor rules - examples covering API setup, schema definition, and function calling. These rules reduce the need to reference documentation repeatedly.
Clerk's keyless mode lets you experiment before configuring API credentials. Once claimed, you create a JWT template named "convex" and add the issuer URL to your environment configuration. The application then has authentication, backend, and frontend working in minutes.
Cursor's latest interface defaults to the agent panel rather than the editor - a telling design decision suggesting where coding workflows are heading.
Organizations enable multi-tenant functionality: personal accounts, business workspaces, team invites, and role-based access. Clerk handles the SMTP for invites, the UI components for management, and the permission logic.
Using Context 7 and Firecrawl MCP servers, the agent retrieves current documentation automatically. When prompted to add organization switching to the navigation, the system references Clerk's docs directly, reducing hallucination and ensuring correct implementation.
The result is a dropdown menu for organization switching, creation, and management - functional without writing custom user management code.
With the foundation set, the demonstration moved to feature development. The first request: a neo-brutalist landing page with accessibility-compliant colors, social proof, pricing, and header/footer components. The agent generated the page in one pass.
Next came the authenticated dashboard. The user profile section allows saving name, persona, Twitter handle, and other fields - with data persisting to Convex and reflecting immediately in the dashboard.
The core feature was a tweet scheduling system: a 3x3 tile grid with pagination, a "create tweet" button, scheduling controls, and AI enhancement capabilities. The agent defined the database schema (content, scheduled date, status, enhanced version), created the convex functions for CRUD operations, and wired the UI components.
When organization-scoped data became a requirement, the schema updated to include organizationId. The queries switched from user-based to organization-based filtering. After a schema mismatch error surfaced, the agent resolved it by updating the data structure and re-scoping the queries.

The workflow throughout the demonstration emphasized contained prompts. Rather than requesting multiple unrelated features simultaneously, each prompt focused on a single coherent concept: the landing page, then the profile section, then the tweet scheduler, then organization scoping.
This approach works better with agentic tools. Clear, bounded instructions with contained context windows produce more reliable results than sprawling multi-part requests.
Deploying the stack is straightforward. Vercel handles the Next.js application with a production instance. Clerk provides a production environment toggle. The convex dashboard manages the production database. Domain configuration and environment variable updates complete the transition from local development to live application.
Clerk's billing component extends this further - subscriptions, plan management, and payment processing without custom Stripe integration. With AI features and a refined design system, a functional SaaS emerges from an afternoon of agent-assisted development.
The barrier to building has dropped. Frontend specialists can ship full-stack applications. Backend developers can prototype interfaces. New developers can focus on product logic rather than framework configuration.
Composer 1 is not the most capable agentic tool available - Sonnet 4.5 and GPT-5 produce higher-quality output when you can tolerate longer wait times. But for rapid iteration and ambiguous requirements, the velocity-first approach wins.
What matters now is knowing which foundation tools to leverage. Next.js, Clerk, and Convex eliminate entire categories of complexity. Combined with agentic coding assistants, they enable shipping production applications in hours rather than weeks.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolAI app builder - describe what you want, get a deployed full-stack app with React, Supabase, and auth. No coding requi...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.
Install the dd CLI and scaffold your first AI-powered app in under a minute.
Getting StartedConfigure Claude Code for maximum productivity -- CLAUDE.md, sub-agents, MCP servers, and autonomous workflows.
AI AgentsWhat MCP servers are, how they work, and how to build your own in 5 minutes.
AI Agents
Check out Clerk here: https://go.clerk.com/Bil2F6F As we approach the end of 2025, this video explores the monumental changes in coding over the past couple of years. Join me for a snapshot...

In this video, we dive into Warp 2.0, a cutting-edg Agentic Development environment that integrates natural language processing to execute terminal commands and write code. Learn about the...

In this video, I showcase CodeLLM, an AI-powered code editor that supports multiple models like Claude Sonnet 3.5, DeepSeek R1, and O1, 03 Mini. I demonstrate its features including autocomplete,...

The best Claude Code sessions start with questions, not code. Spec-driven development forces requirements discovery upfr...

General methods that leverage computation are ultimately the most effective - and by a large margin.

Warp 2.0 reimagines what a development environment should look like in the agentic era. Instead of bolting AI onto exist...