Briefing · Monday, June 8, 2026

Apple's Gemini Siri, AI Is Slowing Down, and 1T Tokens/s

Good morning. It's Monday, June 8, and we're covering Apple's surprise WWDC AI architecture reveal, a viral essay arguing the AI scaling party is winding down, and a 1-trillion-parameter model that hits 1,000 tokens per second.

WWDC day means the developer ecosystem has a lot to process before the week is out.

THE BIG ONE

Apple Rebuilt Siri on Google Gemini - and It Runs in Google Cloud

Apple used WWDC 2026 to announce Siri AI, a ground-up rebuild of Apple Intelligence powered by a custom Gemini-derived model (728 points, 557 comments on HN). The headline number that caught developers off guard: Apple is running inference on NVIDIA GPUs in Google Cloud, not on its own Private Cloud Compute silicon. Apple's security blog post confirmed the arrangement, noting PCC's privacy architecture extends to the Google Cloud deployment with attested keys and short-lived processes.

The developer-facing piece is Apple Core AI, a new framework (362 pts) that bridges PyTorch-exported models to Apple hardware via coreai-torch. It integrates with Meta's PyTorch FX graph and maps ATen operators to Core AI operations - meaning existing PyTorch models can run on Apple silicon without a complete rewrite. A developer beta of iOS 27 shipped the same day with the new Siri AI, though access is gated behind a waitlist. Simon Willison covered the announcement in detail, noting that vision LLMs doing screen extraction neatly sidesteps the need for every app to ship Apple Intelligence hooks.

MODELS

"AI Is Slowing Down" - and the Community Is Not Done Arguing About It

Ed Zitron's essay "AI is slowing down" landed at 661 points and 759 comments - the most-commented story of the day. The argument: benchmark improvements have decelerated, real-world coding and reasoning gains are increasingly marginal, and the industry is selling roadmap promises rather than delivered capability. HN's response was split, with engineers sharing evidence on both sides from their own production use.

The timing is pointed - WWDC day is the one day the tech press is normally looking at Apple, not OpenAI or Anthropic. On the same day, OpenAI quietly filed a confidential draft S-1 with the SEC (359 pts, 313 comments), the first step toward a public offering. And a separate analysis argued that xAI now looks more like a datacentre REIT than a frontier lab (687 pts, 542 comments) - the piece notes Musk's company is increasingly in the business of renting compute capacity rather than shipping model improvements.

MODELS

Xiaomi's MiMo: 1 Trillion Parameters, 1,000 Tokens Per Second

Xiaomi dropped MiMo-v2.5-Pro-UltraSpeed (620 pts, 480 comments), a 1-trillion-parameter model that sustains 1,000 tokens per second - roughly 5-10x the throughput of most frontier models at a comparable scale. The claim is inference efficiency through aggressive speculative decoding and custom sparse-activation routing, not a smaller model. Also in model news: DeepSeek V4 Pro was reported to beat GPT-5.5 Pro on precision benchmarks (396 pts), continuing DeepSeek's pattern of releasing competitive open-weight alternatives within weeks of OpenAI releases.

WHAT ELSE IS HAPPENING

Let's Encrypt bans certs in US sanctioned territories: Updated service agreement now prohibits certificate issuance in OFAC-sanctioned regions - sparking a 440-point, 375-comment thread on what this means for legitimate developers in affected countries.
Signal: UK surveillance is not safety: Signal published a formal statement (688 pts) opposing the UK's latest push to require backdoors in encrypted messaging apps.
Show HN: Performative-UI: React component library of UX dark patterns and design tropes, framed as a satirical reference - the most-upvoted story of the day at 1,162 points, 208 comments.
Show HN: Gitdot: Open-source GitHub alternative written in Rust, targeting 100ms first contentful paint with a keyboard-first CLI-inspired UI (321 pts).
Ask HN: What tools have you built since AI?: 427 points, 734 comments - one of the better community threads of the month for concrete examples of AI-assisted personal tooling.
Massachusetts bans location data sales: Passed a privacy bill (365 pts) prohibiting sale of precise location data - the strongest US state-level location privacy law to date.

FROM THE SITE

Agent config files are quietly becoming one of the stealthier attack surfaces in agentic systems. Our post on how Claude Code plugin URLs turn skills into a supply chain covers why fetching agent extensions from arbitrary URLs requires the same scrutiny you'd give a dependency in your package.json.

Every link above goes to a primary source. This brief is part of the Daily Brief archive.

Get the next one in your inbox

The daily brief, delivered. Free, unsubscribe anytime.

Apple Rebuilt Siri on Google Gemini - and It Runs in Google Cloud

MODELS

"AI Is Slowing Down" - and the Community Is Not Done Arguing About It

MODELS

Xiaomi's MiMo: 1 Trillion Parameters, 1,000 Tokens Per Second

WHAT ELSE IS HAPPENING

Let's Encrypt bans certs in US sanctioned territories: Updated service agreement now prohibits certificate issuance in OFAC-sanctioned regions - sparking a 440-point, 375-comment thread on what this means for legitimate developers in affected countries.

Signal: UK surveillance is not safety: Signal published a formal statement (688 pts) opposing the UK's latest push to require backdoors in encrypted messaging apps.

Show HN: Performative-UI: React component library of UX dark patterns and design tropes, framed as a satirical reference - the most-upvoted story of the day at 1,162 points, 208 comments.

Show HN: Gitdot: Open-source GitHub alternative written in Rust, targeting 100ms first contentful paint with a keyboard-first CLI-inspired UI (321 pts).

Ask HN: What tools have you built since AI?: 427 points, 734 comments - one of the better community threads of the month for concrete examples of AI-assisted personal tooling.

Massachusetts bans location data sales: Passed a privacy bill (365 pts) prohibiting sale of precise location data - the strongest US state-level location privacy law to date.

FROM THE SITE

Every link above goes to a primary source. This brief is part of the Daily Brief archive.