cuppa

today's signal · no scroll

live

brewed 03:04 AM

← previous← Jun 03

Thursday

jun04

2026

next →Jun 05 →

the brief

Maturity over hype set the tone: OpenAI expanded GPT-Rosalind for bench scientists while Anthropic detailed concrete containment patterns for Claude. Cloudflare pushed pragmatic BGP hygiene, and dev tooling kept shipping with Next.js and Claude Code updates. Hugging Face’s DPO recipes and a $1,500 red-team test grounded capability claims against real constraints.

the poursit · sip · 8 items

pulse

(03)

vercel/next.js· First-partyJun 4, 12:16 AM
Next.js canary trims legacy paths
Deprecates undocumented custom server methods, fixes a build hang on sync IO abort, and improves cache tracing and stage checks for adapter scenarios.
v16.3.0-canary.40 — Misc Changes docs: add GitHub SSH authentication guidance: #94360 Deprecate undocumented custom server methods: #94348 Trace cacheHandler and cacheHandlers when using adapters: #94197 refactor: make stage advancing logic more generic: #94349 docs: document Cache Components behavior for navigation hooks: #94387 fix: build hangs if sync IO aborts before root chunk is flushed: #94365 refactor: make early/late checks stricter: #94358 refactor: remove delayUntilRuntimeStage: #9...
signal 8hype 1nextjsrelease_notesframeworktechnicalsource ↗
anthropics/claude-code· First-partyJun 3, 09:31 PM
Claude Code improves agent clarity
Adds waitingFor in --json for blocked sessions, proper Grep/Glob tools on native builds, persistent /effort defaults, and safer autocomplete behavior.
v2.1.162 — What's changed claude agents --json now includes waitingFor showing what a waiting session is blocked on (e.g. permission prompt) --tools: explicitly listing Grep/Glob now provides the dedicated search tools on native builds with embedded search (previously these names were silently ignored) /effort now confirms when your chosen level will persist as the default for new sessions Clicking a slash command in the autocomplete menu now fills it into your prompt instead of running it im...
signal 8hype 1release_notesagent_cliclaude_codelaunchsource ↗
openai/blog· First-partyJun 3, 01:15 PM
OpenAI expands GPT-Rosalind toolkit
Adds stronger biological reasoning, medicinal chemistry insight, genomics analysis, and experiment workflow orchestration, signaling a push toward domain-specific AI assistants for wet-lab scientists.
Introducing new capabilities to GPT-Rosalind — GPT-Rosalind advances life sciences research with enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental workflow capabilities.
signal 6hype 3model_updatespecialized_gptopenailaunchsource ↗

findings

(04)

hn/frontpage· AggregatorJun 4, 12:27 AM
How Anthropic contains Claude
Engineering deep-dive into sandboxing, permissioning, and bounding LLM actions across products, offering concrete patterns for agents that touch tools, data, and user prompts.
The ways we contain Claude across products — Article URL: https://www.anthropic.com/engineering/how-we-contain-claude Comments URL: https://news.ycombinator.com/item?id=48392082 Points: 49 # Comments: 20
signal 9hype 1safetysystem_designguardrailstechnicalsource ↗
hn/frontpage· AggregatorJun 4, 12:56 AM
Testing if LLMs can hack apps
A $1,500 controlled test against a purposely vulnerable app evaluates how far leading models get at real exploitation versus CTF-style prompts, surfacing clear limits and risks.
I built a vulnerable app and spent $1,500 seeing if LLMs could hack it — Article URL: https://kasra.blog/blog/i-spent-1500-seeing-if-llms-could-hack-my-app/ Comments URL: https://news.ycombinator.com/item?id=48392343 Points: 59 # Comments: 27
signal 7hype 1securityllm_evalred_teamingtechnicalsource ↗
cloudflare/blog· First-partyJun 3, 05:00 PM
Cloudflare pushes First AS enforcement
Cloudflare details and promotes enforcing the first AS in BGP AS_PATHs to blunt forged-path attacks that RPKI misses, a practical routing hygiene step operators can adopt now.
Enforcing the First AS in BGP AS_PATHs — BGP is vulnerable to routing hijacks and path leaks that negatively impact traffic on the Internet. RPKI helps solve some of these problems, but for some forged paths, we need to rely on a simpler mechanism: First AS enforcement in BGP.
signal 6hype 1bgpnetwork_securityinternet_infratechnicalsource ↗
huggingface/blog· First-partyJun 3, 12:55 PM
DPO techniques beyond chatbots
Hugging Face shares methods and examples for applying Direct Preference Optimization to classification, extraction, and ranking tasks, extending alignment techniques beyond conversational agents.
Direct Preference Optimization Beyond Chatbots
signal 7hype 1dpopreference_optimizationalignmenttechnicalsource ↗

voices

(01)

simonw/blog· AnalysisJun 3, 12:01 PM
Uber caps AI tool usage
Willison contextualizes Uber reportedly burning its 2026 AI budget in four months, underscoring cost governance pressures emerging around copilots, code agents, and internal AI tooling.
Uber Caps Usage of AI Tools Like Claude Code to Manage Costs — <p><strong><a href="https://www.bloomberg.com/news/articles/2026-06-02/uber-caps-usage-of-ai-tools-like-claude-code-to-cut-costs">Uber Caps Usage of AI Tools Like Claude Code to Manage Costs</a></strong></p> I wrote <a href="https://simonwillison.net/2026/May/27/product-market-fit/#the-ai-failure-stories-around-this-are-pretty-thin">the other day</a> about Uber blowing its 2026 AI budget in four months, and how that wasn't particu...
signal 6hype 1ai_costsenterprise_adoptionclaude_codeculturalsource ↗

jun04

Next.js canary trims legacy paths

Claude Code improves agent clarity

OpenAI expands GPT-Rosalind toolkit

How Anthropic contains Claude

Testing if LLMs can hack apps

Cloudflare pushes First AS enforcement

DPO techniques beyond chatbots

Uber caps AI tool usage