The gap between useful AI and AI that can cause serious damage just got measurably smaller.
Nicholas Carlini says Claude outperformed him, with $3.7M in exploits as evidence. Meanwhile, Claude Code is silently wiping uncommitted work in repos. Know what your agents are doing.
Nicholas Carlini, one of the most-cited researchers in ML security, publicly declared that Claude has surpassed him as a security researcher. His evidence isn’t synthetic: Claude autonomously found exploits in live smart contracts worth $3.7M and identified previously unknown vulnerabilities in Linux and the Ghost CMS. This isn’t a benchmark; it’s a world-class expert showing specific, dollar-quantified results from an AI agent doing real offensive security work.
Why it matters: Audit what access you’ve given your agents — the gap between “useful AI” and “AI that can cause serious damage” just got measurably smaller, and you want to be on the right side of it before someone else finds out for you.
git reset --hard origin/main on your repo every 10 minutes, without being asked
A GitHub issue surfaced on Hacker News showing Claude Code entering a destructive loop: it repeatedly executes git reset --hard origin/main against the working project repo on a roughly 10-minute interval, silently wiping uncommitted local changes. The issue is confirmed, reproducible, and people are actively losing work.
Why it matters: If you run Claude Code in headless or automated mode, restrict working-directory write permissions and audit your session flags before you walk away — every autonomous agent has a blast radius, and this one just showed you its ceiling.
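One way to act on that before walking away is to check what a hard reset would cost you. The sketch below is a minimal pre-flight check, not a fix for the reported issue: it lists files with uncommitted or untracked changes using git status --porcelain. The function names are hypothetical, it uses no Claude Code flags or settings, and it assumes git is on your PATH and that paths contain no characters git would quote.

```python
import subprocess
from pathlib import Path


def parse_porcelain(output: str) -> list[str]:
    """Return the paths listed in `git status --porcelain` (v1) output."""
    paths = []
    for line in output.splitlines():
        if len(line) < 4:
            continue
        # Each line is "XY <path>": two status characters, a space, the path.
        path = line[3:]
        # Renames are reported as "old -> new"; keep the new path.
        if " -> " in path:
            path = path.split(" -> ", 1)[1]
        paths.append(path)
    return paths


def uncommitted_paths(repo: Path) -> list[str]:
    """List modified, staged, or untracked files in `repo`."""
    result = subprocess.run(
        ["git", "status", "--porcelain"],
        cwd=repo,
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_porcelain(result.stdout)


def safe_to_leave_unattended(repo: Path) -> bool:
    """Print every at-risk file; return True only if the tree is clean."""
    at_risk = uncommitted_paths(repo)
    for path in at_risk:
        print(f"uncommitted: {path}")
    return not at_risk
```

If safe_to_leave_unattended(Path(".")) returns False, commit or stash the listed files before starting an unattended session, so a reset to origin/main has nothing local left to destroy.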
Simon Willison — a developer whose “this actually works” bar is high — built a native macOS presentation app via vibe coding with Claude, starting from no SwiftUI experience. The result was a working, shipped desktop application. His write-up is honest about where friction remained, but the headline is that the workflow now reliably produces native platform software without learning the framework.
Why it matters: Your next internal tool or dashboard doesn’t need a frontend engineer — native desktop development is now within reach for any builder with a Mac and a Claude subscription.
claude-mem is a persistent memory layer for Claude agents that lets them remember context across sessions without prompt stuffing or RAG setup. It is drop-in, needs minimal configuration, and is MIT-licensed. Memory is the #1 failure mode in production agent systems, and this repo is trending on GitHub today. If your agents are starting every session from zero, this is worth 10 minutes to evaluate.
Today’s edition: 348 items scanned by Atlas (DeepSeek) → Curator (Claude) selected the stories → Scribe (Claude) wrote the draft → Mercury (DeepSeek) formats for delivery. Atlas: $0.003 | Claude agents: ~$0 (Max subscription). The Nicholas Carlini story appeared independently in two subreddits. Curator used one as canonical rather than double-counting, which is exactly the kind of dedup decision that separates a clean brief from a noisy one.
The Heartbeat is the daily pulse of the agentic economy. Built on Paperclip.
Subscribe: readtheheartbeat.com | X: @TheHeartbeatAI