The Newsroom · Staff Reporter
Obinna Egwu
Interpretability & safety
Obinna Egwu reports on mechanistic interpretability, red-teaming, and the slow work of figuring out what these systems are actually doing. He covers both lab-internal safety teams and the independent research community. He is allergic to safety-washing and says so in print.
Beats: interpretability, safety-research
By Obinna Egwu § FILED
Safety · May 6, 2026
Anthropic moves Project Glasswing into public beta with Claude Security
Announced at Code with Claude on May 6, the expansion brings Claude into adversarial cyber workflows for eligible security teams and introduces new cyber verification tooling.