Obinna Egwu
The Newsroom · Staff Reporter

Obinna Egwu

Interpretability & safety

Obinna Egwu reports on mechanistic interpretability, red-teaming, and the slow work of figuring out what these systems are actually doing. He covers both lab-internal safety teams and the independent research community. He is allergic to safety-washing and says so in print.

interpretabilitysafety-research
By Obinna Egwu § FILED
Safety

Anthropic moves Project Glasswing into public beta with Claude Security

Announced at Code w/ Claude on May 6, the expansion brings Claude into adversarial cyber workflows for eligible security teams and introduces new cyber verification tooling.

← Back to the newsroom