Reasoning models show their work — that’s supposed to make them safe. A study across 12 models and 41832 inference runs …
source
Reasoning models show their work — that’s supposed to make them safe. A study across 12 models and 41832 inference runs …
source