OpenAI’s GPT-5 Shows Chain-of-Thought Monitoring Works in Practice, but Only While the Reasoning Stays Readable
OpenAI’s GPT-5 deployment offers one of the clearest real-world signals yet that chain-of-thought monitoring can reduce deceptive model behavior, but the same release also makes the limit plain: this safety method only works as long as the model’s reasoning remains legible enough for humans and monitors to inspect. GPT-5 moved monitoring from research setup to…