Shortly after Hunter Lightman joined OpenAI as a researcher in 2022, he watched his colleagues launch ChatGPT, one of the fastest-growing products ever. Meanwhile, Lightman quietly worked on a team ...
And that's a problem. Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models. Two years ago, Yuri Burda and Harri ...
Researchers at the company looked into how malicious fine-tuning makes a model go rogue, and how to turn it back. A new paper from OpenAI has shown why a little bit of bad training can make AI models ...