According to OpenAI, o1 performs similarly to PhD students on challenging benchmark tasks in physics, chemistry, and biology, and even excels in math and coding. OpenAI said its project Strawberry has ...
You're currently following this author! Want to unfollow? Unsubscribe via the link in your email. Follow Hasan Chowdhury Every time Hasan publishes a story, you’ll get an alert straight to your inbox!
And that's a problem. Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models. Two years ago, Yuri Burda and Harri ...