OpenAI releases o1, its first model with ‘reasoning’ abilities

nave@lemmy.ca · edit-2 2 months ago

OpenAI releases o1, its first model with ‘reasoning’ abilities

BetaDoggo_@lemmy.world · edit-2 2 months ago

All signs point to this being a finetune of gpt4o with additional chain of thought steps before the final answer. It has exactly the same pitfalls as the existing model (9.11>9.8 tokenization error, failing simple riddles, being unable to assert that the user is wrong, etc.). It’s still a transformer and it’s still next token prediction. They hide the thought steps to mask this fact and to prevent others from benefiting from all of the finetuning data they paid for.

Communist@lemmy.frozeninferno.xyz · 2 months ago

It does not fail the 9.11 > 9.8 thing.

Echo Dot@feddit.uk · 2 months ago

They hide the thought steps to mask this fact and to prevent others from benefiting from all of the finetuning data they paid for.

Well possibly but they also hide the chain of thought steps because as they point out in their article it needs to be able to think about things outside of what it’s normally allowed allowed to say which obviously means you can’t show the content. If you’re trying to come up with worst case scenarios for a situation you actually have to be able to think about those worst case scenarios