😊
OpenAI o1の動作原理
o1やo1-miniモデルでは回答生成の前に思考を行っている。
OpenAIのガイドに動作原理の解説があった。
reasoning tokenを導入して、回答生成の前に思考させている。回答生成後はReasoning部分は捨てられる。
How reasoning works
The o1 models introduce reasoning tokens. The models use these reasoning tokens to "think", breaking down their understanding of the prompt and considering multiple approaches to generating a response. After generating reasoning tokens, the model produces an answer as visible completion tokens, and discards the reasoning tokens from its context.
Here is an example of a multi-step conversation between a user and an assistant. Input and output tokens from each step are carried over, while reasoning tokens are discarded.
Discussion