o1やo1-miniモデルでは回答生成の前に思考を行っている。
<img src="https://storage.googleapis.com/zenn-user-upload/577024779b19-20241020.jpg" loading="lazy" class="md-img">
<a href="https://platform.openai.com/docs/guides/reasoning" target="_blank" rel="nofollow noopener noreferrer">OpenAIのガイド</a>に動作原理の解説があった。
reasoning tokenを導入して、回答生成の前に思考させている。回答生成後はReasoning部分は捨てられる。
<blockquote data-line="9" class="code-line">
How reasoning works 
The o1 models introduce reasoning tokens. The models use these reasoning tokens to "think", breaking down their understanding of the prompt and considering multiple approaches to generating a response. After generating reasoning tokens, the model produces an answer as visible completion tokens, and discards the reasoning tokens from its context. 
Here is an example of a multi-step conversation between a user and an assistant. Input and output tokens from each step are carried over, while reasoning tokens are discarded.
</blockquote>
<img src="https://storage.googleapis.com/zenn-user-upload/8ee5f589d20d-20241020.png" loading="lazy" class="md-img">

OpenAI o1の動作原理

Discussion