ponta

 OpenAI Playground2024/10/02のDevDayにて、OpenAI Playgroundにプロンプトの自動生成機能が追加された。
https://x.com/OpenAIDevs/status/1841176443306295685
Playground にて、簡単なプロンプトを入力して、「Create」ボタンを押すとより高品質なプロンプトを生成してくれるものだ。
Xでたまたま見かけたのだが、このプロンプト生成機能で利用されているプロンプトもリークされている（？）
https://www.linkedin.com/posts/philipp-schmid-a6a2bb196_attention-openais-meta-prompt-for-activity-7247135871651442688-9mPS?utm_source=share&utm_medium=member_desktop
https://gist.github.com/philschmid/3a0ecc9e45763716f4dd9c36b6445fca#file-openai_meta-txt

ponta

 Realtime APIOpenAIからRealtime APIが発表された。これにより、より高速な音声コミュニケーションを提供することができるようになった。
https://github.com/openai/openai-realtime-api-beta?tab=readme-ov-file
https://github.com/openai/openai-realtime-console?tab=readme-ov-file
openai-realtime-consoleをCloneすれば簡単にRealtime APIを試すことができる。

 プロンプトと音声モデルの設定https://github.com/openai/openai-realtime-console/blob/6ea4dba795fee868c60ea9e8e7eba7469974b3e9/src/pages/ConsolePage.tsx#L379-L382

 Turn Detectionhttps://github.com/openai/openai-realtime-console/blob/6ea4dba795fee868c60ea9e8e7eba7469974b3e9/src/pages/ConsolePage.tsx#L263-L265
Turn DetectionにはServer VAD mode (Default)とNo turn detectionの２種類がある。
Server VAD modeは、常に音声入力する電話のような場面で利用。
No turn detectionは、push-to-talkの時に利用。
参照：https://platform.openai.com/docs/guides/realtime/responses

 Interruptionshttps://github.com/openai/openai-realtime-console/blob/6ea4dba795fee868c60ea9e8e7eba7469974b3e9/src/pages/ConsolePage.tsx#L236-L239
https://github.com/openai/openai-realtime-console/blob/6ea4dba795fee868c60ea9e8e7eba7469974b3e9/src/pages/ConsolePage.tsx#L472-L478
client.cancelResponse(id, sampleCount);で途中で介入することができる
https://platform.openai.com/docs/guides/realtime/handling-interruptions
https://x.com/kenn/status/1844528993002979768

 話すスピードは変えられるの？現時点ではできなさそう。
https://community.openai.com/t/speak-faster-instructions-that-work-for-real-time-api/971100
AudioのCreate speechではspeed調整が可能なようだが、Realtime APIではそのようなパラメータは見つけられなかった。

ponta

https://github.com/openai/openai-realtime-console?tab=readme-ov-file#using-a-relay-server
If you would like to build a more robust implementation and play around with the reference client using your own server, we have included a Node.js Relay Server.
と書いているけど、VercelにデプロイしているNext.jsアプリケーションではどうやってRelay Serverで実装するんだろうか？

ponta

 openai/swarmマルチエージェントシステムのための軽量で使いやすいインターフェースのCookbookとして出したようだ。

https://x.com/shyamalanadkat/status/1844888546014052800?s=46
例えば、Support botの場合は、

ユーザーインターフェースエージェント：ユーザーとの最初のやり取りを処理。ニーズに基づいてヘルプセンターエージェントに振り分け。

ヘルプセンターエージェント：具体的なヘルプ・サポートを提供。文書検索のためのQdrant VectorDBと統合されている。

 オーケストレーションのイメージ

ponta

Threads

Threadのメッセージは編集することは可能？
- Modify message
- ↑を利用することで編集可能