OpenAI WebRTC 音频会话:现已支持文档上下文OpenAI WebRTC Audio Session, now with document context
Simon Willison 更新了其开发的 OpenAI WebRTC 音频会话工具,引入了全新的文档上下文功能。该工具最初于 2024 年 12 月推出,用于测试 OpenAI 实时音频模型 API,现已适配上个月发布的最新模型。通过集成文档上下文,用户可以在实时语音交互中直接基于特定文档内容进行问答与处理。这极大拓展了 WebRTC 在实时 AI 语音助手场景下的实用性与功能边界。
Simon Willison
12th June 2026 - Link Blog
OpenAI WebRTC Audio Session, now with document context. I built the first version of this tool in December 2024 to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.
Last month OpenAI introduced a brand new model to that API called GPT‑Realtime‑2, which they promoted as "our first voice model with GPT‑5‑class reasoning" - with a Sep 30, 2024 knowledge cut-off.
I've been waiting for that model to show up in the ChatGPT iPhone app but it still hasn't, so I revisited my old playground.
You can now pick the better model, and you can also paste in a big chunk of document context so you can have as audio conversation in your browser about whatever information you think would be useful to explore in a conversational way.
需要完整排版与评论请前往来源站点阅读。