OpenAI WebRTC Audio Session, now with document context

simonwillison.net · 2026-06-12 · Excerpt

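Simon Willison has updated his OpenAI WebRTC Audio Session tool with a new document context feature. He first built the tool in December 2024 to try out OpenAI's API for its realtime audio models, and it now supports the new model released last month. With document context pasted in, users can ask questions about a specific document during a live voice conversation in the browser, which makes the playground more useful for exploring reference material conversationally.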

Simon Willison

12th June 2026 - Link Blog

OpenAI WebRTC Audio Session, now with document context. I built the first version of this tool in December 2024 to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.

Last month OpenAI introduced a brand new model to that API called GPT‑Realtime‑2, which they promoted as "our first voice model with GPT‑5‑class reasoning" - with a Sep 30, 2024 knowledge cut-off.

I've been waiting for that model to show up in the ChatGPT iPhone app but it still hasn't, so I revisited my old playground.

You can now pick the better model, and you can also paste in a big chunk of document context so you can have an audio conversation in your browser about whatever information you think would be useful to explore in a conversational way.
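The post does not show how the tool passes the pasted document to the model, so the following is only a minimal sketch of one plausible approach: wrapping the document in the session instructions and sending them as a `session.update` event over the WebRTC data channel. The helper name `buildContextEvent`, the character cap, and the delimiter lines are hypothetical, and the exact fields a `session.update` event requires may differ between Realtime API versions.

```typescript
// Hypothetical helper, not taken from Simon Willison's tool.
// Assumption: the Realtime session accepts a `session.update` client event
// whose `session.instructions` string carries the pasted document.
interface SessionUpdateEvent {
  type: "session.update";
  session: { instructions: string };
}

// Assumed cap on pasted text, purely illustrative.
const MAX_CONTEXT_CHARS = 20000;

function buildContextEvent(
  documentText: string,
  maxChars: number = MAX_CONTEXT_CHARS
): SessionUpdateEvent {
  // Trim surrounding whitespace, then truncate to the assumed cap.
  const trimmed = documentText.trim().slice(0, maxChars);
  const instructions = [
    "Answer the user's spoken questions using the document below.",
    "If the document does not cover something, say so.",
    "--- DOCUMENT ---",
    trimmed,
    "--- END DOCUMENT ---",
  ].join("\n");
  return { type: "session.update", session: { instructions } };
}
```

In a browser this might be used as `channel.send(JSON.stringify(buildContextEvent(pastedText)))`, assuming `channel` is an already open `RTCDataChannel` connected to the realtime session. Keeping the event construction in a pure function makes it easy to check outside the browser.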
