OpenAI WebRTC Audio Session, now with document context

Simon Willison updates his OpenAI WebRTC Audio Session tool to support the new GPT-Realtime-2 model and allow pasting document context for conversational audio exploration.

Simon Willison’s Weblog

12th June 2026 - Link Blog

OpenAI WebRTC Audio Session, now with document context. I built the first version of this tool in December 2024 to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.

Last month OpenAI introduced a brand new model to that API called GPT‑Realtime‑2, which they promoted as "our first voice model with GPT‑5‑class reasoning" - with a Sep 30, 2024 knowledge cut-off.

I've been waiting for that model to show up in the ChatGPT iPhone app but it still hasn't, so I revisited my old playground.

You can now pick the better model, and you can also paste in a big chunk of document context so you can have an audio conversation in your browser about whatever information you think would be useful to explore in a conversational way.
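As a rough illustration of the "paste in document context" idea, here is a minimal sketch of one way pasted text could be folded into the instructions for a realtime session. It is not taken from the tool's source and may not match how the tool works. The names `BASE_PROMPT`, `MAX_CONTEXT_CHARS`, `build_instructions` and `session_update_event` are hypothetical, the character limit is arbitrary, and the `session.update` event shape with an `instructions` field is an assumption about how the Realtime API accepts instructions.

```python
import json

# Hypothetical base prompt and size limit, chosen only for illustration.
BASE_PROMPT = "You are a helpful voice assistant. Answer using the document when relevant."
MAX_CONTEXT_CHARS = 20_000


def build_instructions(document: str, max_chars: int = MAX_CONTEXT_CHARS) -> str:
    """Combine the base prompt with pasted document text, truncated to max_chars."""
    doc = document.strip()[:max_chars]
    if not doc:
        # Nothing was pasted, so fall back to the plain prompt.
        return BASE_PROMPT
    return f"{BASE_PROMPT}\n\n<document>\n{doc}\n</document>"


def session_update_event(document: str) -> str:
    """Serialize an event carrying the instructions (assumed event shape)."""
    event = {
        "type": "session.update",
        "session": {"instructions": build_instructions(document)},
    }
    return json.dumps(event)


if __name__ == "__main__":
    print(session_update_event("Meeting notes: ship the audio playground update."))
```

In a browser tool, a string like this would presumably be sent once the WebRTC connection is established. Truncating the pasted text is a design choice made here to keep the payload bounded, not something the post describes.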

This is a link post by Simon Willison, posted on 12th June 2026.

Tags: audio, tools, ai, openai, generative-ai, llms, multi-modal-output, webrtc
