OpenAI WebRTC Audio Session, now with document context
Simon Willison updates his OpenAI WebRTC Audio Session tool to support the new GPT-Realtime-2 model and allow pasting document context for conversational audio exploration.
Simon Willison’s Weblog
12th June 2026 - Link Blog
OpenAI WebRTC Audio Session, now with document context. I built the first version of this tool in December 2024 to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.
Last month OpenAI introduced a brand new model to that API called GPT‑Realtime‑2, which they promoted as "our first voice model with GPT‑5‑class reasoning" - with a Sep 30, 2024 knowledge cut-off.
I've been waiting for that model to show up in the ChatGPT iPhone app but it still hasn't, so I revisited my old playground.
You can now pick the better model, and you can also paste in a big chunk of document context so you can have an audio conversation in your browser about whatever information you think would be useful to explore in a conversational way.
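As a rough illustration of the document-context idea (this is not the tool's actual code), pasted text could be folded into the session instructions before the conversation starts. The sketch below assumes an event shaped like the Realtime API's `session.update` message; the function name `build_session_update`, the base prompt, the `<document>` wrapper, and the `MAX_CONTEXT_CHARS` limit are all hypothetical choices made for this example.

```python
import json

# Hypothetical cap on pasted context; the real tool may use no limit or a different one.
MAX_CONTEXT_CHARS = 20_000


def build_session_update(document: str, max_chars: int = MAX_CONTEXT_CHARS) -> str:
    """Return a JSON string for a session.update-style event carrying document context."""
    text = document.strip()
    if len(text) > max_chars:
        # Truncate rather than fail, so an oversized paste still produces a usable session.
        text = text[:max_chars]

    # Assumed base prompt; only extended when the user actually pasted something.
    instructions = "You are a helpful voice assistant."
    if text:
        instructions += (
            " Answer questions using the document below.\n\n"
            "<document>\n" + text + "\n</document>"
        )

    event = {"type": "session.update", "session": {"instructions": instructions}}
    return json.dumps(event)


if __name__ == "__main__":
    print(build_session_update("Release notes: version 2 adds document context."))
```

In a browser, the equivalent JSON would most likely be sent to the model over the WebRTC data channel once the connection is open, though the exact transport and field names should be checked against the current API documentation.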
This is a link post by Simon Willison, posted on 12th June 2026.
Tags: audio, tools, ai, openai, generative-ai, llms, multi-modal-output, webrtc