2026-06-06 10:50 UTCIn-site rewrite1 min readUpdated: 2026-06-30 13:03 UTC

New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent

Unlike GPT-4o or Qwen3.5-Omni, Audio Interaction doesn't wait for a recording to end: it translates, transcribes, chats, and picks up everyday noises like coughing in a single stream. Code, model weights, and download instructions are available on GitHub under the Apache 2.0 open-source license, with the training data to follow.

SourceThe DecoderAuthor: Jonathan Kemper

The article New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent appeared first on The Decoder.