Wan Streamer: The Real-Time Video Interaction Revolution with AI

Are you ready to meet the video assistants of the future? Until today, when we talked about AI “video calls,” clunky, cascaded systems came to mind. First, the audio was listened to, then transcribed to text, a response was generated, and finally, a video animation was rendered. This delayed architecture is now history. Wan-Streamer is the world’s first native-streaming, end-to-end AI model. By processing language, audio, and video simultaneously within a single model, it offers a truly full-duplex video call experience. ...

June 26, 2026 ·  2 min ·  380 words

OpenAI.fm Released! OpenAI's Newest Text-To-Speech Model

Hello friends! Today I’ll be talking about OpenAI’s newly released next-generation audio models. These models are taking the interaction between AI and voice to a completely new level! What’s Coming? OpenAI has been working on text-based agents for the past few months - like Operator, Deep Research, and Computer-Using Agents. But to create a true revolution, people need to be able to interact with AI in a more natural and intuitive way. That’s why they’ve made a huge leap in audio technologies. ...

March 20, 2025 ·  9 min ·  1869 words