Wan Streamer v0.1: End-to-End Real-Time Interactive Foundation Models
Wan StreamerAI ResearchEnd-to-end ModelReal-time InteractionFull-duplexAudio-visual InteractionTransformer
Author: smusamashah
Date: 6/27/2026
Article Summary:
Wan Streamer is a native-streaming, end-to-end interactive foundation model that models language, audio, and video as both input and output within a single Transformer, enabling real-time, low-latency, full-duplex audio-visual interaction.