Kyutai releases Hibiki: A 2,7B real-time speech-to-speech and speech-to-text translation with near-human quality and voice transfer
Real-time speech translation poses a complex challenge that requires trouble-free integration of speech recognition, machine translation and text-to-speech synthesis. Traditional cascaded approaches often introduce composite errors, fail to preserve the speaker identity and suffer from slow treatment, making them less suitable for real -time applications such as live interpretation. In addition, existing contemporary translation models … Read more