S

Sesame CSM

Free
AudioAudio

Sesame CSM (Conversational Speech Model) is an open-source AI voice model from Sesame AI Labs that generates ultra-realistic, human-like conversational speech. Unlike traditional TTS systems, CSM uses a Llama backbone with a specialized audio decoder to produce natural prosody, emotional nuance, and contextual awareness — crossing the uncanny valley of AI voice. Voice companions Maya and Miles demonstrate real-time dialogue indistinguishable from human speech. Available under Apache 2.0 license on GitHub.

Alternatives & Related Tools