kyutai

moshi Public

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9.6k 880

pocket-tts Public

A TTS that fits in your CPU (and pocket)

Python 3.2k 359

delayed-streams-modeling Public

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2.8k 296

hibiki Public

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 1.4k 110

unmute Public

Make text LLMs listen and speak

Python 1.2k 202

moshi-finetune Public

Python 378 57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kyutai

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Most used topics

Uh oh!