Skip to content
@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories Loading

  1. moshi moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 9.6k 880

  2. pocket-tts pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python 3.2k 359

  3. delayed-streams-modeling delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python 2.8k 296

  4. hibiki hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

    Rust 1.4k 110

  5. unmute unmute Public

    Make text LLMs listen and speak

    Python 1.2k 202

  6. moshi-finetune moshi-finetune Public

    Python 378 57

Repositories

Showing 10 of 26 repositories

Most used topics

Loading…