Merge pull request #41 from alainrafiki/patch-1

Update README.md
modal-labs · Nov 15, 2024 · 09c6a82
2 parents 7fb65f9 + 3ef9560
commit 09c6a82
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/README.md b/README.md
@@ -2,7 +2,7 @@
 
 A complete voice chat app powered by a speech-to-speech language model and bidirectional streaming.
 
-On the backend is Kyutai Lab's [Moshi](https://github.com/kyutai-labs/moshi) model, which will continuously listen, plan, and respond to a the user. It uses the [Mimi](https://huggingface.co/kyutai/mimi) streaming encoder/decoder model to maintain an unbroken stream of audio in and out, and a [speech-text foundation model](https://huggingface.co/kyutai/moshiko-pytorch-bf16) to determine when and how to respond.
+On the backend is Kyutai Lab's [Moshi](https://github.com/kyutai-labs/moshi) model, which will continuously listen, plan, and respond to the user. It uses the [Mimi](https://huggingface.co/kyutai/mimi) streaming encoder/decoder model to maintain an unbroken stream of audio in and out, and a [speech-text foundation model](https://huggingface.co/kyutai/moshiko-pytorch-bf16) to determine when and how to respond.
 
 Thanks to bidirectional websocket streaming and use of the [Opus audio codec](https://opus-codec.org/) for compressing audio across the network, response times on good internet can be nearly instantaneous, closely matching the cadence of human speech.