Hi everyone,

I'm trying to implement a chatbot assistant with a voice (TTS) function using Next.js and the Vercel AI SDK. As far as I know, `AssistantResponse` only returns as a stream to the `useAssistant` hook, so my current workflow is:
1. The chat message is sent to the chat API route `/api/chat`.
2. The backend returns an `AssistantResponse`.
3. The `useAssistant` hook checks that the response is finished.
4. The last message is sent to the voice API route `/api/voice`.
5. The voice route sends the request to the OpenAI `tts-1` model.
6. The TTS response is streamed back to the UI.
The problem with this approach is that it creates a noticeable delay between the text and voice responses (i.e., by the time the audio arrives, the user may already be typing their next message).

So is there any workaround for this use case? For example, combining the two routes into one (e.g., waiting for the `AssistantResponse` on the backend and sending its text to the TTS model immediately).
Thank you all in advance!