Resume streaming output #5310

veeyenkay · 2024-06-06T07:18:18Z

For a model hosted with vLLM, I am using the completions endpoint in streaming mode (stream = true). Network issues can cause the connection to the vLLM server to drop mid-generation. Once the network recovers, the client opens a new connection. Is there a way to resume streaming from the last generated token?
If needed, I can also send the previously generated tokens with the new request.
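
To make the "provide the previously generated tokens" idea concrete, here is a minimal client-side sketch of what I have in mind. It is not a vLLM resume feature: it assumes the OpenAI-style streaming format (`data: {...}` lines ending with `data: [DONE]`), a string `prompt`, and that each streamed chunk carries roughly one token, which may not hold exactly. The helper names `collect_stream_text` and `build_resume_payload` are hypothetical. With sampling enabled, the continuation may also differ from what the interrupted request would have produced, and re-tokenizing the concatenated text may not reproduce the original token boundaries.

```python
import json
from typing import Iterable, Tuple


def collect_stream_text(sse_lines: Iterable[str]) -> Tuple[str, int, bool]:
    """Accumulate completion text from OpenAI-style SSE lines.

    Returns (text_so_far, chunk_count, finished). chunk_count is only a
    rough stand-in for the number of tokens received (assumption).
    """
    text, chunks, finished = "", 0, False
    for line in sse_lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and comments
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            finished = True
            break
        choice = json.loads(data)["choices"][0]
        text += choice.get("text") or ""
        chunks += 1
        if choice.get("finish_reason"):
            finished = True
    return text, chunks, finished


def build_resume_payload(original: dict, received_text: str,
                         tokens_received: int) -> dict:
    """Build a follow-up request that continues from the received text.

    The text already received is appended to the prompt, and the
    remaining token budget is reduced accordingly.
    """
    resumed = dict(original)
    resumed["prompt"] = original["prompt"] + received_text
    if original.get("max_tokens") is not None:
        # A result of 0 means the budget is spent; do not send the request.
        resumed["max_tokens"] = max(original["max_tokens"] - tokens_received, 0)
    return resumed
```

On reconnect, the client would pass the lines it managed to read to `collect_stream_text`, and if `finished` is false, POST the dictionary from `build_resume_payload` to the same completions endpoint with `stream` still set to true. This re-runs prefill over the longer prompt rather than resuming the original request on the server.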

Replies: 0 comments
