Cog supports streaming output. Would be amazing if this was simple to use with RunPod.
Example cog worker with streaming: https://github.com/LagPixelLOL/cog-exllama
Cog streaming docs: https://github.com/replicate/cog/blob/main/docs/python.md#streaming-output
Thank you for the great work!