What’s the recommended way to use the vLLM OpenAI server for batch processing? #7639
Unanswered
ktrapeznikov asked this question in Q&A
I want to process a batch of requests. What is the recommended way? I typically use multiple workers with a ThreadPoolExecutor (rough sketch below), but I am wondering if there is a better way.
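For reference, here is a minimal sketch of what I do today, assuming a vLLM OpenAI-compatible server running locally on port 8000 and the `openai` Python client. The base URL, API key, model name, and prompts are all placeholders:

```python
from concurrent.futures import ThreadPoolExecutor

from openai import OpenAI

# Assumes a vLLM server started with something like:
#   vllm serve <model-name> --port 8000
# The base_url and api_key below are placeholders for a local setup.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# Placeholder batch of prompts to process.
prompts = [f"Summarize item {i}" for i in range(32)]

def complete(prompt: str) -> str:
    # One blocking request per worker thread; the server handles
    # the concurrent requests on its side.
    response = client.chat.completions.create(
        model="my-model",  # placeholder: the model the server was launched with
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

# Fan the batch out across worker threads.
with ThreadPoolExecutor(max_workers=8) as pool:
    results = list(pool.map(complete, prompts))
```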