[testing][shortfin llm] Add soak tests #1044

renxida · 2025-03-06T17:51:53Z

Our current concurrent-request tests aren't sufficient to catch most memory leaks / concurrency issues.

We probably also want some soak tests (https://en.wikipedia.org/wiki/Soak_testing), e.g. run for 30 minutes with a variable number of requests coming in throughout that period. Tests like these are more likely to catch memory leaks and concurrency issues than the short concurrent-request tests we have now.
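
A rough sketch of what such a test loop could look like, to make the idea concrete. Everything here is an assumption for illustration: `soak`, `send_request`, and `sample_memory` are hypothetical names, not shortfin APIs. `send_request` stands in for one request to the server under test, and `sample_memory` for whatever returns the server's memory usage in bytes.

```python
import asyncio
import random
import time


async def soak(send_request, duration_s=30 * 60, max_concurrency=8,
               request_timeout_s=60.0, sample_memory=None, seed=0):
    """Send waves of 1..max_concurrency concurrent requests until duration_s elapses.

    send_request: hypothetical zero-argument coroutine function (one request).
    sample_memory: hypothetical zero-argument callable returning bytes in use.
    """
    rng = random.Random(seed)  # seeded so a failing run can be replayed
    samples = [sample_memory()] if sample_memory else []
    deadline = time.monotonic() + duration_s
    sent = failed = 0
    while time.monotonic() < deadline:
        n = rng.randint(1, max_concurrency)  # vary the load from wave to wave
        results = await asyncio.gather(
            # A per-request timeout turns a hang into a counted failure.
            *(asyncio.wait_for(send_request(), request_timeout_s) for _ in range(n)),
            return_exceptions=True,
        )
        sent += n
        failed += sum(isinstance(r, BaseException) for r in results)
        if sample_memory:
            samples.append(sample_memory())
    return {"sent": sent, "failed": failed, "memory_samples": samples}
```

A run would then assert that `failed` is zero and that the later entries in `memory_samples` do not keep growing relative to the earlier ones. The acceptable amount of growth is something we would have to pick empirically.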

We should check what vLLM, SGLang, and llama.cpp do too.

(from Scott's comment on #1028)

renxida self-assigned this Mar 6, 2025

renxida mentioned this issue Mar 6, 2025

Resolve CPU llm smoke / integration test hangs by temporarily removing 8-request testcases #1028

Merged
