Add More Agent Clients (Qwen, DeepSeek, vLLM Local LLM) #55
Comments
This is absolutely cool. If you send out PRs, we'd love to review and merge them.
Sure, I will release my PR for this issue soon. I hope I can contribute more ideas to the community.
If we simply change the URL and model name in our original GPT4Turbo class, we may receive this error:

```python
# original code
class GPT4Turbo:
    """Abstraction for OpenAI's GPT-4 Turbo model."""

    def __init__(self):
        self.cache = Cache()

    def inference(self, payload: list[dict[str, str]]) -> list[str]:
        if self.cache is not None:
            cache_result = self.cache.get_from_cache(payload)
            if cache_result is not None:
                return cache_result
        client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))
        try:
            response = client.chat.completions.create(
                messages=payload,  # type: ignore
                model="gpt-4-turbo-2024-04-09",
                ...
```

```
openai.BadRequestError: Error code: 400 - {'error': {'message': 'deepseek-reasoner does not support successive user or assistant messages (messages[1] and messages[2] in your input). You should interleave the user/assistant messages in the message sequence.', 'type': 'invalid_request_error', 'param': None, 'code': 'invalid_request_error'}}
```

It seems we need to change the trace-management logic; maybe you can refer to this PR: #56
Closed by #56
Description
To broaden the project’s supported agent clients, I’d like to include additional options such as Qwen, DeepSeek, and potentially a locally deployed model via vLLM. These agents should also offer deeper Weights & Biases (wandb) integration for logging and analysis.
Proposed Solution
Additional Context
This expansion would help users experiment with different models in the same OpsBench environment, increasing flexibility for research or application-specific needs.
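Since Qwen, DeepSeek, and a locally served vLLM model all expose OpenAI-compatible chat endpoints, one rough way to sketch this expansion is a single client class parameterized by base URL and model name. The class name, endpoint table, and environment-variable names below are illustrative assumptions, not the project's actual configuration:

```python
import os

# Illustrative endpoint registry: (base_url, API-key environment variable).
ENDPOINTS = {
    "deepseek": ("https://api.deepseek.com", "DEEPSEEK_API_KEY"),
    "qwen": ("https://dashscope.aliyuncs.com/compatible-mode/v1", "DASHSCOPE_API_KEY"),
    "vllm": ("http://localhost:8000/v1", "VLLM_API_KEY"),  # local vLLM server
}

class OpenAICompatibleClient:
    """One client covering any provider with an OpenAI-compatible API."""

    def __init__(self, provider: str, model: str):
        self.base_url, key_env = ENDPOINTS[provider]
        self.api_key = os.getenv(key_env, "EMPTY")  # vLLM accepts any key
        self.model = model

    def inference(self, payload: list[dict[str, str]]) -> list[str]:
        # Deferred import so the module loads without the SDK installed.
        from openai import OpenAI
        client = OpenAI(base_url=self.base_url, api_key=self.api_key)
        response = client.chat.completions.create(
            messages=payload, model=self.model
        )
        return [choice.message.content for choice in response.choices]
```

With this shape, adding a new provider is a one-line registry entry, and wandb logging could be hooked into `inference` in one place for all backends.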