openai-compatibility 上游请求 60s 硬超时返回 500，长响应被截断

## 描述

通过 `openai-compatibility` 配置的上游（如 NVIDIA API），当模型响应时间超过 60s 时，
CLIProxyAPI 直接返回 HTTP 500，请求被截断。

## 复现场景

- 配置了 NVIDIA `moonshotai/kimi-k2-instruct-0905` 作为 openai-compatibility 上游
- 客户端（Chrome 扩展 / cursor2api）发起 non-streaming 请求
- 上游模型推理耗时 > 60s 时，CLIProxyAPI 主动断开连接返回 500

部分请求耗时超过 1 分钟时会被强制中断。

## 期望行为

- `openai-compatibility` 上游应支持可配置的超时时间（类似 PR #2060 对 `claude-api-key` 的 `response-header-timeout`）
- 或者至少将默认超时提高到 300s 以适配推理类模型

## 建议

参考 #2060 的思路，在 `openai-compatibility` 的配置中也支持 `response-header-timeout` 字段：

```yaml
openai-compatibility:
- name: NVIDIA
  base-url: https://integrate.api.nvidia.com/v1
  response-header-timeout: 300  # 等待上游首字节的超时
  api-key-entries:
    - api-key: nvapi-xxx

#2060 的 `response-header-timeout` 应该也覆盖 `openai-compatibility` provider，不只是 `claude-api-key`。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

openai-compatibility 上游请求 60s 硬超时返回 500，长响应被截断 #2144

描述

复现场景

期望行为

建议

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

openai-compatibility 上游请求 60s 硬超时返回 500，长响应被截断 #2144

Description

描述

复现场景

期望行为

建议

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions