diff --git a/hindsight-docs/docs/developer/models.mdx b/hindsight-docs/docs/developer/models.mdx
index 6fb8bafb9..8169f45d1 100644
--- a/hindsight-docs/docs/developer/models.mdx
+++ b/hindsight-docs/docs/developer/models.mdx
@@ -377,6 +377,7 @@ Converts text into dense vector representations for semantic similarity search.
 | `google` | Google embeddings (Gemini API or Vertex AI) | Production, multilingual, high quality |
 | `tei` | HuggingFace Text Embeddings Inference | Production, self-hosted |
 | `litellm` | LiteLLM proxy (unified gateway) | Multi-provider setups |
+| `litellm-sdk` | LiteLLM SDK (direct API, no proxy) | Multi-provider, simpler setup |
 
 ### Local Models
 
@@ -447,6 +448,11 @@ export HINDSIGHT_API_EMBEDDINGS_TEI_URL=http://localhost:8080
 export HINDSIGHT_API_EMBEDDINGS_PROVIDER=litellm
 export HINDSIGHT_API_LITELLM_API_BASE=http://localhost:4000
 export HINDSIGHT_API_EMBEDDINGS_LITELLM_MODEL=text-embedding-3-small
+
+# LiteLLM SDK (direct, no proxy)
+export HINDSIGHT_API_EMBEDDINGS_PROVIDER=litellm-sdk
+export HINDSIGHT_API_EMBEDDINGS_LITELLM_SDK_API_KEY=sk-xxxxxxxxxxxx
+export HINDSIGHT_API_EMBEDDINGS_LITELLM_SDK_MODEL=openai/text-embedding-3-small
 ```
 
 See [Configuration](./configuration#embeddings) for all options including Azure OpenAI and custom endpoints.
@@ -566,3 +572,4 @@ export HINDSIGHT_API_RERANKER_PROVIDER=rrf
 ```
 
 See [Configuration](./configuration#reranker) for all options including Azure-hosted endpoints and batch settings.
+
diff --git a/skills/hindsight-docs/references/developer/models.md b/skills/hindsight-docs/references/developer/models.md
index f76988bd0..6efe13553 100644
--- a/skills/hindsight-docs/references/developer/models.md
+++ b/skills/hindsight-docs/references/developer/models.md
@@ -399,6 +399,7 @@ Converts text into dense vector representations for semantic similarity search.
 | `google` | Google embeddings (Gemini API or Vertex AI) | Production, multilingual, high quality |
 | `tei` | HuggingFace Text Embeddings Inference | Production, self-hosted |
 | `litellm` | LiteLLM proxy (unified gateway) | Multi-provider setups |
+| `litellm-sdk` | LiteLLM SDK (direct API, no proxy) | Multi-provider, simpler setup |
 
 ### Local Models
 
@@ -468,6 +469,11 @@ export HINDSIGHT_API_EMBEDDINGS_TEI_URL=http://localhost:8080
 export HINDSIGHT_API_EMBEDDINGS_PROVIDER=litellm
 export HINDSIGHT_API_LITELLM_API_BASE=http://localhost:4000
 export HINDSIGHT_API_EMBEDDINGS_LITELLM_MODEL=text-embedding-3-small
+
+# LiteLLM SDK (direct, no proxy)
+export HINDSIGHT_API_EMBEDDINGS_PROVIDER=litellm-sdk
+export HINDSIGHT_API_EMBEDDINGS_LITELLM_SDK_API_KEY=sk-xxxxxxxxxxxx
+export HINDSIGHT_API_EMBEDDINGS_LITELLM_SDK_MODEL=openai/text-embedding-3-small
 ```
 
 See [Configuration](./configuration#embeddings) for all options including Azure OpenAI and custom endpoints.
@@ -587,3 +593,4 @@ export HINDSIGHT_API_RERANKER_PROVIDER=rrf
 ```
 
 See [Configuration](./configuration#reranker) for all options including Azure-hosted endpoints and batch settings.
+
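
Reviewer note: the three `litellm-sdk` variables documented in this diff could be read and validated roughly as sketched below. This is only an illustration, not Hindsight's actual loader: the function name `load_litellm_sdk_embeddings_config`, the returned dictionary shape, and the assumption that both the API key and the model are required are all hypothetical. The variable names themselves come from the diff. Note that the SDK example uses a provider-prefixed model (`openai/text-embedding-3-small`), whereas the proxy example uses the bare model name.

```python
import os
from typing import Mapping, Optional

# Common prefix of the embeddings variables shown in the diff.
PREFIX = "HINDSIGHT_API_EMBEDDINGS_"


def load_litellm_sdk_embeddings_config(
    env: Optional[Mapping[str, str]] = None,
) -> dict:
    """Hypothetical helper: collect the litellm-sdk embeddings settings.

    Assumes (not confirmed by the diff) that the API key and model are
    both mandatory when the provider is `litellm-sdk`.
    """
    env = os.environ if env is None else env

    provider = env.get(PREFIX + "PROVIDER", "")
    if provider != "litellm-sdk":
        raise ValueError(f"expected provider 'litellm-sdk', got {provider!r}")

    model = env.get(PREFIX + "LITELLM_SDK_MODEL", "")
    api_key = env.get(PREFIX + "LITELLM_SDK_API_KEY", "")

    missing = [
        name
        for name, value in (
            (PREFIX + "LITELLM_SDK_MODEL", model),
            (PREFIX + "LITELLM_SDK_API_KEY", api_key),
        )
        if not value
    ]
    if missing:
        raise ValueError("missing required variables: " + ", ".join(missing))

    # The part before the first "/" names the upstream provider,
    # e.g. "openai" in "openai/text-embedding-3-small".
    upstream, _, _ = model.partition("/")

    # These values would presumably be handed to the LiteLLM SDK, e.g.
    # litellm.embedding(model=model, input=[...], api_key=api_key).
    return {"model": model, "api_key": api_key, "upstream": upstream}
```

Passing an explicit mapping instead of relying on `os.environ` keeps the check easy to exercise in isolation, for example with the exact values from the added documentation block.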