feat(provider): add Google VertexAI support #24

onyedikachi-david · 2024-12-30T13:03:52Z

Add Google VertexAI Provider Support

This PR adds support for Google VertexAI models (Gemini) as a new provider in the Hub, allowing users to route their LLM requests to Google's models through our unified API interface.

Definition of Done

Changes Made

Added new VertexAI provider implementation
- Full support for Gemini models (chat, completion)
- Text embeddings with textembedding-gecko
- Tool/function calling with proper mapping
- Streaming support for real-time responses
- Multi-modal capabilities for image+text inputs
Added comprehensive test suite
- Unit tests for core functionality
- Integration tests with recorded responses
- Test cassettes for offline testing
- Quota retry mechanism for rate limits
- Both auth methods covered
Updated documentation
- Detailed VertexAI setup instructions
- Authentication configuration guide
- Model compatibility list
- Usage examples with OpenAI SDK
- Configuration parameter reference
Added robust configuration support
- API key authentication
- Service account authentication (JSON key file)
- Project ID configuration
- Regional endpoint configuration
- Default values for optional parameters

Testing

Unit tests: cargo test

Integration tests with credentials:

# Service Account Auth
export VERTEXAI_CREDENTIALS_PATH="../credentials/vertexai-key.json"

# Record new responses
RECORD_MODE=1 cargo test

# Replay mode (default)
cargo test

Test retry configuration:

export RETRY_DELAY=60  # Seconds between retries

Security Considerations

Credentials stored outside repository
Test cassettes cleaned of sensitive data
Support for both API key and service account auth
Environment variable support for credentials
No hardcoded sensitive values

Notes

Default location is us-central1 (configurable)
Automatic retry on quota limits (configurable)
Test cassettes provided for offline development
Compatible with existing OpenAI SDK clients

Fixes #19
/claim #19

Signed-off-by: David Anyatonwu <[email protected]>

CLAassistant · 2024-12-31T08:04:13Z

All committers have signed the CLA.

Signed-off-by: David Anyatonwu <[email protected]>

…tting - Changed project_id retrieval to use expect for mandatory parameter. - Updated location retrieval to use unwrap_or for default value. - Modified endpoint formatting to dynamically include location in the URL for both chat and embeddings requests. - Refactored test provider setup to use constants for project_id and location.

onyedikachi-david · 2024-12-31T15:04:45Z

@galkleinman This PR is ready for review.

running 9 tests
test providers::vertexai::tests::test_chat_completions_with_api_key ... ignored, Requires valid API key which is not available yet
test providers::vertexai::provider::tests::test_gemini_request_conversion ... ok
test providers::vertexai::provider::tests::test_gemini_response_conversion ... ok
test providers::vertexai::provider::tests::test_provider_new_missing_project_id - should panic ... ok
test providers::vertexai::provider::tests::test_provider_new ... ok
test providers::vertexai::tests::test_chat_completions ... ok
test providers::vertexai::tests::test_chat_completions_with_tools ... ok
test providers::vertexai::tests::test_embeddings ... ok
test providers::vertexai::tests::test_completions ... ok

test result: ok. 8 passed; 0 failed; 1 ignored; 0 measured; 0 filtered out; finished in 3.32s

     Running unittests src/main.rs (target/debug/deps/hub-e2c6417f88e881cd)

running 0 tests

test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s

   Doc-tests hub

running 0 tests

test result: ok. 0 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s

- Introduced a new method `validate_location` to sanitize and validate location input, defaulting to "us-central1" if invalid. - Updated the provider initialization to utilize the new location validation method. - Added extensive unit tests for the provider, covering various scenarios including location validation, request conversion, and handling of empty messages. - Ensured that invalid characters in location parameters are filtered out correctly. - Enhanced tests to verify the precedence of API key over credentials path in configuration. This commit improves the robustness of the VertexAIProvider by ensuring valid location formats and enhancing test coverage.

nirga

Hey @onyedikachi-david - left a lot of comments. You're code isn't following idiomatic rust constructs and it's not following other providers already implemented in this repo. You're also missing integration (black box) tests which tests the whole system - not just the provider.

nirga · 2025-01-02T22:58:56Z

src/models/streaming.rs

@@ -37,3 +46,44 @@ pub struct ChatCompletionChunk {
    #[serde(skip_serializing_if = "Option::is_none")]
    pub usage: Option<Usage>,
 }
+
+impl ChatCompletionChunk {


This should use the From trait, and be part of gemini provider and not here (see other providers)

nirga · 2025-01-02T23:00:34Z

src/models/streaming.rs

+use crate::models::vertexai::GeminiChatResponse;
+
+#[derive(Deserialize, Serialize, Clone, Debug, Default)]
+pub struct Delta {


What's the difference between this and ChoiceDelta?

nirga · 2025-01-02T23:01:38Z

src/models/vertexai.rs

+use crate::models::tool_choice::{SimpleToolChoice, ToolChoice};
+use crate::models::usage::Usage;
+
+#[derive(Debug, Serialize, Deserialize)]


Please follow the similar structure we have in other providers - this should be under providers/vertexai/models.rs

nirga · 2025-01-02T23:02:11Z

src/models/vertexai.rs

+    pub total_token_count: i32,
+}
+
+impl GeminiChatRequest {


Should use the From trait

nirga · 2025-01-02T23:03:20Z