Refactor compile_annotation to use Pydantic TypeAdapter throughout #3194

devin-ai-integration · 2025-11-22T20:32:09Z

Comprehensively refactors compile_annotation to use Pydantic's TypeAdapter for all annotation types instead of manual type handling. This simplifies the code from ~100 lines to ~35 lines and ensures we leverage Pydantic's native schema generation capabilities.

Requested by: @vincent0426
Session: https://app.devin.ai/sessions/a7d1008865b740ce9434d1506ca3d4cc

Summary

The compile_annotation function previously manually handled each annotation type (Union, Literal, dict, list, tuple, dataclasses, BaseModel, etc.) with custom logic. This PR refactors it to use Pydantic's TypeAdapter for all types except special cases (None, Any, datetime, Json Union, ForwardRef).

Key changes:

Replaced manual type handling with TypeAdapter(annotation).json_schema() for all annotation types
Added _strip_titles() helper to recursively remove "title" keys that Pydantic adds to schemas
Simplified nested $defs handling by merging Pydantic's output directly into top-level defs dict
Updated all tests to match Pydantic's schema generation patterns
Removed unused Literal import and test_compile_function_definition__lambda test
Updated workflow serialization tests and fixtures to use Pydantic's simpler $ref naming (e.g., #/$defs/ChatMessage instead of #/$defs/vellum.client.types.chat_message.ChatMessage)

Behavioral changes:

$ref naming: Pydantic generates simpler $ref names (e.g., ChatMessage) instead of fully qualified names (e.g., vellum.client.types.chat_message.ChatMessage)
Inlined schemas: Pydantic now inlines dataclass/BaseModel schemas instead of using $ref + $defs (e.g., the schema is directly embedded in the property rather than referenced)
Enum serialization: Enum values are now serialized to their actual values (e.g., MyEnum.FOO becomes "foo")
Additional properties: Unparameterized list now includes "items": {} and unparameterized dict includes "additionalProperties": true (more correct JSON Schema)

Review & Testing Checklist for Human

$ref naming changes: Pydantic now generates simpler $ref names like #/$defs/ChatMessage instead of fully qualified names. Verify that no downstream code relies on the old fully qualified naming convention (e.g., string matching, schema validation, serialization/deserialization logic).
Schema inlining vs $refs: Pydantic inlines schemas for dataclasses/BaseModel instead of using $ref + $defs. Verify this doesn't break downstream consumers (LLM function calling, workflow serialization, etc.) that may expect or rely on $refs.
Unannotated parameters: The refactored code no longer has special handling for inspect._empty (unannotated parameters). Functions with unannotated parameters will now raise ValueError instead of returning {"type": "null"}. Verify this is acceptable or add back support if needed.
$defs collision risk: The code now does a naive defs.update(nested_defs) merge. If two different models have the same class name from different modules (e.g., foo1.Foo and foo2.Foo), the second will overwrite the first. Verify this edge case doesn't occur in practice or add collision detection if needed.
Workflow serialization compatibility: Test that workflow serialization/deserialization still works correctly with the new schema format, especially for tool calling nodes with chat history outputs.
LLM compatibility: Test with actual LLM function calling to ensure the generated schemas work correctly. Pydantic may include additional metadata or constraints that weren't present before.

Test Plan

Run the full test suite (all tests pass including 28 in test_functions.py and 154 in workflow serialization tests)
Test with real LLM function calling using various parameter types
Test workflow serialization/deserialization end-to-end, especially for tool calling nodes
Search codebase for any hardcoded references to fully qualified $ref names (e.g., vellum.client.types.chat_message.ChatMessage)
Test with complex nested types and edge cases (deeply nested models, recursive types, etc.)

Notes

All existing tests pass after updating expectations to match Pydantic's output. The refactor significantly simplifies the code and delegates schema generation to Pydantic, which should be more maintainable long-term. However, the behavioral changes (especially $ref naming and schema inlining) are significant and warrant careful testing with downstream consumers. Only two places in the codebase referenced the old fully qualified $ref names (both in tests/fixtures), which have been updated.

- Replace manual BaseModel field iteration with model_json_schema() - Add _strip_titles helper to remove title keys from generated schemas - Handle nested from Pydantic's schema generation - Remove PydanticUndefined import (no longer needed) - Remove test_compile_function_definition__lambda test (dropping support for unannotated lambda parameters) Co-Authored-By: [email protected] <[email protected]>

devin-ai-integration · 2025-11-22T20:32:13Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR that start with 'DevinAI' or '@devin'.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

vellum-automation · 2025-11-22T20:32:13Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/vellum/workflows/utils/functions.py

- Replace manual type handling with TypeAdapter for all annotation types - Simplify nested handling by merging Pydantic's output directly - Update tests to match Pydantic's schema generation (inlined schemas, enum values, etc.) - Remove unused Literal import and fix whitespace issues Co-Authored-By: [email protected] <[email protected]>

- Change #//vellum.client.types.chat_message.ChatMessage to #//ChatMessage - Update test_basic_tool_calling_node_serialization.py expectations - Update simple_composio_tool_calling_node.json fixture - Aligns with Pydantic's TypeAdapter schema generation behavior Co-Authored-By: [email protected] <[email protected]>

- Remove type_map dict that was only used in the now-removed fallback - Replace fallback with clearer ValueError that wraps the original exception - All tests pass - fallback was never hit in the test suite - Simplifies code by fully trusting Pydantic's TypeAdapter Co-Authored-By: [email protected] <[email protected]>

vincent0426 · 2025-11-22T23:28:57Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-11-22T23:32:14Z

src/vellum/workflows/utils/functions.py

+    # Use Pydantic's TypeAdapter for everything else
+    try:
+        schema = TypeAdapter(annotation).json_schema()
+        schema = _strip_titles(schema)


Preserve support for unannotated parameters

Parameters without type hints now hit the TypeAdapter call and raise instead of defaulting to {"type": "null"} as before. Any call to compile_function_definition on a function whose parameters are unannotated (where param.annotation is inspect._empty) will now propagate a ValueError from TypeAdapter, whereas the previous type_map explicitly handled that case. This breaks existing usage of untyped functions.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2025-11-22T23:32:14Z

src/vellum/workflows/utils/functions.py

+        # Merge any nested $defs into the top-level defs dict
+        if "$defs" in schema:
+            nested_defs = schema.pop("$defs")
+            defs.update(nested_defs)


Avoid $defs collisions when models share class names

Merging nested $defs directly into the shared defs dict uses the short names emitted by TypeAdapter (typically the class __name__), so two different models with the same class name (e.g., foo1.Foo and foo2.Foo) will overwrite one another and both $refs will resolve to whichever schema was processed last. The previous implementation used module-qualified names to keep definitions unique; the new approach can yield incorrect schemas whenever duplicate class names appear in a single function/workflow.

Useful? React with 👍 / 👎.

vellum-automation · 2025-11-22T23:56:21Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

src/vellum/workflows/utils/functions.py

vellum-automation · 2025-11-23T00:07:33Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2025-11-23T00:09:35Z

src/vellum/workflows/utils/functions.py

+    # Use Pydantic's TypeAdapter for everything else
+    try:
+        schema = TypeAdapter(annotation).json_schema()


Restore null schema for unannotated parameters

Parameters without type hints arrive as inspect._empty, but compile_annotation now falls through to TypeAdapter(annotation) which raises because _empty is not a valid type. Previously these were mapped to {"type": "null"}, so compiling an unannotated tool/workflow function succeeded; now the same call surfaces a ValueError and blocks schema generation for any function with untyped args.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2025-11-23T00:09:35Z

src/vellum/workflows/utils/functions.py

+        # Merge any nested $defs into the top-level defs dict
+        if "$defs" in schema:
+            nested_defs = schema.pop("$defs")
+            defs.update(nested_defs)


Avoid clobbering $defs when class names collide

Nested $defs emitted by TypeAdapter are merged with a raw defs.update, so if two different dataclasses/Pydantic models share the same class name (e.g., two ChatMessage types from different modules), the second schema overwrites the first while both parameters still point to #/$defs/ChatMessage. That produces an incorrect schema for the first parameter; the prior implementation used fully qualified names to avoid this collision.

Useful? React with 👍 / 👎.

devin-ai-integration bot assigned vincent0426 Nov 22, 2025

devin-ai-integration bot requested a review from vincent0426 November 22, 2025 20:32

chatgpt-codex-connector bot reviewed Nov 22, 2025

View reviewed changes

src/vellum/workflows/utils/functions.py Outdated Show resolved Hide resolved

devin-ai-integration bot changed the title ~~Refactor compile_annotation to use Pydantic's model_json_schema~~ Refactor compile_annotation to use Pydantic TypeAdapter throughout Nov 22, 2025

devin-ai-integration bot and others added 2 commits November 22, 2025 22:34

vincent0426 marked this pull request as ready for review November 22, 2025 23:28

chatgpt-codex-connector bot reviewed Nov 22, 2025

View reviewed changes

devin-ai-integration bot force-pushed the devin/1763843204-use-pydantic-model-json-schema branch from 6b61cb2 to 3bf693e Compare November 22, 2025 23:56

chatgpt-codex-connector bot reviewed Nov 22, 2025

View reviewed changes

src/vellum/workflows/utils/functions.py Outdated Show resolved Hide resolved

devin-ai-integration bot force-pushed the devin/1763843204-use-pydantic-model-json-schema branch from 3bf693e to 6b61cb2 Compare November 23, 2025 00:07

chatgpt-codex-connector bot reviewed Nov 23, 2025

View reviewed changes

Refactor compile_annotation to use Pydantic TypeAdapter throughout #3194

Are you sure you want to change the base?

Refactor compile_annotation to use Pydantic TypeAdapter throughout #3194

Conversation

devin-ai-integration bot commented Nov 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Review & Testing Checklist for Human

Test Plan

Notes

Uh oh!

devin-ai-integration bot commented Nov 22, 2025

🤖 Devin AI Engineer

Uh oh!

vellum-automation bot commented Nov 22, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

vincent0426 commented Nov 22, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Nov 22, 2025

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Nov 22, 2025

Choose a reason for hiding this comment

Uh oh!

vellum-automation bot commented Nov 22, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

vellum-automation bot commented Nov 23, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Nov 23, 2025

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Nov 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

devin-ai-integration bot commented Nov 22, 2025 •

edited

Loading