Make chat and embed interfaces provider-agnostic using pydantic_ai #200
gvanrossum wants to merge 11 commits into main
Conversation
pyproject.toml or lockfile was not touched, so the dependency on pydantic_ai is missing.
src/typeagent/aitools/embeddings.py
Outdated
    return input, len(tokens)
...
def create_embedding_model(
There are now two different create_embedding_model functions with different signatures:
embeddings.py:355: create_embedding_model(embedding_size, model_name, **kwargs) — creates AsyncEmbeddingModel (OpenAI/Azure)
model_adapters.py:184: create_embedding_model(model_spec, *, embedding_size) — creates PydanticAIEmbeddingModel
This is confusing. Callers importing from different modules get entirely different behaviors.
IMO we only need PydanticAIEmbeddingModel, since it can also be configured to call OpenAI/Azure models.
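One way to resolve the collision is a single provider-agnostic factory keyed by the "provider:model" spec convention pydantic_ai uses. A minimal sketch; the dispatch logic and the dataclass stand-in are hypothetical, not the PR's actual code:

```python
from dataclasses import dataclass


@dataclass
class PydanticAIEmbeddingModel:
    """Stand-in for the real adapter; holds configuration only."""
    model_spec: str
    embedding_size: int


def create_embedding_model(model_spec: str, *, embedding_size: int) -> PydanticAIEmbeddingModel:
    """One factory for all providers, keyed by the 'provider:model' prefix."""
    provider, sep, _name = model_spec.partition(":")
    if not sep or provider not in {"openai", "azure"}:
        raise ValueError(f"unsupported model spec: {model_spec!r}")
    return PydanticAIEmbeddingModel(model_spec, embedding_size)


model = create_embedding_model("openai:text-embedding-3-small", embedding_size=1536)
```

With one factory in one module, callers cannot accidentally pick up the wrong signature by importing from the wrong place.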
# ---------------------------------------------------------------------------
...
class PydanticAIEmbeddingModel(IEmbeddingModel):
AsyncEmbeddingModel (in aitools/embeddings.py) does NOT inherit from IEmbeddingModel, which seems asymmetric.
Furthermore: is subclassing a Protocol directly really the intended pattern?
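For reference, a runtime_checkable Protocol is satisfied structurally, so neither class needs to subclass it. A minimal sketch; the member names here are illustrative, not the PR's exact interface:

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class IEmbeddingModel(Protocol):
    model_name: str

    async def get_embedding_nocache(self, text: str) -> list[float]: ...


class StructuralModel:
    """Satisfies the Protocol by shape alone; no inheritance needed."""
    model_name = "demo"

    async def get_embedding_nocache(self, text: str) -> list[float]:
        return [0.0]


# isinstance() checks member presence, not the class hierarchy
assert isinstance(StructuralModel(), IEmbeddingModel)
```

That would let both AsyncEmbeddingModel and PydanticAIEmbeddingModel conform symmetrically, without either one inheriting from the Protocol.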
def test_embedding_model_is_iembedding_model() -> None:
    """PydanticAIEmbeddingModel inherits from IEmbeddingModel."""
    assert IEmbeddingModel in PydanticAIEmbeddingModel.__mro__
__mro__? Again, the inheritance from IEmbeddingModel seems asymmetric.
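If the goal is conformance rather than inheritance, the test could assert the structural relationship instead of inspecting __mro__. A hedged sketch using stand-in classes (the real adapter has more members):

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class IEmbeddingModel(Protocol):
    async def get_embedding_nocache(self, text: str) -> list[float]: ...


class PydanticAIEmbeddingModel:
    """Hypothetical stand-in for the real adapter; note: no subclassing."""

    async def get_embedding_nocache(self, text: str) -> list[float]:
        return []


def test_embedding_model_satisfies_protocol() -> None:
    """Checks conformance without requiring IEmbeddingModel in __mro__."""
    assert isinstance(PydanticAIEmbeddingModel(), IEmbeddingModel)


test_embedding_model_satisfies_protocol()
```

This keeps the test valid whether or not the implementation ever subclasses the Protocol.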
tools/ingest_vtt.py
Outdated
print("Setting up conversation settings...")
try:
-   embedding_model = AsyncEmbeddingModel(model_name=embedding_name)
+   embedding_model = create_embedding_model(model_name=embedding_name)
the pydantic_ai API uses a constructor call to instantiate models or agents, not a factory method, so this looks kind of strange. For example:
model = os.getenv('PYDANTIC_AI_MODEL', 'openai:gpt-5.2')
print(f'Using model: {model}')
agent = Agent(model, output_type=MyModel)
env: dict[str, str | None] = dict(os.environ)
key_name = "AZURE_OPENAI_API_KEY"
env[key_name] = api_key
self.base_model = typechat.create_language_model(env)
- why do we set the environment again? Better to pass the API key via an argument than via the environment.
- why do we create a new language model on every refresh (line 112)? That might be an expensive operation.
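One pattern that addresses both points: build the client once, pass the key as a plain argument, and rebuild only when the credential expires. A hypothetical sketch; RefreshingModel, make_client, and get_token are illustrative names, not the PR's API:

```python
import time
from typing import Any, Callable


class RefreshingModel:
    """Caches the underlying client; rebuilds it only when the token expires."""

    def __init__(
        self,
        make_client: Callable[[str], Any],  # receives the api key directly
        get_token: Callable[[], str],
        ttl_seconds: float = 1500.0,
    ) -> None:
        self._make_client = make_client
        self._get_token = get_token
        self._ttl = ttl_seconds
        self._expires_at = 0.0
        self._client: Any = None

    def client(self) -> Any:
        now = time.monotonic()
        if self._client is None or now >= self._expires_at:
            # no os.environ mutation: the key travels as a plain argument
            self._client = self._make_client(self._get_token())
            self._expires_at = now + self._ttl
        return self._client
```

Passing the key explicitly also avoids the concurrency hazards of mutating os.environ in-process.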
from .embeddings import AsyncEmbeddingModel, NormalizedEmbedding, NormalizedEmbeddings

DEFAULT_MAX_RETRIES = 2
Before: from openai import DEFAULT_MAX_RETRIES (whatever OpenAI sets).
Now: DEFAULT_MAX_RETRIES = 2.
Why the change?
def add_embedding(self, key: str, embedding: NormalizedEmbedding) -> None:
    self._cache[key] = embedding

async def _probe_embedding_size(self) -> None:
why probe the embedding size? We should specify it, or reject the call.
if self.embedding_size == 0:
    await self._probe_embedding_size()
same here: we should make the embedding size a mandatory parameter.
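Making the size mandatory could look like the following sketch (the constructor signature is hypothetical):

```python
class PydanticAIEmbeddingModel:
    """Sketch: embedding_size is required and validated, so there is no
    zero-size sentinel and no runtime probe call."""

    def __init__(self, model_spec: str, *, embedding_size: int) -> None:
        if embedding_size <= 0:
            raise ValueError("embedding_size must be a positive integer")
        self.model_spec = model_spec
        self.embedding_size = embedding_size
```

Failing fast at construction time is cheaper and clearer than issuing a probe request on the first embedding call.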
@runtime_checkable
class IEmbeddingModel(Protocol):
The Protocol bundles caching semantics (add_embedding, get_embedding with cache, _nocache variants) into the provider interface itself. This means every new provider implementation (e.g., Anthropic, Cohere) must re-implement the same caching boilerplate; PydanticAIEmbeddingModel, for example, duplicates nearly identical caching logic from AsyncEmbeddingModel.
Consider splitting the interface:
- a minimal IEmbedder Protocol with only get_embedding_nocache / get_embeddings_nocache + model_name + embedding_size
- a shared CachingEmbeddingModel base class (or a decorator/wrapper) that adds the cache layer on top
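A sketch of that split, using a wrapper rather than a base class (IEmbedder and CachingEmbedder are hypothetical names; the real interface has more members, and FakeEmbedder is a test double):

```python
import asyncio
from typing import Protocol, runtime_checkable


@runtime_checkable
class IEmbedder(Protocol):
    """Minimal provider surface: no caching semantics."""
    model_name: str
    embedding_size: int

    async def get_embeddings_nocache(self, texts: list[str]) -> list[list[float]]: ...


class CachingEmbedder:
    """Shared cache layer composed around any IEmbedder."""

    def __init__(self, inner: IEmbedder) -> None:
        self.inner = inner
        self._cache: dict[str, list[float]] = {}

    async def get_embedding(self, text: str) -> list[float]:
        if text not in self._cache:
            [embedding] = await self.inner.get_embeddings_nocache([text])
            self._cache[text] = embedding
        return self._cache[text]


class FakeEmbedder:
    """Test double: counts how often the provider is actually hit."""
    model_name = "fake"
    embedding_size = 2

    def __init__(self) -> None:
        self.calls = 0

    async def get_embeddings_nocache(self, texts: list[str]) -> list[list[float]]:
        self.calls += 1
        return [[0.0, 1.0] for _ in texts]


fake = FakeEmbedder()
cached = CachingEmbedder(fake)
asyncio.run(cached.get_embedding("hi"))
asyncio.run(cached.get_embedding("hi"))
assert fake.calls == 1  # second lookup served from the cache
```

Each new provider then implements only the nocache methods, and the cache logic lives in exactly one place.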
DEFAULT_TIMEOUT_SECONDS = 25

class ModelWrapper(typechat.TypeChatLanguageModel):
ModelWrapper is a concrete class wrapping typechat.TypeChatLanguageModel with Azure-specific token refresh logic. It now lives in utils.py — a catch-all module. With the new PydanticAIChatModel abstraction, there are now two different abstractions for wrapping a TypeChat model: ModelWrapper (Azure, in utils.py) and PydanticAIChatModel (pydantic_ai, in model_adapters.py). There's no clear guidance on when to use which.