refactor(core): standard content blocks #32085

mdrxy · 2025-07-17T15:26:25Z

block_type replaces both the old type and source_type fields.
- It encodes the full semantics of each block in a single literal string.
Benefits:
- Unambiguous dispatch: you only ever switch on block_type, instead of on both type and source_type.
- Type safety: each Literal value in block_type maps to exactly one TypedDict.
Note: I collapsed the ID-based blocks into a single "file" bucket for three main reasons:
1. An ID (e.g. a handle, storage key, or cloud object ID) is opaque by definition. There is no reliable way to infer whether an ID points to an image, an audio clip, or a PDF. Treating every opaque ID as a generic file avoids mistyping or misrouting it based on assumptions about its contents.
2. Most provider SDKs and APIs that consume "ID" blocks simply expect a file reference (e.g. upload-by-reference, fetch-by-ID), not a media-specific variant. Funneling all IDs through a single data:file:id channel simplifies adapters like convert_to_openai_data_block and avoids boilerplate branching on media kind.
3. If we later encounter a storage system that uses IDs but also provides MIME metadata server-side, we can enrich the generic file block with a mime_type field or introduce a new discriminator (e.g. data:image:id). Until there is a strong, real-world need to distinguish "ID for image" from "ID for audio," the single file:id variant keeps the schema lean.
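
To illustrate the dispatch argument, here is a minimal sketch of switching on a single discriminator. The class names, the `data:image:url` literal, and the `id`/`url` keys are assumptions made for illustration; only `data:file:id` and `data:image:id` are named in this description, and the real TypedDicts in content_blocks.py may differ.

```python
from typing import Literal, TypedDict, Union


class UrlImageBlock(TypedDict):
    """Hypothetical block: an image referenced by URL."""

    block_type: Literal["data:image:url"]  # assumed literal, not from the PR
    url: str


class FileIdBlock(TypedDict):
    """Hypothetical block: any opaque ID, regardless of media kind."""

    block_type: Literal["data:file:id"]
    id: str


DataBlock = Union[UrlImageBlock, FileIdBlock]


def describe(block: DataBlock) -> str:
    # One discriminator is enough to pick the handler; there is no
    # second field (such as source_type) to cross-check.
    if block["block_type"] == "data:image:url":
        return f"image at {block['url']}"
    if block["block_type"] == "data:file:id":
        return f"file with opaque id {block['id']}"
    raise ValueError(f"unknown block_type: {block['block_type']!r}")
```

Because every ID lands in the same `data:file:id` branch, an adapter never has to guess the media kind behind the ID.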

vercel · 2025-07-17T15:26:30Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
langchain	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jul 22, 2025 0:56am

sydney-runkle

I like block_type, but I don't think we should use it on annotations. I still have some questions about multimodal.

ccurme · 2025-07-17T18:12:26Z

libs/core/langchain_core/messages/content_blocks.py



-def is_data_content_block(


I don't mind updating the types but I don't think we should break untyped dicts in these formats in 0.4 if we can avoid it. If we decide to migrate IMO we should retain backward compatibility and emit deprecation warnings in the appropriate code paths (you may be able to just add a warning to is_data_content_block but we'll need to check integrations).
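
A rough sketch of the deprecation-warning idea, assuming legacy blocks can be recognised by the presence of a `source_type` key. The helper name and the warning text are hypothetical; where the check would actually live (for example inside is_data_content_block) is left open, as the comment above notes.

```python
import warnings
from typing import Any


def warn_if_legacy_block(block: dict[str, Any]) -> bool:
    """Return True (and emit a DeprecationWarning) for old-style blocks.

    Hypothetical helper: it treats any dict carrying ``source_type`` as legacy.
    """
    if "source_type" not in block:
        return False
    warnings.warn(
        "Content blocks using 'source_type' are deprecated; "
        "migrate to the standard content block format.",
        DeprecationWarning,
        stacklevel=2,  # point the warning at the caller, not this helper
    )
    return True
```

The legacy dict keeps working; callers only see a warning, which is the behaviour the comment asks for in 0.4.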

sydney-runkle

Nice progress, some nits

ccurme · 2025-07-17T21:55:12Z

libs/core/langchain_core/messages/content_blocks.py

+    """Signature of the reasoning.
+
+    Inspired by:
+    - https://ai.google.dev/gemini-api/docs/thinking#signatures


AFAIK Gemini doesn't include signature on thinking parts, rather on text with thought=False or function calls.

codspeed-hq · 2025-07-21T19:19:49Z

CodSpeed WallTime Performance Report

Merging #32085 will not alter performance

Comparing mdrxy/updated-content-blocks (d4a0c10) with standard_outputs (3c19caf)

⚠️

Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

Summary

✅ 13 untouched benchmarks

codspeed-hq · 2025-07-21T19:26:02Z

CodSpeed Instrumentation Performance Report

Merging #32085 will not alter performance

Comparing mdrxy/updated-content-blocks (d4a0c10) with standard_outputs (3c19caf)

Summary

✅ 14 untouched benchmarks

Copilot · 2025-07-21T19:26:15Z

libs/core/langchain_core/messages/content_blocks.py

+    mime_type: Literal["text/plain"]
+    """MIME type of the file. Required for base64."""
+
+    base64: str


Required base64 field conflicts with optional text field. If text is optional when base64 is provided, then base64 should probably be NotRequired as well to allow for text-only usage.

Suggested change

-    base64: str
+    base64: NotRequired[str]
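
For context, a small sketch of what marking payload fields as NotRequired implies. The class name and the runtime helper are hypothetical; the field names (`text`, `base64`, `file_id`) are taken from the snippets quoted in this thread. A TypedDict cannot express "at least one of these keys", so that constraint would need a runtime check like the one below.

```python
from typing import Literal

try:
    from typing import NotRequired, TypedDict  # Python 3.11+
except ImportError:  # older interpreters
    from typing_extensions import NotRequired, TypedDict


class PlainTextBlockSketch(TypedDict):
    """Hypothetical block whose payload may arrive in one of several keys."""

    mime_type: Literal["text/plain"]
    text: NotRequired[str]
    base64: NotRequired[str]
    file_id: NotRequired[str]


def has_payload(block: PlainTextBlockSketch) -> bool:
    # Static typing accepts a block with none of the optional keys,
    # so "at least one payload key" has to be enforced at runtime.
    return any(key in block for key in ("text", "base64", "file_id"))
```

With every payload key optional, a text-only or ID-only block type-checks, at the cost of also admitting an empty block unless something like `has_payload` is called.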

Copilot · 2025-07-21T19:26:15Z

libs/core/langchain_core/messages/content_blocks.py

+    base64: str
+    """Data as a base64 string."""


Required base64 field is inconsistent with the presence of optional file_id field. If content can be provided via file_id, then base64 should be NotRequired to allow for ID-only usage.

Suggested change

-    base64: str
-    """Data as a base64 string."""
+    base64: NotRequired[str]
+    """Data as a base64 string. Optional if `file_id` is provided."""

Copilot · 2025-07-21T19:26:16Z

libs/core/langchain_core/messages/content_blocks.py

    except ValidationError:
        return False
    else:
        return True


+# These would need to be refactored
 def convert_to_openai_image_block(content_block: dict[str, Any]) -> dict:
    """Convert image content block to format expected by OpenAI Chat Completions API."""
    if content_block["source_type"] == "url":


This function references the old source_type field which has been removed in the refactor. This will cause a KeyError when called. The function needs to be updated to work with the new unified type system.
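
One possible shape for an updated converter, sketched under assumptions: new-style image blocks are assumed to carry either a `url` key or `base64` plus `mime_type`, and legacy blocks are assumed to carry `source_type` with the payload under `url` or `data`. The function name is hypothetical and this is not the implementation that landed in the PR. The output follows the `image_url` part format of the OpenAI Chat Completions API.

```python
from typing import Any


def convert_image_block_sketch(block: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical converter that tolerates both block layouts."""
    # .get() avoids the KeyError that block["source_type"] would raise
    # on blocks that no longer carry the field.
    source_type = block.get("source_type")

    if source_type == "url" or (source_type is None and "url" in block):
        return {"type": "image_url", "image_url": {"url": block["url"]}}

    if source_type == "base64" or (source_type is None and "base64" in block):
        # Assumed: legacy blocks keep the payload under "data",
        # newer ones under "base64".
        payload = block.get("base64", block.get("data"))
        mime_type = block.get("mime_type")
        if payload is None or mime_type is None:
            raise ValueError("base64 image blocks need a payload and a mime_type")
        return {
            "type": "image_url",
            "image_url": {"url": f"data:{mime_type};base64,{payload}"},
        }

    raise ValueError("unsupported image block")
```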

ccurme · 2025-07-21T20:28:12Z

libs/core/langchain_core/messages/content_blocks.py


    Returns:
        True if the content block is a data content block, False otherwise.
    """
    try:
-        _ = _DataContentBlockAdapter.validate_python(content_block)
+        _DataAdapter.validate_python(block)
    except ValidationError:
        return False


If we return False or block.get("type") in ("audio", "image", "video") and "source_type" in block, I believe we keep backwards compat.
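
A sketch of that fallback, with the adapter validation replaced by a hypothetical stand-in so the example stays self-contained. `_validates_as_new_block` and its acceptance rule are assumptions; in the real function this step is the `_DataAdapter.validate_python(block)` call shown in the diff.

```python
from typing import Any

LEGACY_MEDIA_TYPES = ("audio", "image", "video")


def _validates_as_new_block(block: dict[str, Any]) -> bool:
    """Hypothetical stand-in for ``_DataAdapter.validate_python``."""
    # Assumed rule for illustration only: a media type plus one payload key.
    return block.get("type") in LEGACY_MEDIA_TYPES and any(
        key in block for key in ("url", "base64", "file_id")
    )


def is_data_content_block_compat(block: dict[str, Any]) -> bool:
    """Accept new-style blocks, then fall back to the legacy layout."""
    if _validates_as_new_block(block):
        return True
    # Fallback from the comment above: legacy blocks carry both a media
    # "type" and a "source_type" key.
    return block.get("type") in LEGACY_MEDIA_TYPES and "source_type" in block
```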

v2 refactor

231d74a

mdrxy requested a review from eyurtsev as a code owner July 17, 2025 15:26

mdrxy requested review from ccurme and sydney-runkle July 17, 2025 15:26

vercel bot deployed to Preview July 17, 2025 15:35 View deployment

sydney-runkle reviewed Jul 17, 2025

ccurme reviewed Jul 17, 2025

mdrxy commented Jul 17, 2025

fix: rename block_type to annotation_type in citation classes and update ReasoningContentBlock structure

e4b5d59

vercel bot deployed to Preview July 17, 2025 19:22 View deployment

mdrxy added 2 commits July 17, 2025 16:30

.

1977897

.

1ed1b16

vercel bot had a problem deploying to Preview July 17, 2025 21:05 Failure

mdrxy added 7 commits July 18, 2025 10:31

Instead of nesting tool_calls within the ReasoningContentBlock, treat each step as a distinct block in a list

032c5d4

fix: rename reasoning_effort to effort in ReasoningContentBlock

631c496

feat: add ToolOutputContentBlock for tool call results

0fb5df8

.

0590704

fix: update NonStandardContentBlock documentation and remove unnecessary fields

ac72160

.

3700d15

.

b87cac5

vercel bot deployed to Preview July 18, 2025 15:52 View deployment

sydney-runkle reviewed Jul 18, 2025

update multimodal; add context field to multimodal content blocks

72d5971

vercel bot deployed to Preview July 18, 2025 17:45 View deployment

.

43982c6

vercel bot deployed to Preview July 18, 2025 18:08 View deployment

ccurme reviewed Jul 18, 2025

mdrxy changed the title ~~refactor: standard content blocks~~ refactor(core): standard content blocks Jul 18, 2025

.

907bb58

vercel bot deployed to Preview July 18, 2025 21:09 View deployment

mdrxy added 2 commits July 21, 2025 15:18

.

5d9f4be

.

bc15a09

mdrxy requested a review from Copilot July 21, 2025 19:24

Copilot AI reviewed Jul 21, 2025

.

6772814

vercel bot deployed to Preview July 21, 2025 19:49 View deployment

.

2326766

vercel bot deployed to Preview July 21, 2025 20:26 View deployment

ccurme reviewed Jul 21, 2025

mdrxy added 2 commits July 21, 2025 20:43

bump lock

550ff5e

remove beta_content

d4a0c10

vercel bot deployed to Preview July 22, 2025 00:56 View deployment

ccurme merged commit b24f90d into standard_outputs Jul 22, 2025
53 of 66 checks passed

ccurme deleted the mdrxy/updated-content-blocks branch July 22, 2025 13:17
