Refactor DI from extraction step into analyze+extract#110

Merged
jnu merged 4 commits into main from update-di on Apr 14, 2026
@jnu commented Apr 14, 2026

Contributor

Moves the Document Intelligence (DI) step out of the existing extract module and into a new analyze module. This lets us reuse the analyze-di module with the forthcoming ontology extraction module, though with a different model: current pipelines use read, but the ontology parser will want layout.
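As a rough illustration of the reuse described above, the sketch below configures one analyze step with two different models and places a different consumer after it. Everything here is an assumption made for illustration: `AnalyzeConfig`, `build_pipeline`, the `extract:ontology` step name, and the list-of-tuples pipeline shape are hypothetical and not taken from this PR. The only names grounded in the diff are the `document_model` and `locale` config fields. `prebuilt-read` and `prebuilt-layout` are the Azure prebuilt model IDs that "read" and "layout" presumably refer to.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AnalyzeConfig:
    """Hypothetical config for the analyze step (field names mirror the diff)."""

    document_model: str
    locale: str = "en-US"  # assumed default, not from the PR


def build_pipeline(analyze: AnalyzeConfig, consumer: str) -> list[tuple[str, dict]]:
    """Return an analyze step followed by the step that consumes its result."""
    analyze_step = (
        "analyze:azuredi",
        {"document_model": analyze.document_model, "locale": analyze.locale},
    )
    # The consumer takes no options in this sketch.
    return [analyze_step, (consumer, {})]


# Current pipelines: the read model feeds the existing extract step.
read_pipeline = build_pipeline(AnalyzeConfig("prebuilt-read"), "extract:azuredi")

# Forthcoming ontology parser: same analyze step, layout model instead.
# "extract:ontology" is a made-up step name.
layout_pipeline = build_pipeline(AnalyzeConfig("prebuilt-layout"), "extract:ontology")
```

The point of the split is that only the model ID and the downstream step differ between the two pipelines; the analyze step itself is shared.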

This is a breaking change for any pipeline defined with the extract step. Such pipelines must be updated to replace extract:azuredi with analyze:azuredi | extract:azuredi. The app introduces a custom MIME type for the AnalyzeResult, which is passed between these steps in a MemoryFile.
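The migration above could be scripted along these lines. This is a minimal sketch that assumes a pipeline can be represented as an ordered list of step-name strings; the real pipeline definition format is not shown in this PR, and `migrate_steps` plus the `fetch:example` and `redact:example` step names are hypothetical.

```python
OLD_STEP = "extract:azuredi"
ANALYZE_STEP = "analyze:azuredi"


def migrate_steps(steps: list[str]) -> list[str]:
    """Insert analyze:azuredi before each extract:azuredi that lacks one.

    Assumes steps are plain strings in execution order (an assumption,
    not the project's actual pipeline schema).
    """
    migrated: list[str] = []
    for index, step in enumerate(steps):
        already_preceded = index > 0 and steps[index - 1] == ANALYZE_STEP
        if step == OLD_STEP and not already_preceded:
            # Old-style step: split into analyze followed by extract.
            migrated.append(ANALYZE_STEP)
        migrated.append(step)
    return migrated


# Placeholder step names around the one being migrated.
old_pipeline = ["fetch:example", "extract:azuredi", "redact:example"]
new_pipeline = migrate_steps(old_pipeline)
```

Because the helper skips any extract step that already follows an analyze step, running it twice on the same pipeline leaves the result unchanged.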

Comment on lines +78 to +82
poller = self.document_analysis_client.begin_analyze_document(
    self.config.document_model,
    document=docbytes,
    locale=self.config.locale,
)

This comment was marked as outdated.

@jnu merged commit 8b6ce54 into main Apr 14, 2026
5 checks passed
@jnu deleted the update-di branch April 14, 2026 14:45