feat: add LLM enrichment for model name extraction by cassiocouto · Pull Request #63 · Trusera/ai-bom

cassiocouto · 2026-03-02T14:09:12Z

Summary

Closes #36 — adds optional LLM-based enrichment to extract specific model names (e.g., gpt-4o, claude-3-opus-20240229) from code snippets around detected AI components.

New --llm-enrich CLI flag with --llm-model, --llm-api-key, and --llm-base-url options
Uses litellm as a unified LLM client — supports OpenAI, Anthropic, Ollama (local), and 100+ providers
Only enriches llm_provider and model component types with empty model_name (filters out containers, tools, MCP servers, etc.)
Reads ~20 lines of source context around detection sites for accurate extraction
Cross-references extracted names against the built-in model registry for provider/deprecation metadata
Batched LLM calls (default 5 per request) with graceful fallback to individual calls on error
Dependency-missing guard: clear error with install hint when litellm is not installed
Privacy warning for non-local models; recommends ollama/* for sensitive codebases
39 new tests (unit + CLI integration), all mocked — no real LLM calls in tests
New docs/enrichment.md with usage guide, privacy section, and cost guidance
Also fixes two pre-existing test failures (test_demo_command, test_sarif_relative_path_calculation on Windows)

Test plan

pytest tests/test_enrichment/ — 39 tests pass (prompt templates, JSON parsing, component filtering, batch/single enrichment, error handling, CLI flags, privacy warnings)
pytest tests/ — full suite: 786 passed, 0 failed
ruff check — no lint errors on new/modified files
Manual test with --llm-enrich --llm-model ollama/llama3 against a real project (requires Ollama running locally)

Zie619

Great feature addition! Well-designed and thoroughly tested:

Clean module structure (enrichment/ package with separate prompts, enricher, init)
Smart filtering — only enriches llm_provider/model types with empty model_name
Privacy-conscious with Ollama recommendation and cloud API warnings
Batched LLM calls with graceful fallback to individual calls on error
Model registry cross-referencing for provider/deprecation metadata
39 mocked tests covering all paths (parsing, batching, errors, CLI integration)
Good docs in enrichment.md
Also fixes the SARIF relative path bug

LGTM — merging once CI passes.

Zie619 · 2026-03-14T16:04:24Z

CI caught formatting issues — 3 files need ruff format:

src/ai_bom/enrichment/llm_enricher.py
tests/test_enrichment/test_cli_llm_enrich.py
tests/test_enrichment/test_llm_enricher.py

Quick fix:

ruff format src/ai_bom/enrichment/llm_enricher.py tests/test_enrichment/test_cli_llm_enrich.py tests/test_enrichment/test_llm_enricher.py

Once formatted, this is ready to merge!

cassiocouto added 2 commits March 2, 2026 10:59

feat: new llm-enrichment for ai-bom

cd6653a

fix: fixing tests

7c582d2

cassiocouto requested a review from Zie619 as a code owner March 2, 2026 14:09

Zie619 approved these changes Mar 14, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add LLM enrichment for model name extraction#63

feat: add LLM enrichment for model name extraction#63
cassiocouto wants to merge 2 commits intoTrusera:mainfrom
cassiocouto:feat/llm-enrichment

cassiocouto commented Mar 2, 2026 •

edited

Loading

Uh oh!

Zie619 left a comment

Uh oh!

Zie619 commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cassiocouto commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

Zie619 left a comment

Choose a reason for hiding this comment

Uh oh!

Zie619 commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cassiocouto commented Mar 2, 2026 •

edited

Loading