feat(testing): add canonical run artifact for DSL test execution by AdityaShome · Pull Request #1556 · mofa-org/mofa

AdityaShome · 2026-04-01T02:41:03Z

Summary

This PR is rebased on #1555 and #1447 and adds a canonical run artifact for mofa test-dsl execution and exposes it as optional JSON output via --artifact-out.

The goal is to move the DSL path onto a stable execution artifact instead of relying only on ad hoc runner fields and formatter-specific report generation. This keeps the DSL thin while establishing the output contract needed for later baseline comparison, replay-oriented workflows, and CI artifact upload.

Context

The current DSL and CLI path can execute test cases and emit reports, but it does not yet have one stable, serializable run artifact that represents a case execution end to end.

in /tmp/dsl-artifact.json saved file:

This PR closes that gap by:

defining a canonical artifact shape for DSL backed agent runs
building that artifact from the existing AgentRunResult
separating test-case execution from assertion evaluation so assertion outcomes can be captured explicitly
allowing mofa test-dsl to write the artifact as JSON

What Changed

Added a canonical AgentRunArtifact in the root-level tests/ crate
Added serializable nested artifact types for:
- agent identity
- tool calls
- LLM request/response
- session snapshot
- workspace snapshots
- assertion outcomes
Split DSL execution from assertion evaluation
Added --artifact-out to mofa test-dsl
Updated report generation to build from the canonical artifact
Added unit and CLI integration coverage for artifact generation

Execution Flow

Files Changed

Core Files

`tests/src/artifact.rs`

Added the canonical AgentRunArtifact type and nested serializable artifact models.
Added conversion from TestCaseDsl + AgentRunResult into a stable artifact.

`tests/src/dsl.rs`

Split execution from assertion evaluation.
Added execute_test_case.
Added collect_assertion_outcomes.
Added assertion_error_from_outcomes.
Preserved existing DSL behavior while making assertion results explicit for artifact capture.

`tests/src/lib.rs`

Exported the new artifact types and DSL evaluation helpers from the root-level tests/ crate.

`crates/mofa-cli/src/cli.rs`

Execution Flow

Added --artifact-out for mofa test-dsl.

`crates/mofa-cli/src/commands/test_dsl.rs`

Builds a canonical artifact from DSL execution.
Writes artifact JSON when --artifact-out is provided.
Builds report output from the artifact-backed result path.

`crates/mofa-cli/src/main.rs`

Wired --artifact-out through CLI dispatch.

`tests/tests/artifact_tests.rs`

Added focused unit coverage for canonical artifact generation and failed assertion capture.

Supporting Files

crates/mofa-cli/tests/test_dsl_integration_tests.rs: Added CLI integration coverage for --artifact-out.

Execution Flow

Canonical Artifact Scope

The initial artifact captures:

case name
pass/fail status
output text
runner error
duration and start timestamp
execution id and session id
agent identity
assertion outcomes
tool call records
last LLM request/response
session snapshot
workspace snapshots before and after execution

CLI Usage

Examples:

mofa test-dsl tests/examples/simple_agent.toml --artifact-out dsl-artifact.json

mofa test-dsl tests/examples/tool_agent.toml --artifact-out dsl-artifact.json --report-out dsl-report.json --report-format json

Example Output

mofa test-dsl tests/examples/simple_agent.toml --artifact-out dsl-artifact.json

saved in /tmp/dsl-artifact.json saved file.

Tests

cargo test -p mofa-testing --test artifact_tests
cargo test -p mofa-testing --test dsl_tests
cargo test -p mofa-cli --test test_dsl_integration_tests
cargo test -p mofa-cli test_test_dsl_

Notes

This PR does not introduce replay or baseline comparison yet. It establishes the canonical artifact foundation those later steps can build on.

…traps support

… runner

…ests

AdityaShome added 13 commits March 23, 2026 19:41

tests: add real agent runner harness

00d3d55

Add agent runner test harness with workspace isolation and tool/boots…

aed5b97

…traps support

Add agent runner metadata, tool capture, and prompt customization

4543c4d

Add workspace snapshots, session assertions, and tool timing to agent…

dcdb83e

… runner

Add agent runner examples and session aware runner updates

4bf72fe

Normalize workspace snapshot paths for cross platform agent runner t…

5560ff7

…ests

Merge branch 'main' into agent-runner-testing

709f06e

feat(testing): add agent runner assertions

9efd792

feat(testing): add minimal TOML DSL adapter

f3768d3

feat(cli): add test dsl command for TOML agent test cases

b4d3d08

feat(cli): add report output for test dsl command

3dcceb5

feat(testing): add canonical run artifact for test dsl execution

c7099b9

test(testing): add missing bootstrap DSL fixture

70b6708

This was referenced Apr 1, 2026

feat(testing): add baseline comparison for DSL run artifacts #1558

Open

feat(testing): add structured comparison output for test-dsl #1566

Open

feat(testing): add replay backed provider support for DSL runs #1581

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(testing): add canonical run artifact for DSL test execution#1556

feat(testing): add canonical run artifact for DSL test execution#1556
AdityaShome wants to merge 13 commits intomofa-org:mainfrom
AdityaShome:testing-canonical-artifact

AdityaShome commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

AdityaShome commented Apr 1, 2026

Summary

Context

What Changed

Execution Flow

Files Changed

Core Files

tests/src/artifact.rs

tests/src/dsl.rs

tests/src/lib.rs

crates/mofa-cli/src/cli.rs

Execution Flow

crates/mofa-cli/src/commands/test_dsl.rs

crates/mofa-cli/src/main.rs

tests/tests/artifact_tests.rs

Supporting Files

Execution Flow

Canonical Artifact Scope

CLI Usage

Example Output

Tests

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

`tests/src/artifact.rs`

`tests/src/dsl.rs`

`tests/src/lib.rs`

`crates/mofa-cli/src/cli.rs`

`crates/mofa-cli/src/commands/test_dsl.rs`

`crates/mofa-cli/src/main.rs`

`tests/tests/artifact_tests.rs`