Feat/error stack spike by cds-amal · Pull Request #304 · solana-foundation/txtx

cds-amal · 2025-07-04T18:23:02Z

Error-Stack Integration Spike for txtx

Overview

This document explores integrating the error-stack library into the txtx project to enhance error reporting with structured, context-rich error handling.

Why error-stack?

The error-stack library addresses several weaknesses identified in our initial audit:

Rich Context: Automatically captures and preserves error context as it propagates
Structured Reporting: Type-safe error boundaries with explicit context changes
Attachments: Can attach arbitrary data (documentation, examples, spans) to errors
Backtraces: Automatic backtrace capture for debugging
Error Chaining: Built-in support for error chains with parent relationships

How to test

You can now see the new error-stack implementation in action by:

Running the demo example:

cargo run --example error_stack_demo --package txtx-addon-kit

Running tests with verbose output:

cargo test --package txtx-addon-kit errors::demo::tests::test_process_action_invalid_address -- --nocapture

Looking at the comparison in error-spike-docs/comparison_example.md

The key improvements you'll see:

Rich Error Context: Instead of just "unable to parse address", you get the full story with location, documentation, and examples
Error Chains: You can see how errors propagate through the system with context added at each level
Structured Attachments: Type-safe additional information like account balances, transaction details, etc.
Actionable Messages: Clear guidance on how to fix issues

The demo shows four different error scenarios:

Missing required input with documentation
Type mismatch with expected vs actual types
Validation error with full action context
Network error showing propagation chain

Each demonstrates how error-stack provides significantly better error reporting than the current Diagnostic approach.

Integration Strategy

Phase 1: Define Core Error Types

Replace the current Diagnostic with error-stack based types:

use std::fmt;

use error_stack::{Context, Report, ResultExt};

#[derive(Debug)]
pub enum TxtxError {
    Parsing,
    Validation,
    Execution,
    Network,
    TypeMismatch,
    MissingInput,
}

impl fmt::Display for TxtxError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            TxtxError::Parsing => write!(f, "Failed to parse runbook"),
            TxtxError::Validation => write!(f, "Validation failed"),
            TxtxError::Execution => write!(f, "Execution failed"),
            TxtxError::Network => write!(f, "Network operation failed"),
            TxtxError::TypeMismatch => write!(f, "Type mismatch"),
            TxtxError::MissingInput => write!(f, "Missing required input"),
        }
    }
}

impl Context for TxtxError {}

Phase 2: Create Domain-Specific Error Types

For each addon and major component:

// For the EVM addon
use std::fmt;

use error_stack::Context;

#[derive(Debug)]
pub enum EvmError {
    InvalidAddress,
    TransactionFailed,
    ContractDeploymentFailed,
    InsufficientFunds,
}

impl fmt::Display for EvmError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            EvmError::InvalidAddress => write!(f, "Invalid Ethereum address"),
            EvmError::TransactionFailed => write!(f, "Transaction failed"),
            EvmError::ContractDeploymentFailed => write!(f, "Contract deployment failed"),
            EvmError::InsufficientFunds => write!(f, "Insufficient funds"),
        }
    }
}

impl Context for EvmError {}

Phase 3: Attachment Types for Rich Context

Define structured attachments to replace current diagnostic fields:

#[derive(Debug)]
pub struct ErrorLocation {
    pub file: String,
    pub line: u32,
    pub column: u32,
}

#[derive(Debug)]
pub struct ErrorDocumentation {
    pub help: String,
    pub example: Option<String>,
    pub link: Option<String>,
}

#[derive(Debug)]
pub struct ErrorSpan {
    pub start: usize,
    pub end: usize,
    pub source_text: String,
}

#[derive(Debug)]
pub struct ActionContext {
    pub action_name: String,
    pub namespace: String,
    pub construct_id: String,
}
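
One detail worth checking before settling on these structs: error-stack's default report output prints attachments added with attach_printable, but only counts opaque attachments added with attach unless a debug hook is installed. If the location and action context should show up in the rendered report, giving them Display implementations is one option. The sketch below uses only the standard library; the two output formats are assumptions, not something the spike defines.

```rust
use std::fmt;

// Redeclared from the Phase 3 listing so the sketch compiles on its own.
#[derive(Debug)]
pub struct ErrorLocation {
    pub file: String,
    pub line: u32,
    pub column: u32,
}

#[derive(Debug)]
pub struct ActionContext {
    pub action_name: String,
    pub namespace: String,
    pub construct_id: String,
}

// Assumed format: `file:line:column`.
impl fmt::Display for ErrorLocation {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "{}:{}:{}", self.file, self.line, self.column)
    }
}

// Assumed format: `namespace::action_name (construct_id)`.
impl fmt::Display for ActionContext {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "{}::{} ({})", self.namespace, self.action_name, self.construct_id)
    }
}
```

With Display in place, a value of either type satisfies the bounds of attach_printable as well as attach, so the choice between the two can be made per call site.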

Phase 4: Error Creation Patterns

Replace diagnosed_error! with error-stack patterns:

// Current pattern:
diagnosed_error!("unable to parse contract_id ({})", id)

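// New pattern with error-stack.
// Assumes `ContractId::from_str` returns an error type that implements `std::error::Error`.
use std::str::FromStr;

use error_stack::{Report, ResultExt};

fn parse_contract_id(id: &str) -> Result<ContractId, Report<EvmError>> {
    ContractId::from_str(id)
        .change_context(EvmError::InvalidAddress)
        // Lazy variant: the message is only formatted on the error path.
        .attach_printable_lazy(|| format!("contract_id: {}", id))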
        .attach(ErrorDocumentation {
            help: "Contract IDs must be valid Ethereum addresses (0x followed by 40 hex characters)".into(),
            example: Some("0x1234567890abcdef1234567890abcdef12345678".into()),
            link: Some("https://docs.txtx.io/evm/addresses".into()),
        })
}
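
The help text above describes the expected shape as "0x followed by 40 hex characters". A minimal shape check matching that sentence could look like the sketch below. The function name looks_like_evm_address is hypothetical, and the real ContractId::from_str may validate more (or differently) than this does.

```rust
/// Returns true when `s` is `0x` followed by exactly 40 ASCII hex digits.
/// This checks the shape only; it does not verify an EIP-55 checksum.
pub fn looks_like_evm_address(s: &str) -> bool {
    match s.strip_prefix("0x") {
        Some(hex) => hex.len() == 40 && hex.bytes().all(|b| b.is_ascii_hexdigit()),
        None => false,
    }
}
```

A check like this makes it easy to keep the help message and the actual validation rule in agreement, since both can be exercised by the same test inputs.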

Phase 5: Error Propagation

Update error propagation to preserve context:

// In action execution
pub async fn execute_action(
    &self,
    inputs: &CommandInputs,
) -> Result<CommandExecutionResult, Report<TxtxError>> {
    let address = self.parse_address(&inputs.address)
        .change_context(TxtxError::Execution)
        .attach_printable("Failed to execute deploy_contract action")
        .attach(ActionContext {
            action_name: self.name.clone(),
            namespace: "evm".into(),
            construct_id: self.id.clone(),
        })?;
    
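    let result = self.deploy_contract(address)
        .await
        // Switch to the core context before `?`: a `Report<EvmError>` does not
        // convert into a `Report<TxtxError>` on its own.
        .change_context(TxtxError::Network)
        .attach_printable("Network call failed")?;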
    
    Ok(result)
}
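
For readers new to the idea, the layering that change_context preserves can be approximated with the standard library alone. The sketch below is not how error-stack is implemented; it only illustrates an outer error keeping the inner one reachable through source(). The types InvalidAddress and ExecutionFailed and the helper chain_messages are hypothetical names introduced for this illustration.

```rust
use std::error::Error;
use std::fmt;

// Hypothetical low-level failure, standing in for an addon error.
#[derive(Debug)]
struct InvalidAddress(String);

impl fmt::Display for InvalidAddress {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "Invalid Ethereum address: {}", self.0)
    }
}

impl Error for InvalidAddress {}

// Hypothetical higher-level failure that keeps the lower one as its source.
#[derive(Debug)]
struct ExecutionFailed {
    action: String,
    source: Box<dyn Error + 'static>,
}

impl fmt::Display for ExecutionFailed {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "Execution failed in action `{}`", self.action)
    }
}

impl Error for ExecutionFailed {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        Some(&*self.source)
    }
}

/// Collects messages from the outermost error down to the root cause.
fn chain_messages(err: &(dyn Error + 'static)) -> Vec<String> {
    let mut messages = Vec::new();
    let mut current = Some(err);
    while let Some(e) = current {
        messages.push(e.to_string());
        current = e.source();
    }
    messages
}
```

Compared with this hand-rolled chain, the Report type in the listing above also carries attachments per layer, which is the part the Diagnostic approach has no direct equivalent for.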

Phase 6: Error Display

Implement rich error display using error-stack's formatting:

pub fn display_error(error: &Report<TxtxError>) {
    // error-stack provides detailed formatting out of the box
    eprintln!("{:?}", error);
    
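    // Custom formatting for specific attachments.
    // `Report::request_ref` is nightly-only and returns an iterator, so the
    // frames are searched with `Frame::downcast_ref` instead.
    if let Some(location) = error
        .frames()
        .find_map(|frame| frame.downcast_ref::<ErrorLocation>())
    {
        eprintln!("  at {}:{}:{}", location.file, location.line, location.column);
    }
    
    if let Some(docs) = error
        .frames()
        .find_map(|frame| frame.downcast_ref::<ErrorDocumentation>())
    {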
        eprintln!("\nHelp: {}", docs.help);
        if let Some(example) = &docs.example {
            eprintln!("Example:\n{}", example);
        }
    }
}
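
Because display_error writes straight to stderr, its formatting is hard to assert on in tests. One possible refinement, sketched here with only the standard library, is to build the help text as a String first and print it afterwards. The function name render_help and the "See:" line for the link are assumptions; the Help and Example lines follow the listing above.

```rust
// Redeclared from the Phase 3 listing so the sketch compiles on its own.
#[derive(Debug)]
pub struct ErrorDocumentation {
    pub help: String,
    pub example: Option<String>,
    pub link: Option<String>,
}

/// Builds the "Help / Example" text as a String so it can be tested,
/// and appends the documentation link when one is present.
pub fn render_help(docs: &ErrorDocumentation) -> String {
    let mut out = format!("Help: {}", docs.help);
    if let Some(example) = &docs.example {
        out.push_str("\nExample:\n");
        out.push_str(example);
    }
    if let Some(link) = &docs.link {
        out.push_str("\nSee: ");
        out.push_str(link);
    }
    out
}
```

The CLI could then pass the returned string to eprintln!, while unit tests compare it against expected text.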

Migration Plan

Step 1: Add Dependency

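[dependencies]
# Note: `default-features = false` turns off error-stack's default features,
# which may also disable automatic backtrace capture. Confirm the feature set
# if the spike relies on backtraces.
error-stack = { version = "0.5", default-features = false }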

Step 2: Create Compatibility Layer

During migration, create a compatibility layer:

impl From<Diagnostic> for Report<TxtxError> {
    fn from(diag: Diagnostic) -> Self {
        let base_error = match diag.level {
            DiagnosticLevel::Error => TxtxError::Execution,
            _ => TxtxError::Execution, // Map appropriately
        };
        
        let mut report = Report::new(base_error)
            .attach_printable(diag.message);
        
        if let Some(location) = diag.location {
            report = report.attach(ErrorLocation {
                file: location.to_string(),
                line: diag.span.as_ref().map(|s| s.line_start).unwrap_or(0),
                column: diag.span.as_ref().map(|s| s.column_start).unwrap_or(0),
            });
        }
        
        if let Some(doc) = diag.documentation {
            report = report.attach(ErrorDocumentation {
                help: doc,
                example: diag.example,
                link: None,
            });
        }
        
        report
    }
}
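
The wildcard arm in the conversion above maps every diagnostic level to TxtxError::Execution. One way to make that decision explicit is an exhaustive match, sketched below with the standard library only. The DiagnosticLevel variants shown here are an assumption about the existing enum, TxtxError is a trimmed copy of the Phase 1 type, and error_kind_for is a hypothetical helper name.

```rust
// Hypothetical mirror of the diagnostic levels; the real `DiagnosticLevel`
// in txtx may have different variants.
#[derive(Debug, Clone, Copy)]
pub enum DiagnosticLevel {
    Note,
    Warning,
    Error,
}

// Trimmed copy of the Phase 1 enum, with `PartialEq` added for assertions.
#[derive(Debug, PartialEq)]
pub enum TxtxError {
    Validation,
    Execution,
}

/// Maps a level to an error category, or `None` when the diagnostic is not a failure.
pub fn error_kind_for(level: DiagnosticLevel) -> Option<TxtxError> {
    match level {
        DiagnosticLevel::Error => Some(TxtxError::Execution),
        // Assumption: notes and warnings are informational and should not become reports.
        DiagnosticLevel::Warning | DiagnosticLevel::Note => None,
    }
}
```

Without a wildcard arm, adding a new level later becomes a compile error at this match rather than a silent fall-through to Execution.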

Step 3: Gradual Migration

Start with core error types in txtx-core
Migrate one addon at a time
Update CLI error display
Remove old Diagnostic type once migration is complete

Benefits

Automatic Context: Error context is automatically preserved and enhanced as errors propagate
Type Safety: Strongly typed error boundaries prevent context loss
Rich Attachments: Can attach any data type for debugging
Better Stack Traces: Automatic backtrace capture
Standardized Display: Consistent error formatting out of the box

Considerations

Learning Curve: Team needs to understand error-stack patterns
Refactoring Effort: Significant changes to error handling code
Dependency: Adds external dependency (though well-maintained)
API Changes: Public APIs returning Diagnostic will need updates

Example: Before and After

Before (Current Diagnostic)

fn deploy_contract(&self, address: &str) -> Result<(), Diagnostic> {
    let parsed = parse_address(address)
        .map_err(|e| diagnosed_error!("unable to parse address: {}", e))?;
    
    do_deploy(parsed)
        .map_err(|e| diagnosed_error!("deployment failed: {}", e))
}

After (With error-stack)

fn deploy_contract(&self, address: &str) -> Result<(), Report<EvmError>> {
    let parsed = parse_address(address)
        .change_context(EvmError::InvalidAddress)
        .attach_printable(format!("address: {}", address))
        .attach(ErrorDocumentation {
            help: "Ensure the address starts with 0x and contains 40 hex characters".into(),
            example: Some("0x742d35Cc6634C0532925a3b844Bc9e7595f89590".into()),
            link: None,
        })?;
    
    do_deploy(parsed)
        .change_context(EvmError::ContractDeploymentFailed)
        .attach_printable("Check that the contract bytecode is valid")
        .attach(ActionContext {
            action_name: "deploy_contract".into(),
            namespace: "evm".into(),
            construct_id: self.id.clone(),
        })
}

Recommendation

The error-stack library would significantly improve txtx's error reporting by:

Providing structured, context-rich errors
Enforcing good error handling practices
Offering better debugging capabilities
Standardizing error handling across the codebase

The migration effort is substantial but can be done incrementally, starting with new code and gradually updating existing error handling.

…orting

- Add error-stack dependency to txtx-addon-kit and evm addon
- Create core error types (TxtxError) with structured attachments
- Implement ErrorAttachments trait for fluent error enhancement
- Add compatibility layer for migrating from Diagnostic
- Create EVM-specific error types demonstrating domain errors
- Add comprehensive tests (18 passing) and demonstrations
- Document implementation strategy and migration path

This spike demonstrates how error-stack can provide:

- Rich context preservation through error propagation
- Type-safe error boundaries
- Actionable error messages with documentation
- Better debugging with automatic backtraces
- Incremental migration from existing error handling

- Add CliError enum with common CLI error scenarios
- Create rich attachments (ManifestInfo, RunbookContext, OutputInfo, etc.)
- Implement enhanced error display with recovery suggestions
- Add migration examples showing before/after patterns
- Create working demo showing improved error messages

Common errors now provide:

- Clear context about what failed
- Available alternatives (e.g., valid runbooks)
- Actionable recovery steps
- Links to documentation
- Graceful fallbacks for output failures

This migration demonstrates how error-stack dramatically improves the user experience by transforming cryptic errors into helpful guidance.

MicaiahReid · 2025-07-07T14:36:52Z

This is suuuuuuuuper cool.

It would be a daunting task indeed to update the whole repo, but it would be incredible to have this in place.

The main thing that's daunting is that the "gradual migration" isn't really possible.

All of the addon functions implement a trait, which currently returns our Diagnostic type. For txtx-core to be using error stack, the traits would need to return that type of error, and that means that every addon also needs to be updated.

cds-amal added 3 commits July 4, 2025 14:14

docs: add crates and error-spike-docs

db9519a

MicaiahReid force-pushed the main branch 3 times, most recently from 92ccc36 to 2468c3b · March 25, 2026 20:34

cds-amal wants to merge 3 commits into solana-foundation:main from cds-amal:feat/error-stack-spike
