diff --git a/.gitignore b/.gitignore
index 4e70a18..03554fc 100644
--- a/.gitignore
+++ b/.gitignore
@@ -51,6 +51,11 @@ experiments/
nccl_thread_sweep_*.log
nccl_thread_sweep_summary_*.txt
+# Test data and outputs (tests/gemm_analysis/)
+expected_outputs/
+testdata/
+actual_outputs/
+
# IDE/project-specific folders
.vscode/
.idea/
diff --git a/docs/comprehensive_report.html b/docs/comprehensive_report.html
index 803cb60..445aa0f 100644
--- a/docs/comprehensive_report.html
+++ b/docs/comprehensive_report.html
@@ -201,7 +201,7 @@
GEMM RCCL Overlap Comprehensive Performance Report
Complete Analysis: rocm-7.0.8-meta (Baseline) vs rocm-7.0.10-meta (Test)
Generated: 2025-12-08 14:59:23
-
+
Quick Navigation
@@ -216,13 +216,13 @@
Quick Navigation
512t/70ch
-
+
Executive Summary
This comprehensive report contains all performance analysis plots and metrics for RCCL comparisons across multiple configurations.
-
+
Test Configuration
@@ -232,650 +232,650 @@ Test Configuration
- Total Plots: 96 visualizations
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+
-
+
Base: rocm-7.0.8-meta
Test: rocm-7.0.10-meta
-
+
Overview Plots
-
+
Percentage Change Overview
-
+
Absolute Time Comparison
-
+
Performance Heatmap
-
+
Total Execution Time by Rank
-
+
-
+
Detailed Metrics
-
+
Computation Time Across Ranks
-
+
Communication Time Across Ranks
-
+
Idle Time Across Ranks
-
+
Percentage Difference All Metrics
-
+
-
+
NCCL Analysis
-
+
NCCL Latency Analysis
-
+
NCCL Summary Analysis
-
+
-
+
-
+