ROCm docs style IA and style guide updates by mattwill-amd · Pull Request #50 · AMD-AGI/Magpie

mattwill-amd · 2026-06-23T12:57:32Z

Edited and restructured docs according to our Style Guide and Diataxis information architecture guidance.

sinarafati-amd

overall looks good. left some few comments to be addressed before merging

randyh62 · 2026-06-30T21:36:44Z

+# Standalone gap analysis on existing traces
+python -m Magpie benchmark gap-analysis --trace-dir results/benchmark_vllm_<timestamp>/
+


According to Cursor, there is no gap-analysis subcommand. Magpie/main.py implements standalone gap analysis via --trace-dir on benchmark.

Suggested fix:

python -m Magpie benchmark --trace-dir results/benchmark_vllm_<timestamp>/

randyh62 · 2026-06-30T21:37:20Z

+
+Magpie's benchmark mode runs end-to-end performance tests against LLM inference frameworks—vLLM, SGLang, and Atom—and collects throughput and latency metrics in a structured JSON report. Benchmarks can run inside a Docker container, directly on the host, or on a remote Ray cluster, and optionally capture torch profiler traces for deeper analysis with TraceLens and gap analysis. Use this mode to measure inference performance on AMD Instinct™ GPUs and identify the GPU kernels that dominate runtime.
+
+Review these topics for more information:


Suggested change

Review these topics for more information:

For more information, see the following topics:

randyh62 · 2026-06-30T21:42:20Z

+
+## Benchmark report
+
+The primary summary file is **`benchmark_report.json`** in the run workspace (see `WorkspaceManager.save_report`). It aggregates throughput, latency, and optional `gap_analysis` / `tracelens_analysis` sections. A typical shape (abbreviated, with `...` marking elided values):


It is not clear what "see WorkspaceManager.save_report" is directing the user to?

randyh62 · 2026-06-30T21:47:12Z

+}
+```
+
+## More info


Not sure why we need both More Info and Related Sources.

randyh62 · 2026-06-30T21:48:53Z

+- [Automatic GPU selection in Magpie's benchmark mode](automatic-gpu.md) — how Magpie picks idle GPUs before launching and how to override or disable selection
+- [Persistent server reuse (local) in Magpie's benchmark mode](persistent-server-reuse.md) — keep a server alive across runs to avoid model reload overhead
+- [Profiling options in Magpie's benchmark mode](profiling-options.md) — configure torch profiler, TraceLens, and gap analysis
+- [Analyze and compare kernels with Magpie](../analyze-compare.md) — kernel evaluation modes (orthogonal to Benchmark)


Suggested change

- [Analyze and compare kernels with Magpie](../analyze-compare.md) — kernel evaluation modes (orthogonal to Benchmark)

- [Analyze and compare kernels with Magpie](../analyze-compare.md) — kernel evaluation modes independent of benchmark mode

randyh62 · 2026-06-30T22:08:06Z

+python -m Magpie benchmark gap-analysis \
+    --trace-dir results/benchmark_vllm_<timestamp>/torch_trace \
+    --start-pct 50 --end-pct 80 \
+    --top-k 15 \
+    --categories kernel gpu \
+    --ignore-categories gpu_user_annotation


Is there a gap-analysis sub-command?

randyh62 · 2026-06-30T23:06:22Z

+
+# Magpie troubleshooting
+
+This topic covers errors and debugging techniques. Each section presents symptoms and their solutions in a table so you can quickly find the issue you're seeing. For benchmark configuration problems not listed here, enable verbose logging with `--log-level DEBUG` and check the output before filing a bug report.


The api-reference.md line 29 mentions --verbose / -v for Debug logging, but the later Config settings discusses logging at line 162, but without any levels or options discussed. It is not clear from the api-reference, what the correct approach is for detailed logging for debug purposes?

ROCm docs style IA and style guide updates

73c9f1a

mattwill-amd requested review from haofrank, irvineoy and sinarafati-amd as code owners June 23, 2026 12:57

mattwill-amd commented Jun 23, 2026

View reviewed changes

Comment thread docs/how-to/benchmarking/benchmark.md

sinarafati-amd previously approved these changes Jun 23, 2026

View reviewed changes

Comment thread docs/index.rst Outdated

Comment thread docs/what-is-magpie.rst Outdated

Comment thread docs/how-to/benchmarking/benchmark.md

Resolving links

f7b6c20

mattwill-amd dismissed sinarafati-amd’s stale review via f7b6c20 June 26, 2026 09:13

sinarafati-amd approved these changes Jun 26, 2026

View reviewed changes

mattwill-amd marked this pull request as draft June 29, 2026 12:30

randyh62 approved these changes Jul 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ROCm docs style IA and style guide updates#50

ROCm docs style IA and style guide updates#50
mattwill-amd wants to merge 2 commits into
mainfrom
rocm-docs-review

mattwill-amd commented Jun 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

sinarafati-amd left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

randyh62 Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		# Standalone gap analysis on existing traces
		python -m Magpie benchmark gap-analysis --trace-dir results/benchmark_vllm_<timestamp>/


		Magpie's benchmark mode runs end-to-end performance tests against LLM inference frameworks—vLLM, SGLang, and Atom—and collects throughput and latency metrics in a structured JSON report. Benchmarks can run inside a Docker container, directly on the host, or on a remote Ray cluster, and optionally capture torch profiler traces for deeper analysis with TraceLens and gap analysis. Use this mode to measure inference performance on AMD Instinct™ GPUs and identify the GPU kernels that dominate runtime.

		Review these topics for more information:

	Review these topics for more information:
	For more information, see the following topics:


		## Benchmark report

		The primary summary file is `benchmark_report.json` in the run workspace (see `WorkspaceManager.save_report`). It aggregates throughput, latency, and optional `gap_analysis` / `tracelens_analysis` sections. A typical shape (abbreviated, with `...` marking elided values):

	- [Analyze and compare kernels with Magpie](../analyze-compare.md) — kernel evaluation modes (orthogonal to Benchmark)
	- [Analyze and compare kernels with Magpie](../analyze-compare.md) — kernel evaluation modes independent of benchmark mode


		# Magpie troubleshooting

		This topic covers errors and debugging techniques. Each section presents symptoms and their solutions in a table so you can quickly find the issue you're seeing. For benchmark configuration problems not listed here, enable verbose logging with `--log-level DEBUG` and check the output before filing a bug report.

Uh oh!

Conversation

mattwill-amd commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

sinarafati-amd left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

randyh62 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mattwill-amd commented Jun 23, 2026 •

edited

Loading