
feat: add blob encoding benchmark and profiling harness#3085

Open
wbbradley wants to merge 3 commits into main from wbbradley/profiling-encoding

Conversation

Contributor

@wbbradley wbbradley commented Mar 8, 2026

Description

Adds phase-level benchmarking and profiling infrastructure for the blob encoding pipeline (encode_with_metadata()) at production parameters (n_shards=1000).

  • examples/profile_encoding.rs — Standalone profiling binary for use with samply record or cargo flamegraph. Accepts --size, --shards, --iterations flags. Reports wall-clock time and throughput (MiB/s).
  • benches/encoding_phases.rs — Criterion benchmark measuring individual phases: secondary encoding, primary encoding, primary encoding + hashing, metadata/Merkle tree construction, and full pipeline. Blob sizes: 1MiB, 32MiB, 256MiB.
  • Makes leaf_hash public in merkle.rs so benchmarks can measure hashing independently.
  • Adds clap dev-dependency for the profiling binary's CLI.

Test plan

  • Verified compilation via chk (formatting, clippy).
  • Ran cargo nextest run -p walrus-core (232 tests pass).
  • Ran cargo bench -p walrus-core --bench encoding_phases to verify benchmarks execute correctly.
  • Ran cargo build --release --example profile_encoding && ./target/release/examples/profile_encoding to verify the profiling binary works.

Release notes

  • Storage node:
  • Aggregator:
  • Publisher:
  • CLI:

@wbbradley wbbradley force-pushed the wbbradley/profiling-encoding branch from 89f3b76 to 19db448 on March 9, 2026 03:53
Contributor

@mlegner mlegner left a comment


Thanks a lot for expanding our benchmarking/profiling toolbox. A few questions, mainly about code duplication.

> Verified compilation via chk (formatting, clippy).

Where is that chk defined?

Comment on lines 47 to +54
[[bench]]
name = "blob_encoding"
harness = false

[[bench]]
name = "encoding_phases"
harness = false
Contributor


Question: What is the relationship to the existing benchmarks? Can we combine them?

Comment on lines +39 to +52
fn parse_size(s: &str) -> Result<usize, String> {
    let s = s.to_lowercase();
    let (num, mult) = if let Some(n) = s.strip_suffix('g') {
        (n, 1 << 30)
    } else if let Some(n) = s.strip_suffix('m') {
        (n, 1 << 20)
    } else if let Some(n) = s.strip_suffix('k') {
        (n, 1 << 10)
    } else {
        (s.as_str(), 1)
    };
    let n: usize = num.parse().map_err(|e| format!("invalid size: {e}"))?;
    Ok(n * mult)
}
Contributor


Hint: We already have a struct that does this in the walrus-service crate. That could be moved to walrus-core or walrus-utils.

Contributor


AFAICT, many of the sub-benchmarks here mostly copy some code from crates/walrus-core/src/encoding/blob_encoding.rs. Can we instead create some functions there that are called in both the production code and here in the benchmarks? In that case we probably also don't have to export the leaf_hash function.

Add phase-level criterion benchmarks (encoding_phases) that measure
secondary encoding, primary encoding, hashing, and metadata construction
independently at production parameters (n_shards=1000). Add a standalone
profiling binary (profile_encoding) designed for use with samply/flamegraph
without criterion overhead. Make leaf_hash public to support external
benchmarking of hashing costs.
@wbbradley wbbradley force-pushed the wbbradley/profiling-encoding branch from 19db448 to f13ab19 on March 10, 2026 00:51
Add heap peak tracking via peakmem-alloc and RSS peak via
libc::getrusage() to the profiling binary. Each iteration now reports
peak_heap, peak_rss, and heap expansion ratio. Multi-iteration runs
report max_peak_heap in the summary.
Add --concurrent-blobs N flag that encodes N blobs simultaneously
using std::thread::scope, simulating multi-blob uploads. Reports
per-blob latency, total wall time, and peak memory with per-blob
expansion ratio for direct comparison with single-blob runs.
