bench_kvdb

A benchmarking tool designed to evaluate random read performance of traditional KV databases such as RocksDB and PebbleDB, specifically focusing on IO operations per key read (IO per GET).

This project originated from research around the new trie database design proposed in the Base triedb repo:

Traditional Key/Value stores such as LevelDB / pebble (used by geth) and MDBX (used by Reth), while extremely optimized for general-purpose arbitrary key-value workloads, are not optimal for persisting highly structured data like the Ethereum State Trie.

Traversing a state trie requires O(log N) lookup per node, but storing nodes in a generic KV store leads to a compounded cost of O(log N * log N) because the database must also perform an indexed lookup for every node.

Full reference: base/triedb

Why This Project Exists

A common assumption in blockchain discussions is that RocksDB/Pebble incur roughly O(log N) lookup cost for each GET.
Multiplied by the O(log N) path length of a trie traversal, the estimated cost becomes:

O(log N * log N)

However, this assumption may be incorrect or outdated.
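To make the naive estimate concrete, the following Go sketch multiplies the two logarithms for a given key count. The function name and the logarithm bases (16 for a hexary trie, 2 for a binary-search-style index lookup) are illustrative assumptions, not values taken from this repository.

```go
package main

import (
	"fmt"
	"math"
)

// naiveTrieReadCost returns the compounded estimate
// log_trieBase(n) * log_kvBase(n).
// Both bases are assumptions chosen for illustration only.
func naiveTrieReadCost(n, trieBase, kvBase float64) float64 {
	trieDepth := math.Log(n) / math.Log(trieBase) // nodes visited per trie lookup
	kvLookup := math.Log(n) / math.Log(kvBase)    // assumed cost of one KV GET
	return trieDepth * kvLookup
}

func main() {
	// Key counts match the dataset sizes benchmarked in this README.
	for _, n := range []float64{200e6, 2e9, 20e9} {
		fmt.Printf("N=%.0f  naive estimate=%.1f\n", n, naiveTrieReadCost(n, 16, 2))
	}
}
```

The absolute numbers depend entirely on the assumed bases; the point is only that the estimate grows with the square of log N.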

Hypothesis Tested by This Repo

KV storage engines like RocksDB & Pebble should not require full logarithmic disk I/O per GET, because:

Block cache reduces KV index lookups.

Bloom filters avoid unnecessary disk reads.

Index and table metadata are memory resident.

As a result, modern KV implementations may need only about 1–5 random I/Os per GET rather than O(log N).

This repository exists to empirically measure that.
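The reasoning behind the hypothesis can be sketched as a small back-of-the-envelope model in Go. This is a deliberately simplified model, not how RocksDB or Pebble account for I/O: it assumes filter and index blocks are always in memory, that the key lives in the deepest level, and that every parameter value shown is hypothetical.

```go
package main

import "fmt"

// expectedIOPerGet estimates disk reads per GET for a key stored in the
// deepest of `levels` LSM levels, under these simplifying assumptions:
//   - filter and index blocks are always memory resident (cost 0),
//   - each shallower level costs one wasted data-block read with
//     probability bloomFPR (a bloom filter false positive),
//   - each data-block read is served by the block cache with
//     probability cacheHit.
func expectedIOPerGet(levels int, bloomFPR, cacheHit float64) float64 {
	dataBlockReads := 1 + bloomFPR*float64(levels-1)
	return dataBlockReads * (1 - cacheHit)
}

func main() {
	// 7 levels, 1% false-positive rate, cold block cache: hypothetical values.
	fmt.Printf("%.2f\n", expectedIOPerGet(7, 0.01, 0))
}
```

The model ignores index and filter block misses, which can occur once the dataset is much larger than memory, so real measurements may come out higher than it predicts.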

Build & Run

How to Build

git clone https://github.com/QuarkChain/bench_kvdb
cd bench_kvdb/src/bench_pebble
go build

How to Run

Usage:

--i: initialize the database by inserting data (default: false)
--b: use batch inserts (default: true)
--c: cache size in MB
--T: total number of keys
--t: number of threads
--w: number of random writes
--r: number of random reads
--p: database path
--l: log level
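As a rough illustration of how flags like these could be declared with Go's standard `flag` package, here is a minimal sketch. It is not the tool's actual source: the `config` struct, the `parseArgs` helper, and every default other than those for `--i` and `--b` are hypothetical placeholders.

```go
package main

import (
	"flag"
	"fmt"
	"os"
)

// config mirrors the flags listed above. Field names are hypothetical.
type config struct {
	Init      bool
	Batch     bool
	CacheMB   int
	TotalKeys int64
	Threads   int
	Writes    int64
	Reads     int64
	DBPath    string
	LogLevel  int
}

// parseArgs parses command-line arguments into a config.
// Only the defaults for -i and -b come from the usage text;
// all other defaults are placeholders.
func parseArgs(args []string) (config, error) {
	var c config
	fs := flag.NewFlagSet("bench_pebble", flag.ContinueOnError)
	fs.BoolVar(&c.Init, "i", false, "initialize the database by inserting data")
	fs.BoolVar(&c.Batch, "b", true, "use batch inserts")
	fs.IntVar(&c.CacheMB, "c", 512, "cache size in MB (placeholder default)")
	fs.Int64Var(&c.TotalKeys, "T", 1000000, "total number of keys (placeholder default)")
	fs.IntVar(&c.Threads, "t", 1, "number of threads (placeholder default)")
	fs.Int64Var(&c.Writes, "w", 0, "number of random writes (placeholder default)")
	fs.Int64Var(&c.Reads, "r", 0, "number of random reads (placeholder default)")
	fs.StringVar(&c.DBPath, "p", "./data", "database path (placeholder default)")
	fs.IntVar(&c.LogLevel, "l", 3, "log level (placeholder default)")
	err := fs.Parse(args)
	return c, err
}

func main() {
	cfg, err := parseArgs(os.Args[1:])
	if err != nil {
		os.Exit(2)
	}
	fmt.Printf("%+v\n", cfg)
}
```

Go's `flag` package treats one and two leading dashes as equivalent, so `--i` and `-i` parse the same way. Boolean flags take no separate value, so turning batching off in this sketch would be written `--b=false`.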

Sample run 2B keys

mkdir -p ./data
./bench_pebble --i --T 2000000000 --w 0 --r 0 --l 2 > runlog/Write_2B.log
sleep 10
./bench_pebble --T 2000000000 --w 0 --l 2 --t 64 > runlog/RadmonRead_2B_1_Hot.log
sleep 10
echo 3 | sudo tee /proc/sys/vm/drop_caches
./bench_pebble --T 2000000000 --w 0 --l 2 --t 64 > runlog/RadmonRead_2B_2_Cold.log
sleep 10
./bench_pebble --T 2000000000 --w 0 --l 2 --t 64 > runlog/RadmonRead_2B_3_hot.log

Benchmark Results

PebbleDB — IO per Random Read

Random-read benchmark using 10M random keys:

 Data Count    |  Size      |  IO per Key 
---------------+------------+--------------
   200M Keys   |   22 GB    |    1.01
   2B Keys     |  226 GB    |    1.92
   20B Keys    |  2.2 TB    |    2.5

Logs: src/bench_pebble/runlog/
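To put these figures next to the O(log N) assumption, the following Go sketch prints log2(N) beside each measured value from the table. Using base 2 is an illustrative choice, and the `result` type and `log2Keys` helper are hypothetical names introduced here.

```go
package main

import (
	"fmt"
	"math"
)

// result holds one row of the table above.
type result struct {
	keys     float64 // number of keys in the database
	ioPerKey float64 // measured IO per random read
}

// log2Keys is the per-GET cost that an O(log N) assumption would suggest,
// taking base 2 as an illustrative choice.
func log2Keys(n float64) float64 {
	return math.Log2(n)
}

func main() {
	rows := []result{{200e6, 1.01}, {2e9, 1.92}, {20e9, 2.5}}
	for _, r := range rows {
		fmt.Printf("keys=%.0f  log2(N)=%.1f  measured=%.2f\n",
			r.keys, log2Keys(r.keys), r.ioPerKey)
	}
}
```

For these dataset sizes log2(N) is roughly 28 to 34, whereas the measured IO per key stays below 3.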
