DGX-SPARK

DGX Spark research and tests — containers, benchmarks, and investigation notes for running large models on the NVIDIA DGX Spark (GB10, SM 12.1, 128 GB unified memory).

Entries address compatibility issues with CUDA 13.x, aarch64, and SM121 that aren't covered by upstream containers or documentation. Each folder is a self-contained topic; dates and environment details live inside each folder's README.

Hardware


System	NVIDIA DGX Spark
GPU	GB10 Blackwell, SM 12.1, 128 GB unified memory
CPU	20-core ARM Grace (aarch64)
CUDA	13.2, Driver 580.142
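
Since every entry assumes the configuration in the table above, it can help to confirm a host matches before trying a container. The following is a minimal sketch, not part of the repository: the helper names (sm_tag, matches_tested_setup, query_capability) are hypothetical, and it assumes the installed nvidia-smi supports the compute_cap query field.

```python
import platform
import shutil
import subprocess


def sm_tag(major: int, minor: int) -> str:
    """Format a compute capability such as (12, 1) as the arch tag 'sm_121'."""
    return f"sm_{major}{minor}"


def matches_tested_setup(machine: str, capability: tuple) -> bool:
    """True when the host is aarch64 and the GPU reports SM 12.1."""
    return machine == "aarch64" and sm_tag(*capability) == "sm_121"


def query_capability():
    """Return (major, minor) read from nvidia-smi, or None if unavailable.

    Assumption: nvidia-smi accepts --query-gpu=compute_cap on this driver.
    """
    if shutil.which("nvidia-smi") is None:
        return None
    try:
        out = subprocess.run(
            ["nvidia-smi", "--query-gpu=compute_cap", "--format=csv,noheader"],
            capture_output=True, text=True, check=True,
        ).stdout
        # Use the first GPU listed; output looks like "12.1".
        major, minor = out.splitlines()[0].strip().split(".")
        return int(major), int(minor)
    except (subprocess.CalledProcessError, OSError, IndexError, ValueError):
        return None


if __name__ == "__main__":
    cap = query_capability()
    if cap is None:
        print("Could not read compute capability from nvidia-smi")
    elif matches_tested_setup(platform.machine(), cap):
        print(f"Host matches the tested setup ({sm_tag(*cap)}, aarch64)")
    else:
        print(f"Host differs: {platform.machine()}, {sm_tag(*cap)}")
```

A mismatch does not mean an entry will fail, only that the results recorded here were not measured on that configuration.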

Status

This is hobbyist work on a single hardware configuration, so results may not generalize to other setups. The TurboQuant container patches an unmerged vLLM PR (#38479), so its API may change. Shared in case what worked here helps others with similar hardware.

Acknowledgments

eugr/spark-vllm-docker — Community vLLM container with prebuilt SM121 wheels
vLLM PR #38479 — TurboQuant attention backend by vibhavagarwal5
TurboQuant — Zandieh et al., Google Research, ICLR 2026
turboquant-torch — Community PyTorch reimplementation
NVIDIA DGX Spark Playbooks
The DGX Spark community on NVIDIA Developer Forums

Tested March 2026 — DGX Spark GB10, SM121, CUDA 13.2, Driver 580.142


Contents

1. nvfp4-guide/ — NVFP4 on DGX Spark: 120 GB → 32 GB

2. turboquant/ — TurboQuant 3-bit KV Cache Compression

3. mamba-dev/ — mamba-ssm for aarch64

4. nvfp4-landscape/ — NVFP4 on DGX Spark: Landscape Snapshot (March 2026)

5. nvfp4-memory/ — NVFP4 Memory Footprint (supporting data)

6. nemo3-super-gguf/ — Nemotron-3-Super 120B via sm_121 llama.cpp
