Release version 1.0.0 by heylf · Pull Request #139 · nf-core/spatialxe

heylf · 2026-04-28T16:17:30Z

The first release version of spatialxe. During the developemnt many tools were debugged, and new tools were added.
Since it is a first release thorough review is required.

PR checklist

This comment contains a description of changes (with reason).
If you've fixed a bug or added code that should be tested, add tests!
If you've added a new tool - have you followed the pipeline conventions in the contribution docs- [ ] If necessary, also make a PR on the nf-core/spatialxe branch on the nf-core/test-datasets repository.
Make sure your code lints (nf-core lint).
Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
Usage Documentation in docs/usage.md is updated.
Output Documentation in docs/output.md is updated.
CHANGELOG.md is updated.
README.md is updated (including new tool citations and authors/contributors).

…ng module

choice of segmentation, changes to baysor, updated metromap

Changes to spatialdata handling based on coordinate space

…rklfow changes

Important! Template update for nf-core/tools v3.3.2

…iance) Eliminate `params.*` reads from process bodies and align with the nf-core modules guideline: all non-mandatory CLI args MUST flow through `task.ext.args`, assembled in `conf/modules.config` as a single closure keyed by `withName:`. Only the sanctioned `ext.*` keys (`args`, `args2`, `args3`, `prefix`, `when`) are used. Cluster A — local modules: - SEGGER_CREATE_DATASET: drop dynamic `maxForks`, drop non-compliant `task.ext.format` consumption + body-side format validation, fold --sample-type/--tile-width/--tile-height into a guarded ext.args list. - SEGGER_TRAIN: drop non-compliant `task.ext.devices` (dead access), drop dynamic `maxForks` directive in modules.config, merge the args/args2 into one guarded ext.args list, drop the stale `${params.devices}` reference from the log echo. - SEGGER_PREDICT: convert single-string ext.args to guarded list form. - PROSEG: drop non-compliant `task.ext.format` consumption + body-side format validation, fold `--${format}` into a guarded ext.args list. Cluster B — nf-core modules (locally patched): - XENIUMRANGER_IMPORT_SEGMENTATION and XENIUMRANGER_RESEGMENT now match upstream nf-core/modules byte-for-byte (git_sha 39365e944e9). All param-driven flags (expansion-distance, dapi-filter, boundary-stain, interior-stain) moved into ext.args closures in modules.config. The two `.patch` files are now empty and have been removed, with the `patch` references dropped from `modules.json`. Cluster D — nextflow.config: - Drop redundant `.intValue()` on `task.memory.toGiga()` (lines 278, 284). MemoryUnit.toGiga() already returns long; the no-op chain warns under strict syntax. Cluster E — validation relocation: - Add `enum: ["xenium", "cosmx", "merscope"]` to the `format` param in `nextflow_schema.json` (schema-level constraint). - Add method-aware format compatibility check in `validateInputParameters()` (segger requires xenium; proseg accepts all three). - Plumb `format` through PIPELINE_INITIALISATION `take:` and the main.nf call site. maxForks concurrency control: - Replace silently-inert dynamic `maxForks params.restrict_concurrency` directives (Nextflow forbids dynamic maxForks) with a config-load-time `if (params.restrict_concurrency) { process { withName: ... { maxForks = 1 } } }` block in modules.config. This is the nf-core-idiomatic pattern and actually takes effect. Bonus fixes (unblocks the test suite, gated by the same review pass): - SPATIALDATA_{WRITE,META,MERGE}: replace `python3 -c "import spatialdata; print(spatialdata.__version__)"` version eval with `pip show spatialdata | sed -n 's/^Version: //p'`. The spatialdata Python package does not expose `__version__` in the current container, causing all stub tests to fail before this fix. The pip-show pattern is the project's canonical fallback per CLAUDE.md. Test snapshots regenerated for coordinate/default/segfree modes; the diff is version-string drift only (file structure and MD5 content unchanged). The image/preview mode tests fail with pre-existing tifffile/pyarrow version-eval bugs in other modules and are out of scope for this PR. Reviewer comments addressed: 3245861758, 3245857609, 3245855348, 3245874088, 3245874347, 3216307528, 3216295080, 3216300407, 3216300657. Follow-up (not in this PR): baysor/baysor_run, baysor/baysor_preprocess, and xenium_patch/divide still use non-compliant `task.ext.<custom>` keys (prior_column, prior_confidence, tile_width, overlap, balanced, filter_method, iqr_multiplier, z_threshold). Same anti-pattern as the ones addressed here; refactor in a focused follow-up. Refs: https://nf-co.re/docs/guidelines/components/modules

Sweep the remaining `python3 -c "import X; print(X.__version__)"` version-eval patterns across utility/xenium_patch modules and convert them to `pip show X 2>/dev/null | sed -n 's/^Version: //p'`. The import-based pattern crashes when: - the package is missing (ModuleNotFoundError), or - the package exists but doesn't expose __version__ (AttributeError). The `pip show` form returns empty output instead of crashing, matching the CLAUDE.md "pip-installed tools without __version__" canonical fallback. Modules updated: utility/{downscale_morphology, parquet_to_csv, resize_tif, upscale_mask}, xenium_patch/{divide, stitch}. Also drop 3 pre-existing ruff F401 unused imports (`spatialdata` in spatialdata_merge.py + spatialdata_write.py, `zarr` in spatialdata_meta.py — the helpers actually use spatialdata via `sd` alias / the `_zarr_group` re-import for the v3 compat shim). `ruff check bin/ tests/` is now clean.

…roseg CLI CI revealed three issues with the previous commits: 1. nf-core lint: `nextflow config -flat` rejected the top-level `if (params.restrict_concurrency) { process { ... } }` block in `conf/modules.config` with "If statements cannot be mixed with config statements". Per the reviewer comment (#3245861758) flagging the original `maxForks params.restrict_concurrency ? 1 : 0` as a non-functional dynamic directive: the right fix is to drop the whole feature. Nextflow does not support dynamic `maxForks`, and users who genuinely need concurrency capping can already do so via `-process.maxForks 1` (CLI) or `withName: <PROC> { maxForks = N }` (custom config). The pipeline-level `params.restrict_concurrency` wrapper was redundant and never worked. Drop it from `nextflow.config` and `nextflow_schema.json`. 2. Snapshot pollution: the previous commit regenerated mode snapshots on a host where some version evals fail (no spatialdata, no tifffile, etc.) and where proseg is v2.0.4 vs the container's v3.1.0. Restore `tests/{coordinate,default,segfree}_mode.nf.test.snap` to the upstream/dev state so they match what CI containers actually produce. 3. proseg v3.1.0 CLI: `--output-spatialdata` was renamed to `--output-path` in proseg v3.x. The module's invocation now uses the v3-correct flag (this was the module-test failure on CI shard 5/12).

The previous commit replaced `--output-spatialdata` with `--output-path` based on a misleading local error. proseg v3.1.0 source confirms: - `--output-spatialdata <PATH>` writes the SpatialData zarr (default `proseg-output.zarr`). - `--output-path <PREFIX>` is a path PREFIX prepended to every output filename — a different mechanism entirely. CI run on the previous commit revealed the bug: proseg accepted `--output-path test_run_proseg/proseg-output.zarr` (parsed fine) but then panicked at output time because it tried to write each output file with that prefix prepended (e.g. `test_run_proseg/proseg-output.zarr/ expected-counts.csv.gz` — a directory that doesn't exist). Restoring `--output-spatialdata` is what proseg's docs and source say to use.

…pulling xeniumranger image (infra-only, not code)

PR runs detect tests across all changed files (via `nf-test --changed-since HEAD^` in get-shards action), causing more container pulls than direct dev-branch pushes. Combined containers (cellpose, xeniumranger, baysor, proseg, spatialdata, multiqc, etc.) exceed the 4cpu-linux-x64 + disk=large runner capacity on shards that exercise multiple modes. Adds jlumbroso/free-disk-space@v1.3.1 step before each nf-test job to reclaim ~10-20GB by removing preinstalled tool caches (Android SDK, dotnet, haskell, large-packages, swap) that nf-test doesn't need. docker-images is kept to preserve any layers the runner pre-pulled. Previously failing on shards 5/6/7 with: docker: write /var/lib/docker/tmp/GetImageBlob...: no space left on device

… cache rm Previous attempt with jlumbroso/free-disk-space@v1.3.1 only freed ~1GB on the self-hosted runs-on runners (they don't ship with Android SDK, dotnet, etc. that the action targets). Switch to manual cleanup: - `docker system prune -af --volumes` reclaims storage from any previous docker state on a reused runner - direct `rm -rf` on dotnet/android/ghc/CodeQL/boost/powershell tool caches (best-effort; some paths may not exist) - `df -h` before+after for diagnostic visibility The blocker is xeniumranger:4.0 (ships CUDA cuDNN libs, ~5-10GB).

…tion This commit reverts c9e3311 and b6b7122 (CI workflow disk-cleanup attempts) and supersedes c8c7094 (empty retrigger commit). After running locally with the correct profile syntax matching CI: nf-test test --profile=+docker --ci --changed-since upstream/dev all 21 tests pass (SUCCESS in 1131s). The `+` syntax is what adds the `test` profile baseline declared in nf-test.config — without it, the test profile drops and `validateXeniumBundle` runs against an HTTPS bundle URL it cannot introspect (pre-existing pipeline behavior, unrelated to this PR). The intermittent CI failures observed on docker shards 5/6/7 are runner-disk pressure (xeniumranger:4.0 is 7.92GB; PR-scope test selection triggers more container pulls than direct-push dev runs). That's a CI infra concern unrelated to the module/params refactor in scope here, and should be addressed separately if needed.

The recurring shard-5/6/7 failures on this PR are not a runner-disk limit hit by any single image. They're a *cumulative* pull problem: with max_shards=12 and 21 selected tests, each shard runs ~2 tests and their containers accumulate in /var/lib/docker. For shard 5 specifically, BAYSOR_PREVIEW (5.5 GB) plus the coordinate-mode pipeline test (which pulls spatialdata + proseg + xeniumranger) lands on the same /var/lib/docker, and xeniumranger:4.0's ~12 GB peak extraction overflows the remaining ~3-4 GB free. With max_shards=24, each test gets its own shard and its own runner /var/lib/docker. Worst single-shard cumulative pull drops from ~13 GB to ~10 GB (one pipeline-mode stub at a time), which fits the runner's 12 GB free margin. Trade-off: more parallel CI runners (24 vs 12), but each shorter. runs-on spot pool should handle this; if not, fall back to a singularity matrix entry (image cache on the workspace volume bypasses the docker partition entirely). Verified locally: nf-test --profile=+docker --ci --changed-since upstream/dev passes 21/21 in 1131s on a host with 26 GB free in /var/lib/docker. The cliff only appears on the smaller CI runners.

After max_shards=24 (commit ee80948), the per-shard cumulative container pull is reduced but a single pipeline-level coordinate-mode stub still pulls UNTAR + SPATIALDATA (~2GB) + PROSEG (~0.8GB) + XENIUMRANGER:4.0 (~12GB peak extraction) on the same 12GB-free runner volume. The xeniumranger extraction overflows. The runner provisioned 29GB total via `disk=large`; 17GB is OS+tools, leaving ~12GB for docker. Switching to `disk=xlarge` should give a larger root volume so the cumulative pull within one pipeline test fits. Same tests as before, same containers, no profile swap, no skip/remove/ reduce — just more disk on the runner.

`disk=xlarge` label wasn't honored — runner still has 28 GiB total / ~12 GiB free. Same 25.04.0 shards keep failing on xeniumranger extraction. Trying a different approach: detect at runtime whether the runner has ANY mount with more free space than `/`, and if so, move docker's data-root there before nf-test starts. Falls back to aggressive cleanup of preinstalled tool caches if no bigger mount exists. Includes lsblk + df -hT diagnostic output so the next iteration is data-driven instead of guesswork. Still no skip / no remove / no reduce / no switch — same 21 tests on same docker profile.

Diagnostic from previous commit confirmed the runner has only ONE disk (nvme0n1, 29G total) with NO secondary volume to relocate to. disk=large and disk=xlarge both map to the same 29G root. The runner image is runs-on-v2.2-ubuntu24-full-x64 which preinstalls ~16GB of GHA tooling (dotnet, android, ghc, codeql, etc.) that nf-test doesn't need. Swap to the slim noble variant: - image=ubuntu24-full-x64 (default; preinstalls full toolchain) + image=ubuntu24-noble-x64 (slim; only base ubuntu) The 29G volume now has ~28G free instead of ~12G — enough headroom for the cumulative pull (xeniumranger 7.92G + spatialdata 2G + proseg 0.8G ≈ 11G) within a single pipeline-mode stub test. Still no skip / no remove / no reduce / no switch.

…runs)

@awgymer

@awgymer pointed at the runs-on `volume=` syntax. The old `disk=large` label is deprecated, and we measured it wasn't actually growing the root volume on our runners (still 29 GB total, ~12 GB free). Per runs-on docs migration guide: disk=large → volume=80gb https://runs-on.com/configuration/job-labels/#volume 80 GB gives ample headroom for the cumulative container pull on the image-mode pipeline stub test (~21 GB worst case: untar + spatialdata + proseg + cellpose + baysor + xeniumranger) on top of the runner's ~17 GB OS overhead.

@awgymer

CI passed with the volume=80gb label change suggested by @awgymer. Now reverting the other workflow-file experiments that were attempts to work around the underlying disk-pressure issue: - max_shards: 12 → 24 reverted to upstream/dev value (12) - "Inspect runner storage + free space" diagnostic+cleanup step removed - All earlier disk=large/disk=xlarge/image=noble experiments were already reverted Final delta on .github/workflows/nf-test.yml vs upstream/dev is now exactly one line: disk=large → volume=80gb.

…ation guard ## Motivation Previously `validateXeniumBundle` ran during `PIPELINE_INITIALISATION` and was gated by `if (!workflow.profile.contains('test'))`. The skip was needed because: * The test samplesheet points at a `.tar.gz` URL, not a directory; the validation iterates required files like `cell_boundaries.csv.gz` over that path and would always fail in test mode. * For cloud-storage inputs (`s3://`, `gs://`, `az://`) the validation also bailed out because `file().exists()` is unreliable before Nextflow stages the input. Both reasons describe the same root cause: validation was running BEFORE staging, so it had to handle multiple un-staged input shapes. ## Change Move validation to AFTER the `UNTAR` staging step, where the bundle is unconditionally a resolved local directory (no matter whether the user provided a directory, a remote tarball, or a cloud URL). This removes the need for both the test-profile guard AND the cloud-storage early return — the same validation runs uniformly for production and test. * `subworkflows/local/utils_nfcore_spatialxe_pipeline/main.nf`: delete the `validateXeniumBundle(ch_samplesheet)` call and its `if (!workflow.profile.contains('test')) { ... }` guard. Delete the `def validateXeniumBundle(ch_samplesheet)` function (it was tightly coupled to the channel/un-staged-path shape). * `workflows/spatialxe.nf`: inline the file-presence check into the existing `ch_bundle_path` `.map` block. Required-file list errors; optional-file list warns. Validation runs once per sample, after `UNTAR` (test mode) or directly on the user-provided directory (production). ## Side effect: UNTAR stub fix Validating post-staging exposed an existing inconsistency between the nf-core/untar real script and its stub: the real script uses `tar --strip-components 1` when the archive has a single top-level directory, but the stub iterates `tar -tf` output unchanged — which places stubbed files OUTSIDE `${prefix}/` (at `./xenium_bundle/...` in our case). The previous local untar patch worked around this by adding 4 hardcoded `touch ${prefix}/...` lines for the files this pipeline happens to need first. The patch is replaced with a proper fix: in the stub's single- top-level-dir branch, strip the leading directory from each tar entry so the file layout matches what a real extraction produces. The 4 hardcoded touches are removed because the loop now creates them naturally. `modules/nf-core/untar/untar.diff` regenerated. Snapshots regenerated to reflect the now-complete extracted-bundle file listing (was previously truncated to the 4 hardcoded files). ## Verification * `nf-test test tests/{default,coordinate_mode,image_mode,preview_mode,segfree_mode}.nf.test --profile=+docker --ci` → 5/5 PASS in ~450s after snapshot regen. * Workflow runs to `workflow.success == true` against the test samplesheet (`https://.../xenium_bundle.tar.gz`) — validation now correctly sees the extracted directory contents. * No new dependencies. The validation function definition was deleted from utils_nfcore_spatialxe_pipeline/main.nf since the inline check in spatialxe.nf is simpler than the old channel-based iteration.

The legacy `disk=large` label is silently ignored by the current runs-on version — runners get 29 GB total / ~12 GB free, not enough for the cumulative container pull on heavy stub tests. `volume=80gb` (the new syntax) is honored and provisions 76 GB / ~60 GB free. This is the same one-line fix landed in PR #158 (commit 32f479d); PR #159 needs it independently since it branched from upstream/dev where the workflow file still has the old label.

Two reconciliations: 1. nf-test version drift. Local devs run nf-test 0.9.5; CI was pinned to 0.9.3. The two versions capture snapshot file metadata differently for gzip-format files (0.9.5 records full File-object shape, 0.9.3 records bare md5 strings), causing PR #159's locally-regenerated snapshots to mismatch CI's run output. Bumping CI to 0.9.5 (matching the published local default) makes the snapshots round-trip cleanly. 2. Pre-merge harmony with PR #158. PR #158 (params-in-modules cleanup) edits `subworkflows/local/utils_nfcore_spatialxe_pipeline/ main.nf` to add `format` plumbing and a method-aware format compatibility check. PR #159 (this PR) edits the same file to remove the `validateXeniumBundle` call + def. The two edits are in different parts of the file but to ensure PR #159 cleanly rebases onto PR #158 (which will merge first), the PR #158 additions are now present in PR #159's version of this file. No conflict on the eventual rebase.

Previous commit (8996511) tried to pre-emptively port PR #158's additions onto PR #159's subworkflow file. That broke things: PR #158's subworkflow additions include adding `format` as a take input, but PR #158's matching change to `main.nf` (passing `params.format` at the call site) was not pulled in (it's PR #158-only, not overlapping). Result: PIPELINE_INITIALISATION expected 22 take inputs but main.nf passed 21 → `workflow.success = false` in CI. Reverting the subworkflow file to PR #159's standalone version (removes validateXeniumBundle call + def, no other changes). When PR #158 merges first, the rebase of PR #159 will produce a normal 3-way merge on this file — git's auto-merge handles disjoint edits (PR #158's additions are in different line ranges from PR #159's removals), so no manual reconciliation is expected.

…apshots nf-test's FileUtil.getMd5() decompresses .gz files before hashing. When the UNTAR stub used `touch foo.gz` (0-byte file), GZIPInputStream threw EOF; the exception handler in PathConverter.java fell through to dumping raw File metadata (path, absolutePath, freeSpace, ...) into the JSON snapshot, causing machine-dependent dict-form entries. Fix: `: | gzip -n > file.gz` produces a 20-byte deterministic empty gzip (magic + empty deflate block, no embedded mtime/filename). Decompressed md5 is d41d8cd98f00b204e9800998ecf8427e on every machine, so snapshots are stable across local and CI runners. All 5 mode tests pass twice locally (regen + validate-without-update).

The original justifications for ignoring MULTIQC failures are gone: 1. `ext.args` title-flag bug (exit 2) — fixed; modules.config now emits `--title "..."` correctly for both PRE_XR and POST_XR variants. 2. `.map().flatten()` channel blocking — fixed (see docs/failures/2026-03-03_multiqc-channel-blocking.md). With both root causes resolved, `errorStrategy = 'ignore'` no longer protects against any known failure mode — it just silences any new MultiQC regression (parser break, OOM, missing _mqc.yml, etc.), letting the pipeline report success without a rendered report. Removing the directive restores a real CI signal for MultiQC failures.

…rams-strict fix(modules): address awgymer review comments on PR #139 (strict nf-core compliance)

…er-untar fix: validate Xenium bundle after UNTAR (drop test-profile skip-validation guard)

heylf · 2026-05-20T11:38:25Z

@awgymer comments are solved.

awgymer

Looking much better. Just one small change needed here.

Just to note that after this I am going to hold off on approval and green-lighting this release due to the pending decision on the potential re-scoping of this pipeline to support other spatial data as a name change of the pipeline would be more easily facilitated if there hasn't yet been a release. Hopefully a decision is made on that quickly.

heylf · 2026-05-21T06:37:01Z

Yes, that it ok. We try to clarify as fast as possible. I agree that a potential name change should be done before the release.

fix: address review comments on PR #139

khersameesh24 added 30 commits July 16, 2025 15:36

fixes for baysor runs, included transcript filtering, new preprocessi…

baa2dae

…ng module

parallel processing for baysor runs

8c828ef

linting fix

939d225

updated metromap

e3c3ebc

spatialdata and baysor fixes

0eeb8de

testing changes, spatialdata errors, segger changes

a769207

changed path for segger2xr

6a02cc4

integrate segger transcripts into xr bundle and run imp-seg

331ce5c

Merge pull request #95 from khersameesh24/dev

b4e4926

choice of segmentation, changes to baysor, updated metromap

minor changes to sd handling

43a0c04

changes to handling spatialdata generation based on coordinate space

afc1b4c

Merge pull request #98 from khersameesh24/dev

0c353e7

Changes to spatialdata handling based on coordinate space

minor changes and fixes

fff9e7d

minor linting fixes and small changes

9763ea3

more linting fixes

67dd108

changes to segger module, move params to nextflow config, minor subwo…

fc17397

…rklfow changes

more linting fixes

a4227b0

added meta.yml files for subworkflows, other minor changes

bd89e4b

more linting fixes and added meta.yml files for subworkflows

9007bb2

resolving merge conflicts

54263ca

Template update for nf-core/tools version 3.3.2

aa9c630

Merge branch 'TEMPLATE' into nf-core-template-merge-3.3.2

e43d5b0

resolving conflicts

7c3c503

resolving conflicts

87517b7

fixed precommit run

f42c9db

fix nf-core linting errors

1ce65b2

removed fastqc

8ef79b8

Merge pull request #93 from nf-core/nf-core-template-merge-3.3.2

1355f46

Important! Template update for nf-core/tools v3.3.2

Merge branch 'dev' into dev

17905be

fix for ro-crate

b71eee5

awgymer requested changes May 15, 2026

View reviewed changes

an-altosian added 2 commits May 15, 2026 16:45

an-altosian mentioned this pull request May 15, 2026

fix(modules): address awgymer review comments on PR #139 (strict nf-core compliance) #158

Merged

10 tasks

an-altosian and others added 21 commits May 15, 2026 17:44

ci: retrigger — previous run hit 'no space left on device' on runner …

c8c7094

…pulling xeniumranger image (infra-only, not code)

Revert: restore disk=xlarge (image=noble label invalid, broke all 43 …

2333c0f

…runs)

Merge pull request #158 from an-altosian/worktree-fix-pr139-module-pa…

5c81338

…rams-strict fix(modules): address awgymer review comments on PR #139 (strict nf-core compliance)

Merge pull request #159 from an-altosian/worktree-validate-bundle-aft…

e7a745b

…er-untar fix: validate Xenium bundle after UNTAR (drop test-profile skip-validation guard)

awgymer requested changes May 21, 2026

View reviewed changes

Comment thread workflows/spatialxe.nf Outdated

minor change in spatialxe.nf

62e5034

Merge pull request #160 from heylf/dev

2af78b1

fix: address review comments on PR #139

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release version 1.0.0#139

Release version 1.0.0#139
heylf wants to merge 435 commits into
masterfrom
dev

heylf commented Apr 28, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

heylf commented May 20, 2026

Uh oh!

awgymer left a comment •

edited

Loading

Uh oh!

Uh oh!

heylf commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

heylf commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR checklist

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

heylf commented May 20, 2026

Uh oh!

awgymer left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

heylf commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

heylf commented Apr 28, 2026 •

edited

Loading

awgymer left a comment •

edited

Loading