Skip to content

Report on running #438 usecases with datalad 0.13.0rc1 and "local" resource #511

@yarikoptic

Description

@yarikoptic

This was an attempt to run with @kyleam 's version of the script with only minor tune ups, as available from the branch scratch/use-case-local here.
Command I had was

bash /home/yoh/proj/repronim/reproman/docs/usecases/bids-fmriprep-workflow-NP.sh try1 2>&1 | tee try1.log

and full log try1.log is now at http://www.onerussian.com/tmp/reproman-try1.txt (renamed).

1. The overall failure was due to

+ reproman run --follow -r local --sub local --orc datalad-pair --jp container=containers/bids-fmriprep --input data/bids --output data/fmriprep --bp pl=02,13 '{inputs}' '{outputs}' participant --participant_label '{p[pl]}' --fs-license-file=containers/licenses/freesurfer -w work
2020-05-07 18:14:13,964 [INFO] No root directory supplied for local; using '/home/yoh/.reproman/run-root'
[INFO] Publishing Dataset(/home/yoh/temp-test-reproman/try1) to local 
...
2020-05-07 18:14:14,954 [ERROR] 'datalad publish' failed. Try running 'datalad update -s local --merge --recursive' first [orchestrators.py:prepare_remote:793] (OrchestratorError) 
while it went just fine upon checking and retrying after crash
lena:~/temp-test-reproman
$> datalad status -d /home/yoh/temp-test-reproman/try1 
nothing to save, working tree clean

$> datalad siblings -d /home/yoh/temp-test-reproman/try1
.: here(+) [git]
.: local(+) [/home/yoh/.reproman/run-root/ac410208-90ae-11ea-889c-c3382a41054b (git)]

$> datalad status -d /home/yoh/.reproman/run-root/ac410208-90ae-11ea-889c-c3382a41054b
nothing to save, working tree clean

$> datalad publish -d /home/yoh/temp-test-reproman/try1 --to=local   

which is strange. I wish ERROR message provided a more informative message (i.e. how it failed, some stderr or returned record from datalad publish invocation if done in Python).

2. In the log you could see above this failure

2020-05-07 18:14:10,194 [INFO] Getting outputs from 'local'
2020-05-07 18:14:10,430 [WARNING] 1 subjob failed. Check files in /home/yoh/temp-test-reproman/try1/.reproman/jobs/local/20200507-181356-aeda/
2020-05-07 18:14:10,430 [INFO] 20200507-181356-aeda stderr: /home/yoh/temp-test-reproman/try1/.reproman/jobs/local/20200507-181356-aeda/stderr.0
2020-05-07 18:14:10,603 [INFO] Finished with remote resource 'local'
2020-05-07 18:14:10,604 [INFO] Unregistered job 20200507-181356-aeda

for running mriqc. so there is a WARNING but overall it seems it didn't cause the process to fail. @kyleam shouldn't overall run --follow fail if subjob fails?

trusting noone including myself I have tested that it is not somehow my demo script which should fail if individual command fails due to `set -e` on the top... it fails as expected with the following patch
(git)lena:~/proj/repronim/reproman[scratch/use-case-local]
$> git diff
diff --git a/docs/usecases/bids-fmriprep-workflow-NP.sh b/docs/usecases/bids-fmriprep-workflow-NP.sh
index 3ac4eca..c54473c 100755
--- a/docs/usecases/bids-fmriprep-workflow-NP.sh
+++ b/docs/usecases/bids-fmriprep-workflow-NP.sh
@@ -97,7 +97,8 @@ unknown_runner () {
 # Common invocation of ReproMan
 # TODO: just make it configurable per project/env?
 reproman_run () {
-    reproman run --follow -r "${RM_RESOURCE}" --sub "${RM_SUB}" --orc "${RM_ORC}" "$@"
+    /bin/ls bogus
+    # reproman run --follow -r "${RM_RESOURCE}" --sub "${RM_SUB}" --orc "${RM_ORC}" "$@"
 }

FTR, the content of that stderr file was

$> cat /home/yoh/temp-test-reproman/try1/.reproman/jobs/local/20200507-181356-aeda/stderr.0
ERROR  : Failed to mount squashfs image in (read only): Invalid argument
ABORT  : Retval = 255

my singularity-container from neurodebian is 2.6.1-2~nd100+1

DataLad 0.13.0rc1 WTF (dependencies, datalad) # WTF ## datalad - full_version: 0.13.0rc1 - version: 0.13.0rc1 ## dependencies - appdirs: 1.4.3 - boto: 2.49.0 - cmd:7z: 16.02 - cmd:annex: 7.20190708+git9-gfa3524b95-1~ndall+1 - cmd:bundled-git: 2.20.1 - cmd:git: 2.20.1 - cmd:system-git: 2.26.2 - cmd:system-ssh: 8.1p1 - exifread: 2.1.2 - git: 3.1.0 - gitdb: 4.0.2 - humanize: 2.3.0 - iso8601: 0.1.12 - keyring: 18.0.1 - keyrings.alt: 3.4.0 - msgpack: 0.6.2 - mutagen: 1.40.0 - requests: 2.23.0 - tqdm: 4.43.0 - wrapt: 1.11.2

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions