[ARVADOS] created: bf207e24d447248b90c25cfdb77a82e85a1fb02c

git at public.curoverse.com git at public.curoverse.com
Mon Mar 23 14:05:19 EDT 2015


        at  bf207e24d447248b90c25cfdb77a82e85a1fb02c (commit)


commit bf207e24d447248b90c25cfdb77a82e85a1fb02c
Author: Peter Amstutz <peter.amstutz at curoverse.com>
Date:   Mon Mar 23 14:08:58 2015 -0400

    5524: Match magic string arvados.errors.Keep as likely part of an exception
    backtrace and mark task as temporary failure.

diff --git a/sdk/cli/bin/crunch-job b/sdk/cli/bin/crunch-job
index ea9a987..294696c 100755
--- a/sdk/cli/bin/crunch-job
+++ b/sdk/cli/bin/crunch-job
@@ -1235,7 +1235,7 @@ sub preprocess_stderr
       # whoa.
       $main::please_freeze = 1;
     }
-    elsif ($line =~ /srun: error: (Node failure on|Unable to create job step|.*: Communication connection failure)/) {
+    elsif ($line =~ /(srun: error: (Node failure on|Unable to create job step|.*: Communication connection failure))|arvados.errors.Keep/) {
       $jobstep[$job]->{node_fail} = 1;
       ban_node_by_slot($jobstep[$job]->{slotindex});
     }

-----------------------------------------------------------------------


hooks/post-receive
-- 




More information about the arvados-commits mailing list