[arvados] About a SLURM job submitted through the container API

Yongjian Guo guoyj2003 at gmail.com
Sat Sep 9 23:02:51 EDT 2017


Hi, Everyone,

When I submit a job through the container API, I see the following
output on the compute node:

zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:36.883643677Z Executing container 'zzzzz-dz642-18tx6hpy1j48oz2'
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:36.883706265Z Executing on host 'compute8'
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:36.903586911Z Fetching Docker image from collection '62b8bd1d1a65c04571d0aece47172b07+342'
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:36.919914767Z Using Docker image id 'sha256:6c4e1452f1a36c2b8255fba7097147aff739f98cd05a8c82f9214df4d3dc801d'
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:36.920946105Z Loading Docker image from keep
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:52.357768811Z While loading container image: While loading container image into Docker: error during connect: Post http://%2Fvar%2Frun%2Fdocker.sock/v1.21/images/load?quiet=0: unexpected EOF
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:52.357810953Z Cancelled
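
To see where the time goes in a log like this, the timestamps can be compared step by step. The following is only a rough sketch: the regular expression and the helper names (parse_line, step_gaps) are my own, and the layout "<container uuid> <timestamp> <message>" is assumed from the output above, with each entry joined onto a single line.

```python
import re
from datetime import datetime

# Two entries copied from the output above, each joined onto one line.
LOG = """\
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:36.920946105Z Loading Docker image from keep
zzzzz-dz642-18tx6hpy1j48oz2 2017-09-10T02:45:52.357768811Z While loading container image: While loading container image into Docker: error during connect: Post http://%2Fvar%2Frun%2Fdocker.sock/v1.21/images/load?quiet=0: unexpected EOF
"""

# Assumed layout: "<uuid> <YYYY-MM-DDTHH:MM:SS>.<fraction>Z <message>"
LINE_RE = re.compile(r"^(\S+) (\d{4}-\d\d-\d\dT\d\d:\d\d:\d\d)\.(\d+)Z (.*)$")


def parse_line(line):
    """Return (uuid, timestamp, message), or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    uuid, stamp, frac, message = m.groups()
    # The log has nanoseconds; datetime only keeps microseconds, so truncate.
    micros = int(frac[:6].ljust(6, "0"))
    ts = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%S").replace(microsecond=micros)
    return uuid, ts, message


def step_gaps(text):
    """Return (seconds until the next entry, message) for each entry but the last."""
    entries = [e for e in map(parse_line, text.splitlines()) if e is not None]
    return [((b[1] - a[1]).total_seconds(), a[2]) for a, b in zip(entries, entries[1:])]


if __name__ == "__main__":
    for seconds, message in step_gaps(LOG):
        print(f"{seconds:10.6f}s  {message}")
```

For the two entries used here, this reports a gap of about 15.4 seconds between "Loading Docker image from keep" and the error line, so the load runs for a while before the connection to Docker ends.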

It looks like the compute node is trying to load the Docker image
(arvados/job) but does not succeed. The configuration on my compute node
is probably wrong. Could you give me more information on how to fix this
problem?
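
One basic check on the compute node is whether a Docker daemon answers on the Unix socket named in the error. The sketch below assumes the socket path /var/run/docker.sock (decoded from the URL in the log) and uses Docker's /_ping endpoint; the function name docker_ping is made up for this example. It only tests reachability and does not reproduce the image load itself.

```python
import socket


def docker_ping(sock_path="/var/run/docker.sock", timeout=5.0):
    """Return True if something answers GET /_ping with HTTP 200 on the socket."""
    # HTTP/1.0 so that the server closes the connection after replying.
    request = b"GET /_ping HTTP/1.0\r\nHost: docker\r\n\r\n"
    try:
        with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as s:
            s.settimeout(timeout)
            s.connect(sock_path)
            s.sendall(request)
            reply = b""
            while True:
                chunk = s.recv(4096)
                if not chunk:
                    break
                reply += chunk
    except OSError:
        # Missing socket, permission denied, timeout, refused connection, etc.
        return False
    status_line = reply.split(b"\r\n", 1)[0].split()
    return len(status_line) >= 2 and status_line[1] == b"200"


if __name__ == "__main__":
    print("docker reachable:", docker_ping())
```

A False result would point at the daemon or at permissions on the socket. A True result only shows that the daemon is up, so the failure during the load would still need to be looked for elsewhere, for example in the Docker daemon's own log.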

Also, is it trying to load the image from Keep? How can I check whether the
Keep storage has been mounted?
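
For the mount question, one generic approach is to list the FUSE mounts on the node. This is a sketch under assumptions: it assumes Keep would be exposed as a FUSE mount (as arv-mount does) and that the node is Linux with /proc/mounts. Whether the image load needs such a mount at all is not something the log tells me. The helper name fuse_mounts, the sample device name and the path /mnt/keep are hypothetical.

```python
def fuse_mounts(mounts_text):
    """Return (device, mount point, fs type) for FUSE entries in /proc/mounts text."""
    found = []
    for line in mounts_text.splitlines():
        fields = line.split()
        # /proc/mounts fields: device, mount point, fs type, options, dump, pass
        if len(fields) >= 3 and (fields[2] == "fuse" or fields[2].startswith("fuse.")):
            found.append((fields[0], fields[1], fields[2]))
    return found


# Hypothetical sample; on a real node, read the text from /proc/mounts instead.
SAMPLE = """\
/dev/sda1 / ext4 rw,relatime 0 0
fusectl /sys/fs/fuse/connections fusectl rw,relatime 0 0
keepmount /mnt/keep fuse rw,nosuid,nodev,relatime,user_id=0,group_id=0 0 0
"""

if __name__ == "__main__":
    for device, mount_point, fs_type in fuse_mounts(SAMPLE):
        print(device, mount_point, fs_type)
```

On the compute node itself, passing open("/proc/mounts").read() to fuse_mounts would show whether any FUSE file system is mounted and where.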

Thanks a lot.

Jason