Project

General

Profile

Bug #21566

INC000001007918 jobsub dev release multiple issues

Added by Nikolay Kuropatkin 7 months ago. Updated 6 months ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
-
Target version:
-
Start date:
12/17/2018
Due date:
% Done:

0%

Estimated time:
First Occurred:
Occurs In:
Stakeholders:
Duration:

Description

I ran a test job this morning as CDF on the jobsub dev release. I see two different problems i think. Files are here:

-bash-4.1$ pwd
/
-bash-4.1$ hostname
jobsubdevgpvm01.fnal.gov

The err file says:

-bash-4.1$ cat sleep_job.sh_20181217_094248_3218084_1_1_cluster.729.0.err
./condor_exec.exe: line 218: /fife/local/scratch/uploads/cdf/boyd/2018-12-17_094247.332164_1937/sleep_job.sh: No such file or directory
./condor_exec.exe: line 239: unexpected EOF while looking for matching `"'
./condor_exec.exe: line 242: syntax error: unexpected end of file

So I think there are two problems here. One is that it seems to be trying to run the user shell script at the path from the head node instead of the path where it sits on the worker node. The .out below says the job is executing in /storage/local/data1/condor/execute/dir_17483/no_xfer/work but in the .err above it's trying to execute it at the location on the head node.

Also, I believe there is an unmatched double quote as noted in the .err file. One of the last lines in the _wrap file is:

exiting with status $cpy_stat"

I'm not sure where that matching quote is.

-bash-4.1$ cat sleep_job.sh_20181217_094248_3218084_1_1_cluster.729.0.out
executing in directory
/storage/local/data1/condor/execute/dir_17483/no_xfer/work
executing: /fife/local/scratch/uploads/cdf/boyd/2018-12-17_094247.332164_1937/sleep_job.sh -f 15 -s 5 -e cdf
job_1.err
job_1.out
Mon Dec 17 15:55:50 UTC 2018
executing:/storage/local/data1/condor/execute/dir_17483/jsb_tmp/ifdh.sh cp jobsub_cdf_output.tgz /pnfs/cdf/scratch/boyd/jobsubdevgpvm01.fnal.gov_1-727.tgz

exiting with status $cpy_stat"


Related issues

Related to JobSub - Bug #21572: Extra quote in CdfSettings.py causing ALL cdf jobs to failClosed2018-12-21

History

#1 Updated by Shreyas Bhat 7 months ago

  • Related to Bug #21572: Extra quote in CdfSettings.py causing ALL cdf jobs to fail added

#2 Updated by Dennis Box 6 months ago

  • Status changed from New to Resolved

fixes have been tested and will be included in the next release



Also available in: Atom PDF