Bug #6586

Milestone #5333: Incorporate functionality of cdfcaf into JobSub

Milestone #6689: Incorporate initial Run II Functionality for v0.4

make sure SAM projects work for CDF

Added by Dennis Box about 5 years ago. Updated about 5 years ago.

Status: Closed
Priority: Normal
Assignee:
Category: -
Target version:
Start date: 07/01/2014
Due date: 07/30/2014
% Done: 100%
Estimated time:
First Occurred:
Occurs In:
Stakeholders:
Duration: 30

Description

Willis' immediate problem is an older, incompatible jobsub_tools installed on the server, but I want to watch this and look for other problems as well.
Dennis

Hi:
I tried to add a SAM dataset definition input to
jobsub_client, but it gave me errors. The submit
sequence is given below, and the log file $SLOGFILE
is attached. What should I try next?
-- Willis

setup ifdhc v1_3_2
export EXPERIMENT=cdf
export SAM_STATION=cdf-caf
export IFDH_BASE_URI='http://samweb.fnal.gov:8480/sam/cdf/api'

DSAMNAME="${SDSID}_${PBLOCK}_`date +%y%m%d%H`"
export SAM_DATASET $DSAMNAME
DATASET="cdf.dataset $SDSID and run_number $RRANGE"

ifdh createDefinition $DSAMNAME "$DATASET" $USER test >> $SLOGFILE 2>&1

SCRIPT='./run.sh'
STRIPMACRO='ZeeTrk.C'

gCMD="jobsub_submit.py \
-G cdf --resource-provides=usage_model=DEDICATED,OPPORTUNISTIC \
-g -M \
--dataset_definition=$DSAMNAME -N 1 \
file:///cdf/spool/willis/stntuple6if/Submit6/runif.sh \
$SCRIPT $STRIPMACRO $DSAMNAME"

get-cert
source ~cdfsoft/cdf2.shrc
setup jobsub_client v0_3_1_2
$gCMD >> $SLOGFILE 2>&1

edatSubmit.log

submitwToGrid [ Tue Jul 1 15:52:01 CDT 2014 ]
jobsub_submit.py -G cdf --resource-provides=usage_model=DEDICATED,OPPORTUNISTIC -g -M --dataset_definition=ze1s6d_0000_14070115 -N 1 file:///cdf/spool/willis/stntuple6if/Submit6/runif.sh ./run.sh ZeeTrk.C ze1s6d_0000_14070115
Server response code: 200
Response OUTPUT:
/fife/local/scratch/uploads/cdf/willis/2014-07-01_155202.472762_2570/runif.sh_20140701_155202_32680_1.dag

Response ERROR:
Traceback (most recent call last):

File "/fnal/ups/prd/jobsub_tools/v1_3_0/Linux-2/bin/jobsub", line 102, in <module>
settings.makeCondorFiles()
File "/fnal/ups/prd/jobsub_tools/v1_3_0/Linux-2/pylib/groupsettings/JobSettings.py", line 707, in makeCondorFiles
self.makeSAMBeginFiles()
File "/fnal/ups/prd/jobsub_tools/v1_3_0/Linux-2/pylib/groupsettings/JobSettings.py", line 720, in makeSAMBeginFiles
ifdh_setup=JobUtils().ifdhString()%settings['wn_ifdh_location']

TypeError: not enough arguments for format string

Remote Submission Processing Time: 0.453346014023 sec
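
The TypeError at the end of the traceback is the message Python's % operator raises when a format string contains more placeholders than the values supplied. The following is a minimal sketch of that failure mode only. The template is hypothetical: the string actually returned by JobUtils().ifdhString() is not shown in this ticket, so the two-placeholder shape and the sample values are assumptions made for illustration.

```python
# Hypothetical stand-in for the string returned by JobUtils().ifdhString();
# the real template is not shown in this ticket.
template = "location=%s version=%s"


def render(template, values):
    """Apply %-formatting and return either the result or the error text."""
    try:
        return template % values
    except TypeError as exc:
        return "TypeError: %s" % exc


# One value for two placeholders reproduces the message in the traceback.
one_value = render(template, "/hypothetical/ifdh/location")

# A tuple with one value per placeholder formats without error.
two_values = render(template, ("/hypothetical/ifdh/location", "v0"))

print(one_value)
print(two_values)
```

If the installed template and the calling code come from different releases, a mismatch of this kind is one plausible way to hit the error, which would be consistent with the older jobsub_tools on the server being the immediate problem.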

History

#1 Updated by Dennis Box about 5 years ago

  • Description updated

I found a bug in the generated DAG begin node: according to the error message, it fails because it can't find ifdh.
I need to release a new jobsub_tools to fix this; it shouldn't take long.
Dennis

On 7/2/14 11:26 AM, Willis Sakumoto wrote:

Hi Neha, Dennis, and Joe:

Thanks. However, things are not yet working as
expected. The SAM file job got started, but failed
immediately. Attached is the jobsub_fetchlog.py output,
and below is the sambegin error file. I tried to email
the job output, but it was banned:
"Microsoft Forefront Protection for Exchange
Server has detected a file filter match.
Filter name: Banned File Attachments: .cmd".
I'm amused that it did not complain about .sh.
Dennis and Joe - for the output .tgz file, go to
fcdflnx6:/cdf/spool/willis/stntuple6if/Submit6/zoutput

-- Willis

sambegin-runif.sh_20140702_094906_15096.err:
+ EXPERIMENT=cdf
+ DEFN=ze1s6d_0000_14070209
+ PRJ_NAME=willis-runif.sh_20140702_094906_15096
+ SAM_USER=willis
+ cat
+ chmod +x /local/stage1/disk3/dir_11620/glide_a11692/execute/dir_15710/ifdh.sh
export IFDH_BASE_URI=http://samweb.fnal.gov:8480/sam/cdf/api
+ IFDH_BASE_URI=http://samweb.fnal.gov:8480/sam/cdf/api
+ ifdh describeDefinition ze1s6d_0000_14070209
/local/stage1/disk3/dir_11620/glide_a11692/execute/dir_15710/condor_exec.exe: line 31: ifdh: command not found
+ '[' '' = '' ']'
+ SAM_STATION=cdf
+ ifdh startProject willis-runif.sh_20140702_094906_15096 cdf ze1s6d_0000_14070209 willis cdf

/local/stage1/disk3/dir_11620/glide_a11692/execute/dir_15710/condor_exec.exe: line 35: ifdh: command not found
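
The "command not found" lines mean the shell on the worker node could not locate an ifdh executable in any directory on its PATH when the begin node ran. As a rough sketch of how such a condition can be detected before any ifdh call is attempted, the check below uses Python's standard shutil.which. The helper names find_tool and describe are hypothetical and are not part of jobsub_tools or ifdhc.

```python
import shutil


def find_tool(name, path=None):
    """Return the full path of `name` if it is on the search path, else None.

    `path` overrides the PATH environment variable when given.
    """
    return shutil.which(name, path=path)


def describe(name, path=None):
    """Summarize whether `name` can be found, for logging before use."""
    location = find_tool(name, path)
    if location is None:
        return "%s: command not found on PATH" % name
    return "%s found at %s" % (name, location)


# Report whether ifdh is visible in the current environment.
print(describe("ifdh"))
```

Failing early with a message like this, rather than continuing to startProject, would make a missing ifdh setup easier to spot in the sambegin error file.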

On Tue, 1 Jul 2014, Neha Sharma wrote:

Hi Willis

The setup of the new production cluster is now complete!
Also, as per request from Dennis, jobsub_tools has been updated

[root@fifebatch2 ~]# ups list jobsub_tools

DATABASE=/fnal/ups/db
Product=jobsub_tools Version=v1_3_1_1 Flavor=Linux+2
Qualifiers="" Chain=current

[root@fifebatch2 ~]#

Please try now and let me know how it goes

-Neha

On Jul 1, 2014, at 4:26 PM, Willis Sakumoto wrote:

Hi:

Thanks for the quick feedback. I'll wait until I hear
from Neha. (I don't know if the job went to fifebatch.fnal.gov).

-- Willis

On Tue, 1 Jul 2014, Joe Boyd wrote:

Neha is in the middle of reinstalling that cluster too. You may have hit a problem there.

Neha, let Willis know when it's all clear too please.

joe

On 07/01/2014 04:07 PM, Dennis Box wrote:

Hi Willis,
This was fifebatch.fnal.gov, right? The version of jobsub_tools
installed on the server is old and incompatible.
I will get Neha to upgrade it, and open a 'make sure SAM works for CDF'
ticket for the jobsub project.
Dennis
On 7/1/14 3:54 PM, Willis Sakumoto wrote:

Hi:
I tried to add a SAM dataset definition input to
jobsub_client, but it gave me errors. The submit
sequence is given below, and the log file $SLOGFILE
is attached. What should I try next?
-- Willis
setup ifdhc v1_3_2
export EXPERIMENT=cdf
export SAM_STATION=cdf-caf
export IFDH_BASE_URI='http://samweb.fnal.gov:8480/sam/cdf/api'
DSAMNAME="${SDSID}_${PBLOCK}_`date +%y%m%d%H`"
export SAM_DATASET $DSAMNAME
DATASET="cdf.dataset $SDSID and run_number $RRANGE"
ifdh createDefinition $DSAMNAME "$DATASET" $USER test >> $SLOGFILE 2>&1
SCRIPT='./run.sh'
STRIPMACRO='ZeeTrk.C'
gCMD="jobsub_submit.py \
-G cdf --resource-provides=usage_model=DEDICATED,OPPORTUNISTIC \
-g -M \
--dataset_definition=$DSAMNAME -N 1 \
file:///cdf/spool/willis/stntuple6if/Submit6/runif.sh \
$SCRIPT $STRIPMACRO $DSAMNAME"
get-cert
source ~cdfsoft/cdf2.shrc
setup jobsub_client v0_3_1_2
$gCMD >> $SLOGFILE 2>&1
edatSubmit.log
submitwToGrid [ Tue Jul 1 15:52:01 CDT 2014 ]
jobsub_submit.py -G cdf --resource-provides=usage_model=DEDICATED,OPPORTUNISTIC -g -M --dataset_definition=ze1s6d_0000_14070115 -N 1 file:///cdf/spool/willis/stntuple6if/Submit6/runif.sh ./run.sh ZeeTrk.C ze1s6d_0000_14070115
Server response code: 200
Response OUTPUT:
/fife/local/scratch/uploads/cdf/willis/2014-07-01_155202.472762_2570/runif.sh_20140701_155202_32680_1.dag
Response ERROR:
Traceback (most recent call last):

File "/fnal/ups/prd/jobsub_tools/v1_3_0/Linux-2/bin/jobsub", line 102, in <module>

settings.makeCondorFiles()

File "/fnal/ups/prd/jobsub_tools/v1_3_0/Linux-2/pylib/groupsettings/JobSettings.py", line 707, in makeCondorFiles

self.makeSAMBeginFiles()

File "/fnal/ups/prd/jobsub_tools/v1_3_0/Linux-2/pylib/groupsettings/JobSettings.py", line 720, in makeSAMBeginFiles

ifdh_setup=JobUtils().ifdhString()%settings['wn_ifdh_location']
TypeError: not enough arguments for format string
Remote Submission Processing Time: 0.453346014023 sec

#2 Updated by Parag Mhashilkar about 5 years ago

  • Status changed from New to Resolved

#3 Updated by Parag Mhashilkar about 5 years ago

  • Parent task set to #6689

#4 Updated by Parag Mhashilkar about 5 years ago

  • % Done changed from 0 to 100

#5 Updated by Parag Mhashilkar about 5 years ago

  • Status changed from Resolved to Closed

#6 Updated by Parag Mhashilkar about 5 years ago

  • Due date set to 07/30/2014

