Project

General

Profile

Bug #9976

jobsub_q prints wrong info, no error diagnostic

Added by Andrei Gaponenko almost 4 years ago. Updated over 3 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Category:
-
Target version:
Start date:
08/29/2015
Due date:
% Done:

0%

Estimated time:
First Occurred:
Occurs In:
Stakeholders:

rlc@fnal.gov,
kutschke@fnal.gov,
gandr@fnal.gov

Duration:

Description

As mentioned by Ray in INC000000592035, jobsub_q output can miss information from a server. If a server can not be contacted, it must print a warning or error message, and the exit status should be non-zero. I was deceived by jobsub_q output into believing that all my jobs have completed, while in fact it just failed to show running jobs.

See below for an example of how to reproduce.
Andrei

mu2egpvm05 ~$ while true; do jobsub_q > tmp.txt; st=$?; grep -q fifebatch2 tmp.txt && echo worked st=$st || echo broke st=$st; sleep 10; done

worked st=0
broke st=0
worked st=0
worked st=0
worked st=0
worked st=0
worked st=0
worked st=0
worked st=0
worked st=0
worked st=0
broke st=0

History

#1 Updated by Dennis Box over 3 years ago

  • Target version set to v1.1.9

#2 Updated by Dennis Box over 3 years ago

  • Target version changed from v1.1.9 to v1.2

#3 Updated by Dennis Box over 3 years ago

  • Status changed from New to Feedback

fix checked in to branch 9975 in git

#4 Updated by Dennis Box over 3 years ago

doh branch 9976. Have been testing it on fifebatch-dev, killing a schedd manually and querying it with jobsub_q. The problem with condor_q -g is that it exits with the status of the last schedd that it checks, and I am not sure how it determines the order. Jobsub_q on the server now checks each schedd in turn with condor_q -name 'schedd_name' and throws and exits with non-zero status if any of them are bad.

#5 Updated by Dennis Box over 3 years ago

  • Target version changed from v1.2 to v1.1.9.1

#6 Updated by Dennis Box over 3 years ago

  • Status changed from Feedback to Closed


Also available in: Atom PDF