Bug #1002

Exception when DISPLAY is not set

Added by Jarek Nowak over 9 years ago. Updated over 9 years ago.

Status: Closed
Priority: Normal
Category: Navigation
Target version:
Start date: 02/23/2011
Due date:
% Done: 100%
Estimated time:
Occurs In:
Scope: Internal
Experiment: -
SSI Package:
Duration:

Description

When I start a job in a session where DISPLAY is not set properly, I get an exception.

Example:
nova -c runrecodata.fcl -s ndos_r00011222_s00.raw-development.root
terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &(art::detail::translate(art::errors::ErrorCodes))>'
what(): ---- FatalRootError BEGIN
Fatal Root Error: @SUB=TUnixSystem::SetDisplay
DISPLAY not set, setting it to 131.225.174.137:0.0
---- FatalRootError END

Aborted (core dumped)

It works after I set this variable:
export DISPLAY=:0

It is problematic for running jobs on the grid.
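
As a stopgap for batch submission, the workaround above can be folded into the job script so that DISPLAY is only set when it is missing. This is a minimal sketch, not the fix applied in the framework: the function name `ensure_display` is hypothetical, the placeholder value `:0` is simply the one reported to work above, and whether a placeholder is sufficient on a worker node with no X server is not established in this ticket.

```shell
#!/bin/sh
# Hypothetical helper: give DISPLAY a placeholder value only when the
# environment does not already provide one.
ensure_display() {
    if [ -z "${DISPLAY:-}" ]; then
        # Assumption: ":0" is taken from the workaround in this report;
        # no X server is started or checked here.
        DISPLAY=":0"
    fi
    export DISPLAY
}

ensure_display
echo "DISPLAY=$DISPLAY"
```

An existing DISPLAY value is left untouched, so interactive sessions with X forwarding keep their real display.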

History

#1 Updated by Christopher Green over 9 years ago

  • Category set to Navigation
  • Status changed from New to Resolved
  • Assignee set to Christopher Green
  • Target version set to 0.5.0
  • % Done changed from 0 to 100

Fixed with a3c9b68

#2 Updated by Christopher Green over 9 years ago

  • Status changed from Resolved to Reviewed

#3 Updated by Christopher Green over 9 years ago

  • Status changed from Reviewed to Closed

#4 Updated by Jarek Nowak over 9 years ago

I still see the same problem. I've tried to run it with art v52a and v60 and I get the same error message.

command lines:

export PATH=$PATH:/nova/app/users/janow/scripts/
novaart_jobsub -r development nova -c datacheckjob.fcl -s /nova/data/novareco/NDOS/S11.03.28/reco/SubRuns/reco-r00011837-s00-t02-cosmic-S11.03.28.root /nova/data/condor-tmp/janow/batch/test.root

Error message:
more /nova/data/condor-tmp/janow/nova_20110401_100738_1.err
terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &(art::detail::translate(art::errors::ErrorCodes))>'
what(): ---- FatalRootError BEGIN
Fatal Root Error: @SUB=TSystem::ExpandFileName
input: $HOME/.root.mimes, output: $HOME/.root.mimes
---- FatalRootError END

/nova/app/condor-exec/janow/nova_20110401_100738_1_wrap.sh: line 49: 16237 Aborted (core dumped) nova ${args}

#5 Updated by Christopher Green over 9 years ago

Hi,

As far as I can tell, your latest report has only two things in common with the main issue on this ticket:
  1. They are exceptions originating with ROOT;
  2. They occurred during grid jobs.

This latest exception occurs because the HOME environment variable is not set in the grid environment. Something (maybe a .rootlogon macro or similar) is trying to tell ROOT about a .mimes file, and ROOT is unable to expand $HOME. In my opinion the framework is right to treat this as a fatal error: we cannot generalize about the consequences of this type of error, so it must be assumed to be fatal.

I think the solution for you going forward is to find a way to ensure that whatever macro is being invoked does not try to do things that make no sense in grid jobs; or if this is not possible, make sure that the HOME environment variable is set appropriately for grid jobs.
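
For the second option, a minimal sketch of what a job wrapper might do before launching the executable is shown here. The contents of the actual wrapper script are not shown in this ticket, so the function name `ensure_home` and the choice of fallback directory (TMPDIR, else /tmp) are assumptions for illustration only.

```shell
#!/bin/sh
# Hypothetical helper for a grid job wrapper: make sure HOME points at
# an existing directory before ROOT tries to expand $HOME.
ensure_home() {
    # Keep a HOME that is already set and valid.
    if [ -n "${HOME:-}" ] && [ -d "$HOME" ]; then
        return 0
    fi
    # Assumption: the job's scratch area is an acceptable stand-in
    # for a home directory on the worker node.
    HOME="${TMPDIR:-/tmp}"
    export HOME
}

ensure_home
echo "HOME=$HOME"
```

Calling something like this ahead of the `nova ${args}` line would give ROOT a path to expand. It does not address the first option, which is to stop the macro from looking for the .mimes file in grid jobs at all.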

Thanks,
Chris.


