Project

General

Profile

Bug #22623

Print collector name when reconfig fails because of failed communication

Added by Marco Mascheroni 3 months ago.

Status:
New
Priority:
Normal
Category:
Frontend
Target version:
Start date:
05/23/2019
Due date:
% Done:

0%

Estimated time:
First Occurred:
Occurs In:
Stakeholders:
Duration:

Description

This is the error message when one of the factory collector is down:

[mmascher@vocms080 global]$ sudo -u frontend /sbin/gwms-frontend reconfig
Using default Frontend config file: /etc/gwms-frontend/frontend.xml
cmsgwms-factory.fnal.gov
Traceback (most recent call last):
  File "/sbin/reconfig_frontend", line 183, in <module>
    msg = check_config_frontend.main(xml)
  File "/usr/lib/python2.7/site-packages/glideinwms/creation/lib/check_config_frontend.py", line 88, in main
    f_version = get_factory_version(fc)
  File "/usr/lib/python2.7/site-packages/glideinwms/creation/lib/check_config_frontend.py", line 27, in get_factory_version
    results = collector.query(adtype, constraint, ['GlideinWMSVersion'])
IOError: Failed communication with collector.

We should at least print the hostname of the collector so the operator knows where to look.



Also available in: Atom PDF