Project

General

Profile

Bug #7484

non-unique piilot names at T1_UK_RAL

Added by Brian Bockelman about 5 years ago. Updated about 5 years ago.

Status:
Closed
Priority:
Urgent
Assignee:
Parag Mhashilkar
Category:
-
Target version:
Start date:
12/10/2014
Due date:
% Done:

0%

Estimated time:
First Occurred:
Occurs In:
Stakeholders:
Duration:

Description

T1_UK_RAL uses PID namespaces, meaning that all jobs start as PID 1.

Accordingly, the pilots all identify themselves as "glidein_58@$HOSTNAME" (as the condor_master or whatnot is the 58th process to launch).

This causes all sorts of havoc - effectively, only one pilot per hostname can be registered in the collector as they all overwrite each other's Name in the machine ad.

Can we change the glidein name to be a short randomly-generated string (unique) instead of the PID (not unique)?

This likely is fairly urgent for RAL.

History

#1 Updated by Parag Mhashilkar about 5 years ago

  • Status changed from New to Assigned
  • Assignee set to Parag Mhashilkar
  • Target version set to v3_2_8

#2 Updated by Parag Mhashilkar about 5 years ago

  • Status changed from Assigned to Feedback
  • Assignee changed from Parag Mhashilkar to Marco Mambelli

Changes are in v3/7484. Please review

#3 Updated by Marco Mambelli about 5 years ago

  • Assignee changed from Marco Mambelli to Parag Mhashilkar

See email for feedback

#4 Updated by Parag Mhashilkar about 5 years ago

  • Status changed from Feedback to Resolved

Merged

#5 Updated by Parag Mhashilkar about 5 years ago

  • Status changed from Resolved to Closed


Also available in: Atom PDF