Feature #4529
Create a tool to verify that DAQ nodes are running and accessible over Infiniband
Start date:
08/11/2013
Due date:
% Done:
0%
Estimated time:
Description
Request from Alessandro on 08-Aug: It would be helpful to have a small tool that verifies that all computers in the DAQ cluster are up, and that the IB connections between them are up and running. Initially, it is fine to have a command-line tool to do this. Ganglia or Nagios would be fine in the medium- to long-term.
History
#1 Updated by Kurt Biery over 7 years ago
If we could verify that MPI is working on all of the nodes, that would be a great addition.
#2 Updated by Kurt Biery almost 7 years ago
- Target version set to AdditionalFunctionality