Support #9384

Create check_mk monitor script for CVMFS Stratum-1

Added by Anthony Tiradani over 4 years ago.

Status: New
Priority: Normal
Assignee:
Start date: 07/07/2015
Due date:
% Done: 0%
Estimated time:
component: base
Scope: Internal
Experiment: -
Stakeholders:
Co-Assignees:
Categorization: -
Duration:

Description

From Dave Dykstra:

Hi Tony,

While making plans for the full CVMFS stratum 1 monitoring for all
stratum 1s, it has come to my attention that there's a semi-decent
temporary way to monitor for common problems using any external
monitoring system. You can do

    wget --timeout=10 -dqO/dev/null http://cvmfs.fnal.gov:8000/cvmfs/<reponame>/.cvmfspublished 2>&1 | grep Last-Modified

for every repository and look for a timestamp that is older than, say, 4
hours. That Last-Modified time is updated every time a snapshot command
runs successfully. I suggest making it go into a Warning level if it is
between 4 and 24 hours, and go Critical if it is more than 24 hours or
missing, or use whatever time parameters you prefer. Can you make use
of that in your alarm system?
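
Note: the check described above could be sketched as a check_mk local check roughly as follows. This is only an illustration under stated assumptions, not an agreed implementation: the function names, the CVMFS_snapshot_ service-name prefix, and passing repository names as command-line arguments are all hypothetical, and parsing the HTTP date relies on GNU date. The wget invocation and the 4-hour and 24-hour thresholds are taken from the message.

```shell
#!/bin/bash
# Sketch of a check_mk local check for CVMFS stratum 1 snapshot freshness.
# Assumed, not from the ticket: function names, service-name prefix,
# repositories given as arguments, GNU date for parsing the HTTP date.

WARN_SECS=$((4 * 3600))    # Warning when the snapshot is 4 to 24 hours old
CRIT_SECS=$((24 * 3600))   # Critical when older than 24 hours or missing
BASE_URL="http://cvmfs.fnal.gov:8000/cvmfs"

# Map an age in seconds (empty string = header missing) to a state:
# 0 = OK, 1 = WARN, 2 = CRIT.
classify_age() {
    local age="$1"
    if [ -z "$age" ] || [ "$age" -gt "$CRIT_SECS" ]; then
        echo 2
    elif [ "$age" -ge "$WARN_SECS" ]; then
        echo 1
    else
        echo 0
    fi
}

# Print the Last-Modified value for one repository, or nothing on failure.
last_modified() {
    wget --timeout=10 -dqO/dev/null "$BASE_URL/$1/.cvmfspublished" 2>&1 |
        sed -n 's/^Last-Modified: *//p' | tr -d '\r' | head -n 1
}

# Emit one local-check line: <state> <service name> <perfdata> <text>.
check_repo() {
    local repo="$1" stamp mtime age="" state perf="-"
    stamp=$(last_modified "$repo")
    if [ -n "$stamp" ] && mtime=$(date -d "$stamp" +%s 2>/dev/null); then
        age=$(( $(date +%s) - mtime ))
        perf="age=$age"
    fi
    state=$(classify_age "$age")
    echo "$state CVMFS_snapshot_$repo $perf Last-Modified: ${stamp:-missing}"
}

# Check every repository named on the command line (none given = no output).
for repo in "$@"; do
    check_repo "$repo"
done
```

Run with the repository names as arguments, it prints one status line per repository. The thresholds are plain variables at the top so that other time parameters can be substituted.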

For the list of repositories, use the list you have in puppet and add
all those printed by running this script:
https://raw.githubusercontent.com/DrDaveD/cvmfs-hastratum1/master/print_osg_repos

Dave
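
Note: combining the puppet repository list with the output of print_osg_repos could look something like the sketch below. It assumes each list has been saved to a file with one repository name per line; the ticket does not confirm the output format of print_osg_repos, and the helper name merge_repo_lists and the file names in the usage line are hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: merge repository lists given as files, assuming one
# name per line. Strips '#' comments, whitespace, and blank lines, then
# drops duplicates.
merge_repo_lists() {
    cat "$@" | sed -e 's/#.*//' -e 's/[[:space:]]//g' -e '/^$/d' | sort -u
}

# Example usage (file names are placeholders):
#   merge_repo_lists puppet_repos.txt osg_repos.txt
```

The merged output can then be fed, one name at a time, to whatever per-repository check is used.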