Project

General

Profile

DM - Expert Documentation » History » Version 65

Version 64 (Afroditi Papadopoulou, 12/28/2018 10:09 AM) → Version 65/72 (Michael Kirby, 11/20/2019 11:38 AM)

{{>toc}}

h1. DM - Expert Documentation

h2. Documentation!

... is finally initiated by David Caratelli !PUBS.pdf! :) https://www.overleaf.com/3384459hxbyns#/9541217/

For super-duper experts, if interested in, PUBS base framework documentation is on "DocDB 5400":http://microboone-docdb.fnal.gov:8080/cgi-bin/ShowDocument?docid=5400

Keep updated! Also attach the latest version to this Wiki.

h2. Starting the PUBS daemon running

*%{color:blue}The daemons should be restarted every Monday-Wednesday-Friday-Sunday.%*

Details are on this page. [[Starting the PUBS online daemon]]

h2. Moving all projects to a single online machine

Details are on this page. [[Running all PUBS projects on single server]]

h2. Building up the PUBS online testbed

Details are on this page. [[Building up the PUBS online testbed]]

h2. Mapping project name to names on GUI.

How do I find the project name (database table name) given the name of a specific box on the monitoring gui? [[Project GUI Map]]

h2. Changing the Database Configuration for Online PUBS [[Online PUBS Database Reconfig]]

h2. Correcting Errors in PUBS

* Querying DB for errors. [[DB Query]]

* Errors in *Metadata Generation* From Incomplete Files [[Correcting Failed Metadata Generation]]

* Failed *Near1 Binary Transfers* [[Correcting Failed Near1 Binary Transfer]]

* Errors in *Registering File Metadata* and crontab entries for kerberos tickets and grid proxies [[Correcting Failed Metadata Registration]]

h2. Expired Certificate on Near1 "Request OSG Production Service Certificate":https://cdcvs.fnal.gov/redmine/projects/uboonecode/wiki/CSR

h2. Running out of Disk Space?

* [[on ubdaq-prod-evb]]
* [[on near1 (/datalocal/)]]
* [[on sebXX (uB_DataMgmt_PCXX_seb06_data/disk_occ)]]

h2. [[What to do if dCache/enstore go down (no access to pnfs area)]]

h2. Collaborator has asked me, the DM expert, to prevent the deletion of one or more SN runs.

To prevent the deletion of one or more runs in the SN stream login as uboonepro. Head over the to the SN PUBS script directory located here /home/uboonepro/pubs/dstream_online/snova. Here you will find "frozen_runs.txt". In this file insert *new line separated* run numbers. The monitoring script will read this ASCII text file, and prevent the deletion of files in this text file.

h2. The daemon on ws02 has mysteriously died.

DM experts are currently debugging an issue related to the daemon on ws02 being killed by the kernel. If you are a DM expert on shift and you find the ws02 daemon has mysteriously died. Please execute the following command to copy the log files to a safe location, then please restart the daemon.
<pre>
mkdir -p /data/uboonepro/ws02_daemon_failures/`date +%D`; cp /home/uboonepro/pubs/log/ubdaq-prod-ws02.fnal.gov/* /data/uboonepro/ws02_daemon_failures/`date +%D`/
</pre>

Questions? "Ask Kirby":mailto:kirby@fnal.gov