NL Cloud Monitor Instructions

From Atlas Wiki
Jump to navigation Jump to search

Introduction

This page will give a step-by-step instruction for the shifters (of the ATLAS NL-cloud regional operation) to check through several key monitoring pages used by Atlas Distributed Computing (ADC). Those key monitoring pages were also monitored by official ADC shifters (e.g. ADCoS, DAST).

The general architecture of ADC operation is shown on the right.

General architecture of the ADC operation

The shifters that we are concerning here is part of the "regional operation team". The contribution will be credited by OTSMU.

Things to monitor

Follow the instructions below for checking different monitoring pages and notify the NL cloud squad team accordingly via adc-nl-cloud-support@nikhef.nl.


ADCoS eLog

ADCoS eLog is mainly used by ADC experts and ADCoS shifters to log the actions taken on a site concerning a site issues. For example, removing/adding site from/into the ATLAS production system. The shifter has to notify the squad team if there are issues not being followed up for a long while (~24 hours).

The eLog entries related to NL-cloud can be found here.

DDM Dashboard

DDM Dashboard is used for monitoring the data transfer activities between sites.

The main

Panda Monitor (Production)

Panda Monitor (Analysis)

GangaRobot

Shifters' calendar

Useful links