ADC Operation NL
Introduction
This page is summarizing the current setup/configuration of the NL cloud of the ATLAS Distributed Computing (ADC).
Apart from that, it also logs the used-to-happen issues during the day-to-day operation works.
Sites
-
combined Tier1 center:
Country | Institute | GOC site name |
---|---|---|
Netherlands | NIKHEF | NIKHEF-ELPROD |
Netherlands | SARA | SARA-MATRIX |
Country | Institute | GOC site name |
---|---|---|
Russia | ITEP | ITEP |
Russia | IHEP | RU-PROTVINO-IHEP |
Russia | JINR | JINR-LCG2 |
Russia | PNPI | RU-PNPI |
Russia | SINP | SINP |
Russia | RRC-KI | RRC-KI |
Ireland | CST | CSTCDIE |
Turkey | ULAKBIM | TR-10-ULAKBIM |
Data Management Services
- Storage Resource Manager v2.2: deployed on each site as common interface to storage elements
- gLite File Transfer Service (FTS): deployed at SARA serving the T1-T1 and T1-T2 data transfers to the sites of the NL cloud
- LCG File Catalog Service (LFC): deployed at SARA for cataloging grid files stored on the sites of the NL cloud
Data locations among SARA and NIKHEF
SRMv2 space tokens
Monitoring pages
- NL cloud storage usage summary
- Data transfers (T0-T1, T1-T1) in last 24 hrs
- Data transfers (T1-T2) in last 24 hrs
- active PanDA tasks in NL cloud
- specific storage usage of essential space tokens at NL T1
* NIKHEF-ELPROD_DATADISK usage * NIKHEF-ELPROD_ROFVAL usage * SARA-MATRIX_DATADISK usage * SARA-MATRIX_MCDISK usage
More information
- ADC eLog entries concerning NL cloud
- DDM operation wiki
- DDM browser
Daily operation logs
Date | Actions | Remarks |
---|---|---|
9 Oct. 2008 | requests cosmic and 1beam reprocessed ESD to NIKHEF-ELPROD_DATADISK | needed by NIKHEF physics group |
10 Oct. 2008 | NL Tier2s become on-line again for MC production | FTS performance issue at SARA fixed |
Trouble shooting logs
Sept. 2008 - huge transfer backlog from T2s to SARA
Problem fixed by running the FTS admin tool to slim down the FTS job history db table.
Sept. 2008 - SRM request timeout reading data from SARA
A cron job fixing orphan file issue of dCache loads PNFS server so it was stopped. Also observe a broken network switch involving 4 new dCache node. The problematic network switch has been replaced. A broken dCache node is also replaced.