Difference between revisions of "NDPF TODO List"
Line 52: | Line 52: | ||
zie [ http://goc.grid.sinica.edu.tw/gocwiki/R-GMA_server_upgrade_-_Patch_%23530 GOC wiki page] | zie [ http://goc.grid.sinica.edu.tw/gocwiki/R-GMA_server_upgrade_-_Patch_%23530 GOC wiki page] | ||
− | == Stuff that Groep | + | == Stuff that Groep needs to share with the group == |
this is a list of things that only David seems to know about. If you run into a problem during his absence and the answer is "David knows this, wait til he is back" then write it down here: | this is a list of things that only David seems to know about. If you run into a problem during his absence and the answer is "David knows this, wait til he is back" then write it down here: | ||
* SMS gateway: how to add new services | * SMS gateway: how to add new services |
Revision as of 13:58, 13 June 2007
Solve quota problems on bosheks
Message 198: From root@bosheks.nikhef.nl Mon Sep 18 12:28:09 2006 Date: Mon, 18 Sep 2006 12:28:08 +0200 From: root@bosheks.nikhef.nl (Cron Daemon) To: root@bosheks.nikhef.nl Subject: Cron <root@bosheks> quota -v dzero004 | mail -s 'quota on bosheks' a03@nikhef.nl X-Cron-Env: <SHELL=/bin/sh> X-Cron-Env: <HOME=/root> X-Cron-Env: <PATH=/usr/bin:/bin> X-Cron-Env: <LOGNAME=root> X-Cron-Env: <USER=root> quota: Quota file not found or has wrong format.
SARA VOMS server certs needed
These have been added by hand, but should be installed automatically by quattor.
Orphan process watchdog
Need cron job on WNs that check for orphaned processes and kill them. The idea would be to determine which pool accounts on the node have valid jobs running on the node; processes found for any other pool account would be killed.
Create fixed append-only CentOS mirror for NDPF
CentOS Updates, like their RHEL counterparts, roll over a few weeks after every release, so that the old RPMs are no longer available. The CentOS repository should be mirrored on stal locally in a way that this does not happen.
Check pool accounts
Apparently things can go wrong if we have e.g.
dteamsm01
and
dteam001
as pool accounts for 'dteamsm' and 'dteam' ... because 'dteamsm01' is a valid pool account for .dteam. Check and repair.
VOBOX installation
VObox pool account support.
Update of Resource Broker
R-GMA GIN Update
ganglia monitoring multicast
ganglia does not yes work across the various subnet due to some off multicast problems (although deel has the proper "router pim" and other magic statement). Need to investigate on deel and monitor some of the multicast traffic.
R-GMA updates
zie [ http://goc.grid.sinica.edu.tw/gocwiki/R-GMA_server_upgrade_-_Patch_%23530 GOC wiki page]
this is a list of things that only David seems to know about. If you run into a problem during his absence and the answer is "David knows this, wait til he is back" then write it down here:
- SMS gateway: how to add new services