NDPF Node Performance

From PDP/Grid Wiki
Jump to navigationJump to search

Node performance figures for the NDPF farm nodes

Note performance in the NDPF is internally expressen in GHzHr-equivalents, where a 1 GHzHrEquiv should be the integer performance of a single-core Pentium-3 processor with a clock frequency of 1GHz. To convert this into SpecInt-2000 numbers, a conversion factor of 410 has been applied: 1GHzHrEquiv corresponds to 410 SI2k, although this is a known understatement of the actual performance. Since this number has been hard-coded in all conversion script that publish accounting data to the outside world, it means that from now on the "official" SI2k ratings we get from vendors and our own tests must be converted to GHzHrEquiv values using this factor (410), and then be entered in the appropriate configuration file /etc/pbsaccdb.conf.

To convert from SpecInt2006 Rate base to SpecInt2000 numbers, multiply by a factor of 185. This factor was determined by comparing machine types for which both SpecInt2006 and SpecInt200 numbers were available.

As of 2006-09-01, the PRD facility contains 546 kSI2k

As of 2006-12-18, the PRD facility contains 785 kSI2k.

As of June 2008, the PRD facility will contain 2649 kSI2k including Halloween, or 2579 kSI2k excluding Halloween.

Node type: pizza0 (node18)

These system are dual-Pentium3 CPUs at 0.933 GHz, on an MSI motherboard. Equivalent systems rate at approximately 403-429 SI2k. The overhead incurred by running two simultaneous jobs has not been taken into account.

pbsaccdb factor: 416/410 = 1.015

0 (zero) systems of this type contribute 0 kSI2k.

Node type: AMDNCF (gfrc)

These systems are dual Athon MP2000+ systems. Equivalent systems rate at approx. 690 SI2k (the overhead incurred by running two simultaneous jobs has not been taken into account). Note that our own performance tests using the D0 MC application actually showed a true speed doubling compared to pizza0, so a factor of 2 would not have been unreasonable.

pbsaccdb factor: 690/410 = 1.68

0 (zero) systems of this type contribute 0 kSI2k.

Node type: Halloween (hall)

These systems are dual Xeon 2.8 GHz systems with 1MB L2 cache. Hyperthreading is not used. Equivalent systems rate at 1288 SI2k.

pbsaccdb factor: 1288/410 = 3.14

27 systems with 54 cores contribute 70 kSI2k.

Node type: Bulldozer (bull)

These systems are dual Xeon 3.2 GHz with 2 MB L2 cache. Hyperthreading is not used. Equivalent systems rate at 1555 (Dell PowerEdge 1850).

pbsaccdb factor: 1555/410 = 3.79

34 systems with 68 cores contribute 106 kSI2k.

HEP-SPEC06 information: result is 11.73 per box, meaning 5.87 per core.

Node type: Luilak-1 (lui1) and Luilak-2 (lui2)

These systems are Dell PowerEdge 1950 (Intel Xeon processor 5150, 2.66GHz) rated at 2764 when used with one process (specification by Dell, July 2006). With four simultaneous jobs, this degrades to 2240.

pbsaccdb factor: 2240/410 = 5.46

2*34 systems with 272 cores contribute 609 kSI2k.

HEP-SPEC06 results : 36.56 per box, meaning 9.14 per core.

Node type: Valentine

These systems are Supermicro X7DBE (Intel Woodcrest 5100, 2.5 GHz). The 102 nodes together are rated at 10077 SpecInt-2006 Rate base. This corresponds to 12.35 SpecInt2006 = 2285 SI2k = 5.57 GHz.hr per core.

pbsaccdb factor: 2285/410 = 5.57

102 systems with 816 cores contribute 10077 SpecInt-2006 = 1864 kSI2k.

HEP-SPEC06 results : 65.82 per box, meaning 8.23 per core.


Note on power consumption

For node wn-lui2-023 the following currents were measured under various conditions (OS: CentOS3.8/i386)

standby (off) 0.17A 39W
startup peak >1.50A 345W
setup screen or unloaded system 0.9 - 1.0A 230W
with 4x burnP6 and 4x "burnMMX P" 1.35A 310W
with continuous disk activity 1.1A 250W

The line voltage is approx. 229V (see mail "stroomgebruik grid resources" by wimh of 27-Sep-06 16:13).


Node type: Sint Maarten

These systems are HP BL460c G6 CTO Blade 160 systems (Intel Core-i7 L5520 processors). The 176 blades together are rated at 32560 SpecInt-2006 Rate base (using 16 processes, with 2 threads/core, though!). The use 8 processes without HT compared to 16 threads and HT results in an overall reduction of throughput of ~ 85%. This corresponds to 185 SI06Rate/node and thus 23.13 SpecInt2006rate per core = 4278 SI2k-effective = 10.43 GHz.hr per core with 16 job slots per system, or 19.6 SI06rate/core = 3637 SI2k-effective = 8.87 GHz.hr-equiv per core.

pbsaccdb factor: 4278/410 = 10.43 (16 job slots)

pbsaccdb factor: 3637/410 = 8.87 (8 job slots, no HT)

176 systems with 1408 cores contribute 32560 SpecInt-2006 = 6023 kSI2k.

HEP-SPEC06 results on EL5/x86_64: XXX per box, meaning YYY per core.

Node type: DELL PowerEdge 1435 SC

This is a new series of nodes fitted with dual/dual AMD Opteron 2220 processors. The following tests were done on CentOS 4.4.

standby (off) 0.2A 50W
unloaded system 0.9A 210W
with 4x burnP6 and 4x "burnMMX P" 1.25A 286W