|
[Sponsors] |
August 8, 2016, 09:18 |
|
#21 | |
Member
Kim Bindesbøll Andersen
Join Date: Oct 2010
Location: Aalborg, Denmark
Posts: 39
Rep Power: 16 |
Quote:
Specifying my present cluster the profesional advice was that the bandwidth of 1867 MHz DDR3 RAM matched E5-2667 v3 (octo-core). Assuming this is correct, then the ratio between CPU performance (fprate) and memory bandwidth (RAM speed) should be approx. that for the above combination. 2667 v3, fprate: 590 (dual processor), RAM speed: 1867 MHz, ratio (590/1867): 0.32 2667 V4, fprate: 724, RAM speed: 2400 MHz, ratio: 0.30 2687W v4, fprate: 888, RAM speed: 2400 MHz, ratio: 0.37 2683 V4, fprate: 933, RAM speed: 2400 MHz, ratio: 0.39 (Above assumes that the memory configuration are identical for all cases, except for the RAM frequency). As it can be seen, the 2667 v4 has sligthly less ratio (<0.32), which means that the memory bandwidth migth not be fully utilized. Whereas the others have higher ratio, meaning that the CPUs most likely are not fully utilized, but could benefit from more memory bandwidth. BR Kim |
||
August 8, 2016, 20:39 |
|
#22 |
New Member
dqwedff
Join Date: Aug 2016
Posts: 3
Rep Power: 10 |
Thanks for you help Robert!
After some reseach I found the Noctua NH-D15s, should do the job. Core affinity: very interesting point. I'll try that once my system is setup. Thanks! |
|
September 2, 2016, 12:47 |
|
#23 |
New Member
M-G
Join Date: Apr 2016
Posts: 28
Rep Power: 10 |
Dear Kim,
I cannot understand your idea for determining the memory bandwidth versus performance. 2667 v3 should compare to itself when you change the memory type not to 2667 v4 because fprate is affected by CPU performance too. Another question which rises here is that if fprate results are applicable for most CFD cases, then E5-2699 V4 has the highest rating among 2600 family. So memory bandwidth per core ratio is totally wrong for this case. It has 22 slower cores with the same 76.8 GB/s bandwidth of it's family. It's in an obvious contrast with using of HPC cluster ideology. How do you interpret it ? |
|
September 2, 2016, 17:29 |
|
#24 |
Senior Member
Robert
Join Date: Jun 2010
Posts: 117
Rep Power: 17 |
The 2699 scores 1090 and has 22 cores
A 2690 scores 922 and has 14 cores In this case ~50% more cores give ~20% more performance. The 2699 costs twice as much as the 2690 and would cost ~$4,000 per motherboard more. The issue is that there is a price performance balance between absolute performance and absolute performance per dollar. Part of this will depend on the licensing scheme of the code you use as to whether threads are limited or not. Additionally the use of fprate as a useful arbiter of relative performance may need to be examined as you go to the edges of CPU/memory bandwidth, as some of the SPEC applications may be less memory hungry than CFD which would boost the relative score. |
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Suggestions needed for meshing a CC wing | niablis | Mesh Generation & Pre-Processing | 0 | February 17, 2016 00:32 |
Workstation with new E5 Xeon Ivy-bridge | Manuelo | Hardware | 23 | November 24, 2014 15:11 |
need opinion Workstation 2x Xeon e5 e2690 ? | laxwendrofzx9r | Hardware | 6 | June 5, 2012 10:04 |
amd opteron based workstation needed | ztdep | Hardware | 1 | December 5, 2010 07:23 |
PC vs. Workstation | Tim Franke | Main CFD Forum | 5 | September 29, 1999 16:01 |