|
[Sponsors] |
July 15, 2020, 18:55 |
Scalability of FLOW-3D
|
#1 |
New Member
toboto
Join Date: Jun 2016
Posts: 20
Rep Power: 10 |
Dear All,
Is there any benchmarks for the scalability of Flow3d v11.2 (NOT MP version) on clusters. I am asking since I got very disappointing results myself. I tried identical model on two machines:
I only gained about 10% acceleration with the workstation. Is that normal with the non-MP version of Flow3d? Is there any way to improve the performance on the workstation without the need to buy Flow3d/MP license? Best regards |
|
July 16, 2020, 10:02 |
|
#2 |
Member
nazrin
Join Date: Apr 2016
Posts: 35
Rep Power: 10 |
Hi Toboto,
For workstation, how many ram slot does the motherboard have and how many are populated with ram? |
|
July 17, 2020, 15:40 |
|
#3 |
New Member
toboto
Join Date: Jun 2016
Posts: 20
Rep Power: 10 |
||
July 21, 2020, 05:25 |
|
#4 |
Super Moderator
Alex
Join Date: Jun 2012
Location: Germany
Posts: 3,428
Rep Power: 49 |
I guess the non-MP version still runs in parallel, just with OpenMP instead of a hybrid OpenMP/MPI parallelization. So my advice will be based on that assumption.
In this case, you could first try to pin the simulation to one of the two CPUs. This will prevent high memory latency from from NUMA. How to do this depends on the operating system. "taskset" is commonly used in linux. And of course, only use 12 threads for this simulation. Additionally, disable hyperthreading in the bios, which is a good idea for CFD workstations anyway. If this does not speed things up, you can try to play around with core binding options, to prevent the OS from shuffling around threads to different cores every few seconds. Contact support to check how it could be done. If that still doesn't work, your case might just be too small to scale on many cores. Or so large that both machines run out of memory. Edit: or some other bottleneck, like file I/O. Or your time measurement might include a lot of single-threaded setup time... Last edited by flotus1; July 21, 2020 at 11:16. |
|
July 22, 2020, 04:22 |
|
#5 |
Member
Join Date: Oct 2019
Posts: 65
Rep Power: 7 |
Flow 3D software in non-HPC version 11.2 is somehow optimized for 8 cores. The performance acceleration falls drastically after 8 cores even using single EPYC 7302P occupied by 8 channel memory.
This is off topic but for your information and comparison: I had a very nice experience using 2920x in 8 cores, frequency limited to 3.8 GHz, vcore 1.08v, 4x8gx3200MHz of RAM. The simulation time was about 15 percent slower than EPYC 7302p by 16 cores. In my opinion, a new PC like an Intel 9800X light workstation will provide much more performance than paying for the HPC license in your old machine (with lower memory frequency) unless you plan for a large model. |
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
mass flow in is not equal to mass flow out | saii | CFX | 12 | March 19, 2018 06:21 |
Different flow patterns in CFX and Fluent | avi@lpsc | FLUENT | 4 | April 8, 2012 07:12 |
Flow meter Design | CD adapco Group Marketing | Siemens | 3 | June 21, 2011 09:33 |
potential flow vs. Euler flow | curious ... | Main CFD Forum | 23 | July 21, 2006 08:40 |
Plug Flow | Franck | Main CFD Forum | 3 | September 4, 2003 06:57 |