CFD Online Logo CFD Online URL
www.cfd-online.com
[Sponsors]
Home > Forums > General Forums > Hardware

Fluent run with limit around 10-15MB/s bandwidth

Register Blogs Community New Posts Updated Threads Search

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old   September 18, 2023, 11:12
Default Fluent run with limit around 10-15MB/s bandwidth
  #1
Member
 
Nguyen Trong Hiep
Join Date: Aug 2018
Posts: 48
Rep Power: 8
hiep.nguyentrong is on a distinguished road
I have 3 computer node with 16 cores each, 1GE interconect.
I run fluent mpitest or fluent report bandwidth give me 100MB/s. But when i run a simple simulation, single phase, no interface, network monitor only show 10 to 15MB in/out.

Both intelmpi and openmpi give the same result.
How can i use all my network bandwidth in my simulation?
hiep.nguyentrong is offline   Reply With Quote

Old   September 18, 2023, 11:57
Default
  #2
Senior Member
 
Join Date: Oct 2011
Posts: 242
Rep Power: 16
naffrancois is on a distinguished road
Hello,

This probably simply means that your run does not require a high network bandwidth. The amount of data transfered to the network is roughly speaking proportional to the number of cells located at the mesh partitions boundaries. If your mesh is not too big, if your number of partitions is not too big, the amount of data is not so big and 1GbE limit is not reached, which is good ! Note that high latency of 1GbE can also impact the performance, but I would not worry too much with only 3 nodes.

If you want to see if your network slows down your calculations, monitor the runtime using 1, then 2 and finally 3 nodes. If it stalls between 1 and 2 or 2 and 3, then you may have a network limiting the performance.
naffrancois is offline   Reply With Quote

Old   September 18, 2023, 17:02
Default
  #3
Senior Member
 
Will Kernkamp
Join Date: Jun 2014
Posts: 365
Rep Power: 14
wkernkamp is on a distinguished road
Quote:
Originally Posted by hiep.nguyentrong View Post
I have 3 computer node with 16 cores each, 1GE interconect.
I run fluent mpitest or fluent report bandwidth give me 100MB/s. But when i run a simple simulation, single phase, no interface, network monitor only show 10 to 15MB in/out.

Both intelmpi and openmpi give the same result.
How can i use all my network bandwidth in my simulation?

In addition to the answer above, there may be another reason: The mpi communication happens between the nodes upon completion of an iteration step. So the majority of the time, the nodes are at work and not sending anything. So if for 15% of the time the nodes are communicating at maximum bandwith, you would see a bandwidth use of 15% * 100 MB/s = 15 MB/s
wkernkamp is offline   Reply With Quote

Old   September 18, 2023, 22:05
Default
  #4
Member
 
Nguyen Trong Hiep
Join Date: Aug 2018
Posts: 48
Rep Power: 8
hiep.nguyentrong is on a distinguished road
Thank you for reply,
In my case, i have 10mil poly mesh and 48 cores. Fluent report that each iteration took 3.328s, data transfer per ite: 300MB, LE wall-clock/iter: 1.346(40%).
Quote:
The mpi communication happens between the nodes upon completion of an iteration step
that's true. I mean the maximum data transfer is 15MB.
When i increase 3 computer node to 10, speed increase only 50%. That whh i think the network is the bottleneck
hiep.nguyentrong is offline   Reply With Quote

Old   September 18, 2023, 22:06
Default
  #5
Member
 
Nguyen Trong Hiep
Join Date: Aug 2018
Posts: 48
Rep Power: 8
hiep.nguyentrong is on a distinguished road
http://upanh.tv/image/jPryQA

Here is the networks monitor
hiep.nguyentrong is offline   Reply With Quote

Old   September 19, 2023, 16:27
Default
  #6
Senior Member
 
Will Kernkamp
Join Date: Jun 2014
Posts: 365
Rep Power: 14
wkernkamp is on a distinguished road
Quote:
Originally Posted by hiep.nguyentrong View Post
Thank you for reply,
In my case, i have 10mil poly mesh and 48 cores. Fluent report that each iteration took 3.328s, data transfer per ite: 300MB, LE wall-clock/iter: 1.346(40%).
that's true. I mean the maximum data transfer is 15MB.
When i increase 3 computer node to 10, speed increase only 50%. That whh i think the network is the bottleneck

For 10 nodes yes. I have build a network with old ConnectX=3 cards. That should work better for you. If you get dual port cards, you can build a ring config. Still fast and no noisy hub. (Only for people with some vacation time due, haha)
wkernkamp is offline   Reply With Quote

Old   September 19, 2023, 17:52
Default
  #7
Senior Member
 
Join Date: Jun 2011
Posts: 205
Rep Power: 16
CFDfan is on a distinguished road
Quote:
Originally Posted by hiep.nguyentrong View Post
http://upanh.tv/image/jPryQA

Here is the networks monitor
Hi, what's the name of the tool you've used for network traffic monitoring
CFDfan is offline   Reply With Quote

Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Stop Fluent from cycling though graphics after end of run sskp FLUENT 0 August 17, 2022 03:47
Preserve meshes in Fluent parametric run hwerner2014 FLUENT 2 February 28, 2022 16:29
Scripts to run fluent in Linux cluster via windows sanjaykummar2810 FLUENT 1 September 19, 2018 16:23
Automatic run Fluent Phuc_Nguyen FLUENT 1 October 19, 2017 07:31
Parallel run in fluent for multiphase flow apurv FLUENT 2 August 3, 2011 19:44


All times are GMT -4. The time now is 20:29.