
SU2 not running with 8 and more nodes.

December 4, 2023, 05:35
#1
New Member
 
Pravin
Join Date: Dec 2023
Posts: 2
Hello everyone,

I am encountering a runtime issue with SU2. I compiled SU2 successfully with Intel oneAPI 2023 on operating system version 9.2. SU2 runs correctly and writes output on a single node and on up to 7 nodes, but on 8 and 16 nodes it hangs and produces no output. I have tried Intel oneAPI versions 2021, 2022, and 2023, and the problem persists with each of them. Notably, SU2 works properly when using OpenMPI with the GNU compiler.

Please help me to resolve this issue.

Thank you,
Pravin

Last edited by pravin; December 5, 2023 at 02:40.

December 4, 2023, 08:30
#2
Senior Member
 
bigfoot
Join Date: Dec 2011
Location: Netherlands
Posts: 676
So the problem only occurs when compiling with this specific compiler? You say that it works when using OpenMPI; is that with the GNU compiler?

December 4, 2023, 09:22
#3
New Member
 
Pravin
Join Date: Dec 2023
Posts: 2
Yes, that is with the GNU compiler.

December 19, 2023, 06:52
Request for help
#4
New Member
 
Join Date: Dec 2023
Posts: 2
Hello all,

I am having the same issue here in an HPC environment. Slurm is used to submit jobs, and we have had no issues with other jobs, but the SU2 MPI job hangs when run on more than 7 nodes.

Does anyone have a similar experience?

December 19, 2023, 08:19
#5
Senior Member
 
bigfoot
Join Date: Dec 2011
Location: Netherlands
Posts: 676
When you say nodes, do you actually mean cores? HPC clusters usually have multiple cores per node, so an issue could arise when the cores used in the job are distributed over multiple nodes and the communication between nodes is faulty. Maybe you can check by forcing your scheduler to run on 2 cores on 1 node and then on 2 cores spread over 2 nodes.

Since this does not seem to be a common issue (I do not have any problems running on multiple cores or nodes), it might be due to your specific compiler and MPI version. I would try some different compiler and MPI versions to see whether the hang is tied to a specific one.
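
The two-layout check suggested above could be scripted roughly as follows. This is only a sketch: it assumes the scheduler is Slurm (as mentioned earlier in the thread) and that `srun` can launch the MPI job directly on your cluster, which depends on how Intel MPI and Slurm are integrated there. `SU2_CFD` is the SU2 solver executable; the config file name `case.cfg` is a placeholder for your own case.

```python
import shlex


def srun_command(nodes, ntasks, executable="SU2_CFD", config="case.cfg"):
    """Build an srun command line placing `ntasks` MPI ranks evenly on `nodes` nodes.

    `config` is a placeholder file name, not something taken from the thread.
    """
    if ntasks % nodes != 0:
        raise ValueError("ntasks must be divisible by nodes for an even layout")
    args = [
        "srun",
        f"--nodes={nodes}",
        f"--ntasks={ntasks}",
        f"--ntasks-per-node={ntasks // nodes}",
        executable,
        config,
    ]
    return shlex.join(args)


# Same number of ranks, different node counts: (nodes, ntasks).
LAYOUTS = [(1, 2), (2, 2)]

if __name__ == "__main__":
    for nodes, ntasks in LAYOUTS:
        print(srun_command(nodes, ntasks))
```

If the 2-rank job works on one node but hangs on two, that would point at inter-node communication rather than at SU2 itself.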

December 19, 2023, 08:25
#6
New Member
 
Join Date: Dec 2023
Posts: 2
Yes, by nodes I mean physical nodes. The scheduler is configured to use the maximum number of cores, and the job batch script is also set to use 64 cores on each node.

And yes, it is related to the compiler: it works fine with OpenMPI 4, but we want to use Intel oneAPI, and with that build no output is produced.
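
For reference, here is a quick sketch of how many MPI ranks those job sizes imply, assuming one MPI rank per core (an assumption, not something confirmed above) and 64 cores per node as described. It does not say whether the node count or the rank count triggers the hang; it only shows where the failing sizes start.

```python
CORES_PER_NODE = 64  # from the job setup described above


def total_ranks(nodes, cores_per_node=CORES_PER_NODE):
    """Total MPI ranks, assuming one rank per core (an assumption)."""
    return nodes * cores_per_node


for nodes in (1, 7, 8, 16):
    print(f"{nodes:2d} nodes -> {total_ranks(nodes):4d} MPI ranks")
```

Under that assumption, the last working size (7 nodes) is 448 ranks and the first hanging size (8 nodes) is 512 ranks, so running 8 nodes with fewer ranks per node would help separate the two possible causes.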

All times are GMT -4.