Unable to run OpenFOAM 1.6-ext in parallel with more than one machine |
January 24, 2014, 18:35 |
#1 | |
Senior Member
mahdi abdollahzadeh
Join Date: Mar 2011
Location: Covilha,Portugal
Posts: 153
Rep Power: 15 |
Dear all
I'm having a problem running in parallel across multiple nodes. On a single node everything runs without any problem, but with more than one node I get this error every time: Quote:
I should add that I did not get any errors while installing OpenFOAM 1.6-ext ( http://sourceforge.net/p/openfoam-ex...ci/1.6.1/tree/ ). Other users are running OpenFOAM 2.x and do not have this problem. I would be very thankful for your help.
Mehdi
Last edited by mm.abdollahzadeh; January 24, 2014 at 19:44.
||
January 24, 2014, 20:05 |
|
#2 |
Retired Super Moderator
Bruno Santos
Join Date: Mar 2009
Location: Lisbon, Portugal
Posts: 10,982
Blog Entries: 45
Rep Power: 128 |
Greetings mm.abdollahzadeh,
Not much information to work with... therefore I'll have to ask a few questions:
Bruno
January 24, 2014, 20:25 |
|
#3 | |
Senior Member
mahdi abdollahzadeh
Many thanks, Bruno.
I have installed OpenFOAM in my own folder, which is shared on all nodes. We are using Rocks. Below are the commands that I use: Quote:
I followed the instructions from http://sourceforge.net/p/openfoam-ex..._5.5_64bit.txt
Best,
Mehdi
||
January 24, 2014, 20:42 |
|
#4 |
Retired Super Moderator
Bruno Santos
Hi Mehdi,
If my diagnosis is correct, the problem is that the cluster does not accept the custom Open-MPI build that the 1.6-ext installation scripts/instructions set up by default. You must use the MPI installation that the cluster already has. Go into the folder where 1.6-ext is installed, edit the file "etc/prefs.sh", and search for these lines: Code:
#export WM_MPLIB=SYSTEMOPENMPI
#export OPENMPI_DIR=path_to_system_installed_openmpi
#export OPENMPI_BIN_DIR=$OPENMPI_DIR/bin
Uncomment them by removing the leading "#". The difficult part in this case is figuring out which folder holds the Open-MPI installation the cluster is using, so that you can set it in the variable "OPENMPI_DIR" on the second line. I'm too tired right now to guess what path it might be, so I suggest asking someone who works with that cluster. Once you know the path to the MPI toolbox, replace "path_to_system_installed_openmpi" with that path, e.g. "/opt/openmpi-1.6.2", then save and close the file. Then start a new terminal and run Allwmake again in the main 1.6-ext folder. Once it has finished, hopefully successfully, try running in parallel once again.
Best regards, Bruno
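One way to get a candidate for "OPENMPI_DIR" is sketched below. It assumes the cluster's own Open-MPI puts mpirun on the PATH of a shell in which the 1.6-ext environment has not been sourced (otherwise the 1.6-ext copy may be found first), and that the installation follows the usual <prefix>/bin/mpirun layout. The helper name prefix_from_bin is hypothetical, and the result should still be confirmed with whoever administers the cluster.

```shell
# prefix_from_bin: given the full path to a binary such as
# /opt/openmpi-1.6.2/bin/mpirun, print the install prefix
# (/opt/openmpi-1.6.2) by stripping the file name and the bin folder.
prefix_from_bin() {
    dirname "$(dirname "$1")"
}

# Look up mpirun on the PATH; the result stays empty if it is not found.
mpirun_path=$(command -v mpirun 2>/dev/null || true)

if [ -n "$mpirun_path" ]; then
    echo "Candidate OPENMPI_DIR: $(prefix_from_bin "$mpirun_path")"
else
    echo "mpirun not found on PATH; ask the cluster administrator" >&2
fi

# With the example path from the post above, the uncommented lines in
# etc/prefs.sh would then read:
#   export WM_MPLIB=SYSTEMOPENMPI
#   export OPENMPI_DIR=/opt/openmpi-1.6.2
#   export OPENMPI_BIN_DIR=$OPENMPI_DIR/bin
```

If mpirun is a symbolic link, the printed folder may be the link's location rather than the real installation, so it is worth checking that the folder also contains lib and include subfolders.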
January 25, 2014, 20:07 |
|
#5 |
Senior Member
mahdi abdollahzadeh
Dear Bruno
It is partly solved, but not completely. My cases now start to run, but there are still problems: even if I request, for example, 30 processors (no matter whether I use orte or mpi), the whole case starts running on one machine, which only has 12 processors! Note that the other users are still running without problems.
Best,
mahdi
|
January 26, 2014, 07:09 |
|
#6 |
Retired Super Moderator
Bruno Santos
Hi Mahdi,
I can't see what you're seeing unless you share it somehow (text or pictures). Are you certain that everything is properly compiled? What do these commands give you: Code:
which mpirun
which mpicc
echo $FOAM_MPI_LIBBIN
ls -l $FOAM_MPI_LIBBIN
Bruno
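In addition to the checks above, a common way to see where the processes actually land is to launch the plain hostname command through mpirun and count the lines per node. The sketch below only shows the counting step on sample input. The helper name count_ranks_per_host is hypothetical, and the node names are taken from later in this thread purely as sample data.

```shell
# count_ranks_per_host: read one hostname per line on stdin and print
# "<count> <hostname>" for each distinct host, sorted by hostname.
count_ranks_per_host() {
    sort | uniq -c | awk '{print $1, $2}'
}

# Sample input standing in for the output of "mpirun ... hostname":
# three processes, two of them on the same node.
printf '%s\n' compute-0-3-3 compute-0-7-7 compute-0-3-3 | count_ranks_per_host
```

On the cluster, the sample input would be replaced by something like `mpirun -np 30 $ARGS hostname | count_ranks_per_host` inside the job script, where ARGS is whatever the script already defines. If every process is reported on a single node, mpirun has not been told about the other machines.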
January 26, 2014, 09:10 |
|
#7 | |||||
Senior Member
mahdi abdollahzadeh
Dear Bruno
Many thanks for your reply. Here is the output of the commands: which mpirun Quote:
Quote:
Quote:
Quote:
Untitled.jpg
It starts to run. On the master node I can see that: Quote:
Untitledw.jpg
Best regards,
mahdi
||||||
January 26, 2014, 09:25 |
|
#8 |
Retired Super Moderator
Bruno Santos
Hi Mahdi,
Do you have another version of OpenFOAM, such as 2.2.2 or any other? And are you able to use it yourself on more than one machine? In addition, are you certain you are using the job script properly? Some job scripts need to specify the machines where to run. Best regards, Bruno
January 26, 2014, 09:35 |
|
#9 |
Senior Member
mahdi abdollahzadeh
Dear Bruno
Unfortunately, I don't have another version, but other users are running OpenFOAM 2.1.0 without problems. I think the job script is correct, but I suspect that the ARGS in my script may not be consistent with OpenFOAM extended?!
Best,
mahdi
|
January 26, 2014, 10:18 |
|
#10 |
Retired Super Moderator
Bruno Santos
Hi Mahdi,
You need to ask the other users, since I'm not familiar with the job scheduler you are using. The ARGS entry currently only takes care of disabling (or enabling?) one connection port and enabling another. It does not specify the machines to be used. Try adding the following to the ARGS variable: Code:
-host compute-0-3-3,compute-0-7-7,compute-0-9-9
Code:
ARGS="--mca btl ^openib --mca btl_tcp_if_include eth0 -host compute-0-3-3,compute-0-7-7,compute-0-9-9"
Bruno
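The ARGS line above can also be assembled from a list of node names, as in the minimal sketch below. In Open MPI, `--mca btl ^openib` excludes the openib (InfiniBand) transport and `--mca btl_tcp_if_include eth0` restricts TCP traffic to the eth0 interface, while `-host` takes a comma-separated list of machines. The helper name join_hosts, the solver name simpleFoam and the process count 30 are placeholders for illustration, not taken from the actual job script.

```shell
# join_hosts: print all arguments joined by commas, the form that
# mpirun's -host option expects.
join_hosts() {
    # The subshell keeps the IFS change local to this function.
    ( IFS=,; printf '%s\n' "$*" )
}

# Node names copied from the post above; replace them with the nodes
# that the scheduler actually assigned to the job.
host_list=$(join_hosts compute-0-3-3 compute-0-7-7 compute-0-9-9)

ARGS="--mca btl ^openib --mca btl_tcp_if_include eth0 -host $host_list"

# Print the command instead of running it, so it can be inspected first.
echo "mpirun -np 30 $ARGS simpleFoam -parallel"
```

Printing the command first makes it easy to check the final string before handing it to the scheduler. Many schedulers provide the list of assigned nodes in a file or environment variable, which would be a better source for the node names than a hard-coded list.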
January 26, 2014, 10:51 |
|
#11 | |
Senior Member
mahdi abdollahzadeh
Dear Bruno
Unfortunately it still didn't work. It gives me: Quote:
Best regards,
Mahdi
||
January 26, 2014, 12:07 |
|
#12 | |
Retired Super Moderator
Bruno Santos
Hi Mahdi,
Did you also try the one I had written? Code:
-host compute-0-3-3,compute-0-7-7,compute-0-9-9
Quote:
Code:
echo $FOAM_LIBBIN/$FOAM_MPI
Best regards, Bruno
January 26, 2014, 12:43 |
|
#13 | ||
Senior Member
mahdi abdollahzadeh
Dear Bruno
I certainly tested the command that you suggested, but it didn't work. I got: Quote:
Quote:
Mahdi |
|||
January 26, 2014, 14:44 |
|
#14 |
Retired Super Moderator
Bruno Santos
Hi Mahdi,
Sorry, then I have absolutely no idea. You need to talk to the system's administrator to assess what the problem is. In theory you should now be using the correct MPI toolbox, therefore the problem should be somewhere in the job script. Best regards, Bruno
January 27, 2014, 10:40 |
|
#15 | |
Senior Member
mahdi abdollahzadeh
Thank you Bruno
I added these lines to my /etc/bashrc: Quote:
Many thanks, Bruno.
Best regards,
mahdi