|
[Sponsors] |
March 3, 2008, 17:48 |
I finally able to run my case
|
#21 |
Senior Member
Nishant
Join Date: Mar 2009
Location: Glasgow, UK
Posts: 166
Rep Power: 17 |
I finally able to run my case in parallel. There was some problem in the gcc installation. Now its working fine,.
Thank you .. Nishant
__________________
Thanks and regards, Nishant |
|
March 12, 2008, 11:17 |
I thought my parallel case is
|
#22 |
Senior Member
Nishant
Join Date: Mar 2009
Location: Glasgow, UK
Posts: 166
Rep Power: 17 |
I thought my parallel case is running, but actually it was not. However I can see the program running on queue. Error file is saying that..
ERROR: A daemon on node comp26 failed to start as expected. As i mentioned earlier my mpirun -hostfile machine <rooot> <case> -parallel command is running quite well on cluster directly. I mean to say that it's working fine if we run on master node (for us its kittyhawk.dcs.hull.ac.uk) but it fails on any other node ( like comp01/02/10/11 etc) I tried a hello mpi programm as well but that also failed to run using qsub and running quite well directly on master kittyhawk. my gcc compiler is unable to compile a program on any other node except master node kittyhawk. however they are using the right gcc (that is openfoam version of gcc) Again, I am using cluster's version of MPICH as PE. (#$ mpich -pe 4), which is installed at /usr/.....** The default PE environment here is >>score<< which we run using mpisub command. Do I need to use a local version of mpich in order to run in parallel using qsub? Or could it be possible to run openfoam program using score? can anybody suggst something? regards, Nishant
__________________
Thanks and regards, Nishant |
|
April 17, 2009, 12:21 |
Running OpenFOAM in parallel with SGE
|
#23 |
New Member
|
Hi,
to run openFOAM in parallel with SGE, you need to make sure that the following requisites are satisfied: 1) use an openmpi version >= 1.2.0 The reason is that any version prior to that isn't working with SGE. 2) Make sure that you define a parallel environment, for instance "orte", with the following definition (that's here for 8 parallel slots = 8 cores in parallel): pe_name orte slots 8 user_lists NONE xuser_lists NONE start_proc_args /bin/true stop_proc_args /bin/true allocation_rule $round_robin control_slaves TRUE job_is_first_task FALSE urgency_slots min 3) Submit your job with (for example with a run on 8 cores): qsub RUN.sh where RUN.sh contains: #!/bin/sh #$ -V ### number of processors and parallel environment #$ -pe orte 8 ### Job name #$ -N "mypartest" ### Start from current working directory #$ -cwd ### Generate the hostfile HOSTFILE=system/hostfile awk '{print $1" cpu=1"}' ${PE_HOSTFILE} > ${PWD}/${HOSTFILE} ### Run application SOLVER=icoFoam ${MPI_ARCH_PATH}/bin/mpirun -np ${NSLOTS} --hostfile ${PWD}/${HOSTFILE} ${FOAM_APPBIN}/${SOLVER} -parallel exit $? You will also find further information at: http://www.open-mpi.org/faq/?categor...run-scheduling Alternatively, you can try to compile MPICH from source. I've been able to run v1.2.7p1 without any dramas. This is quite straightforward if you take a look at the Allwmake scripts in $WM_THIRD_PARTY. Hope this helps... |
|
January 18, 2010, 09:10 |
Problem with openFoam and SGE
|
#24 |
New Member
Cárdenas
Join Date: Sep 2009
Posts: 5
Rep Power: 17 |
Hello to all,
I'm trying to run openFoam on a SGE Sun Cluster. When running the job on parallel in a single node, everything works. But when trying to run the same job on different nodes, I get the following error message: epsilon2.o31752: PHP Code:
------------------------------------------------------------------------------ #!/bin/tcsh # This is a simple example of a SGE batch script #$-o /nfs/home/cardenas/Documents/OpenFOAM/Cases/Platte/Laenge120mm/Pulsierend/eps ilon2 -j y #$-N epsilon2 #$-pe batch_64_2 2 #$-S /bin/tcsh touch $HOME/.ssh/known_hosts cd /nfs/home/cardenas/Documents/OpenFOAM/Cases/Platte/Laenge120mm/Pulsierend/epsilon2 touch -a ./*.* touch -a ./system/* source /nfs/home/cardenas/OpenFOAM/OpenFOAM-1.6.x/etc/cshrc cat $PE_HOSTFILE |awk '{ print $1 " cpu=" $2}' > $HOME/mpi/machines.LINUX.$JOB_ID sleep 10; mpirun --hostfile $HOME/mpi/machines.LINUX.$JOB_ID -np 2 icoFoam -parallel >log ----------------------------------------------------------------------------------------------- It seems that something with the Host Keys is not working properly, but since I'm not expirienced in SGE, I would appreciete any suggestions and hints. Thank you very much Alejandro |
|
January 19, 2010, 04:10 |
|
#25 | ||
Senior Member
Mark Olesen
Join Date: Mar 2009
Location: https://olesenm.github.io/
Posts: 1,715
Rep Power: 40 |
Quote:
The very first thing it to determine if GridEngine support has been compiled into your openmpi. Use the command "ompi_info" to list all the backends and grep for gridengine. If it's not there, you should recompile openmpi using the --with-sge configure option (see the third-party Allwmake). ^^^ what is this? Touching a file into existence doesn't make the hosts known! Quote:
If the final backend uses rsh, ssh, or the GridEngine builtin transport will depend on what you have configured as the 'rsh_command' and 'rsh_daemon' in GridEngine. BTW: your example is using cshell. Be certain that the queue is configured with the corresponding shell_start_mode. Be default this will be 'posix_compliant' (ie, use /bin/sh) and not 'unix_behavior' (ie, use #! to determine the shell/program). |
|||
October 27, 2011, 11:03 |
Pending but not running
|
#26 |
New Member
Join Date: Aug 2010
Posts: 7
Rep Power: 16 |
Hi,
I also tried to run OpenFoam in parallel with SGE. I use the following script to submit the job: Code:
#!/bin/csh #$ -V ###set queue #$ -q normal ### number of processors and parallel environment #$ -pe OpenFOAM 4 #$ -S /bin/csh ### Job name #$ -N "mypartest" ### Start from current working directory #$ -cwd source ./soft/OpenFOAM/OpenFOAM-2.0.0/etc/cshrc ### Run application mpirun -np ${NSLOTS} pisoFoam -parallel Code:
xhost: Command not found. : Command not found. : Command not found. : Command not found. : Command not found. /soft/OpenFOAM/OpenFOAM-2.0.0/etc/cshrc : No such file or directory. I don´t know why the grid engine can´t find the command. Does anybody have an idea why it doesn’t work? Or are there any settings I have to modify? I´m thankful for any help Stefan Last edited by schteff; November 21, 2011 at 05:26. |
|
May 4, 2012, 12:56 |
|
#27 | |
New Member
Ricardo Reis
Join Date: May 2012
Posts: 2
Rep Power: 0 |
Quote:
Code:
HOSTFILE=system/hostfile awk '{print $1" cpu="$2}' ${PE_HOSTFILE} > ${PWD}/${HOSTFILE} |
||
May 4, 2012, 12:57 |
|
#28 | |
New Member
Ricardo Reis
Join Date: May 2012
Posts: 2
Rep Power: 0 |
Quote:
/soft/OpenFOAM/OpenFOAM-2.0.0/etc/cshrc without the initial . ? |
||
August 22, 2012, 10:27 |
solution to "Host key verification failed"
|
#29 |
New Member
Timo Kulju
Join Date: Aug 2009
Posts: 22
Rep Power: 17 |
Hi!
If somebody is having problems with host-keys, adding a line Code:
StrictHostKeyChecking no Code:
/etc/ssh/ssh_config Code:
export OMPI_MCA_orte_rsh_agent=ssh |
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
CFX integration with Sun Grid Engine | mausmi | CFX | 2 | February 4, 2016 17:30 |
Grid Engine OpenFOAM15dev and OpenMPI124 | tian | OpenFOAM Installation | 11 | February 26, 2009 11:43 |
Running parallel job using qsub on sun grid engine | nishant_hull | OpenFOAM Running, Solving & CFD | 5 | February 7, 2008 15:52 |
IC engine | Araz Banaeizadeh | Main CFD Forum | 0 | June 28, 2006 23:56 |
CFX and Sun Grid Engine | David Hargreaves | CFX | 1 | August 26, 2005 00:50 |