|
October 15, 2010, 02:32 |
MPI problem with fluent
|
#1 |
New Member
aryanet
Join Date: Oct 2010
Posts: 5
Rep Power: 16 |
Hi there,
I am new to running Fluent on Linux (CentOS). I have installed Fluent 6.3 on three machines, but when I run the command
/data/Fluent.Inc/bin/fluent -g 3d -cnf=/root/host -t4
it ends with the following output: Code:
[root@MDS1 ~]# /data/Fluent.Inc/bin/fluent -g 3d -cnf=/root/host -t4
/data/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 -g 3d -cnf=/root/host -t4
/data/Fluent.Inc/fluent6.3.26/cortex/lnamd64/cortex.3.7.3 -f fluent -g (fluent "3d -pethernet -host -r6.3.26 -t4 -mpi=hp -cnf=/root/host -path/data/Fluent.Inc")
Loading "/data/Fluent.Inc/fluent6.3.26/lib/fluent.dmp.114-64"
Done.
/data/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 3d -pethernet -host -t4 -mpi=hp -cnf=/root/host -path/data/Fluent.Inc -cx MDS1:47715:35420
Starting /data/Fluent.Inc/fluent6.3.26/lnamd64/3d_host/fluent.6.3.26 host -cx MDS1:47715:35420 "(list (rpsetvar (QUOTE parallel/function) "fluent 3d -node -r6.3.26 -t4 -pethernet -mpi=hp -cnf=/root/host ") (rpsetvar (QUOTE parallel/rhost) "") (rpsetvar (QUOTE parallel/ruser) "") (rpsetvar (QUOTE parallel/nprocs_string) "4") (rpsetvar (QUOTE parallel/auto-spawn?) #t) (rpsetvar (QUOTE parallel/trace-level) 0) (rpsetvar (QUOTE parallel/remote-shell) 0) (rpsetvar (QUOTE parallel/path) "/data/Fluent.Inc") (rpsetvar (QUOTE parallel/hostsfile) "/root/host") )"
Welcome to Fluent 6.3.26
Copyright 2006 Fluent Inc.
All Rights Reserved
Loading "/data/Fluent.Inc/fluent6.3.26/lib/flprim.dmp.1119-64"
Done.
Host spawning Node 0 on machine "MDS1" (unix).
/data/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 3d -node -t4 -pethernet -mpi=hp -cnf=/root/host -mport 127.0.0.1:127.0.0.1:45271:0
Starting /data/Fluent.Inc/fluent6.3.26/multiport/mpi/lnamd64/hp/bin/mpirun -TCP -f /tmp/fluent-appfile.16803
mpirun: No route to host
mpirun: Bad file descriptor |
|
October 22, 2010, 17:02 |
|
#2 |
Member
Basharat
Join Date: Feb 2010
Posts: 37
Rep Power: 16 |
The error is not with your MPI but with the parallel connectivity. Check your ssh or rsh setup, make sure the firewall is stopped, then run it again. Let me know if you still get an error.
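A minimal sketch of the ssh part of that check, assuming the hosts file passed to -cnf holds one hostname per line and that '#' starts a comment (both are assumptions, not something the thread confirms). The function names `list_hosts` and `check_hosts` are made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch only. Assumption: one hostname per line, '#' starts a comment.

# Print the usable host names from the hosts file given as $1.
list_hosts() {
    sed -e 's/#.*//' -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//' "$1" | grep -v '^$'
}

# Try a non-interactive ssh login to every host and report the result.
# BatchMode=yes makes ssh fail instead of prompting for a password.
check_hosts() {
    local host
    for host in $(list_hosts "$1"); do
        if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" true 2>/dev/null; then
            echo "$host: ssh ok"
        else
            echo "$host: ssh FAILED"
        fi
    done
}
```

With the hosts file from the first post this would be run as `check_hosts /root/host`. Any host reported as FAILED either is unreachable or still asks for a password.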
__________________
Rgds Martin |
|
October 23, 2010, 17:39 |
|
#3 | |
New Member
aryanet
Join Date: Oct 2010
Posts: 5
Rep Power: 16 |
Code:
[root@MDS1 bin]# /data/Fluent.Inc/bin/fluent -g 3d -cnf=/root/host -t2
/data/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 -g 3d -cnf=/root/host -t2
/data/Fluent.Inc/fluent6.3.26/cortex/lnamd64/cortex.3.7.3 -f fluent -g (fluent "3d -pethernet -host -r6.3.26 -t2 -mpi=hp -cnf=/root/host -path/data/Fluent.Inc")
Loading "/data/Fluent.Inc/fluent6.3.26/lib/fluent.dmp.114-64"
Done.
/data/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 3d -pethernet -host -t2 -mpi=hp -cnf=/root/host -path/data/Fluent.Inc -cx MDS1:44145:40097
Starting /data/Fluent.Inc/fluent6.3.26/lnamd64/3d_host/fluent.6.3.26 host -cx MDS1:44145:40097 "(list (rpsetvar (QUOTE parallel/function) "fluent 3d -node -r6.3.26 -t2 -pethernet -mpi=hp -cnf=/root/host ") (rpsetvar (QUOTE parallel/rhost) "") (rpsetvar (QUOTE parallel/ruser) "") (rpsetvar (QUOTE parallel/nprocs_string) "2") (rpsetvar (QUOTE parallel/auto-spawn?) #t) (rpsetvar (QUOTE parallel/trace-level) 0) (rpsetvar (QUOTE parallel/remote-shell) 0) (rpsetvar (QUOTE parallel/path) "/data/Fluent.Inc") (rpsetvar (QUOTE parallel/hostsfile) "/root/host") )"
Welcome to Fluent 6.3.26
Copyright 2006 Fluent Inc.
All Rights Reserved
Loading "/data/Fluent.Inc/fluent6.3.26/lib/flprim.dmp.1119-64"
Done.
Host spawning Node 0 on machine "MDS1" (unix).
/data/Fluent.Inc/fluent6.3.26/bin/fluent -r6.3.26 3d -node -t2 -pethernet -mpi=hp -cnf=/root/host -mport 127.0.0.1:127.0.0.1:37906:0
Starting /data/Fluent.Inc/fluent6.3.26/multiport/mpi/lnamd64/hp/bin/mpirun -TCP -f /tmp/fluent-appfile.16143
HP-MPI licensed for execution of Fluent.
0: mpt_connect: error: connect failed: Connection refused
0: mpt_establish_connection: error: unable to connect: Illegal seek
0: mpt_connect: error: connect failed: Connection refused
0: mpt_establish_connection: error: unable to connect: Illegal seek
0: mpt_connect: error: connect failed: Connection refused
0: mpt_establish_connection: error: unable to connect: Illegal seek
0: mpt_connect_to_server: error: cannot establish connection; bye.: Illegal seek
MPI Application rank 0 exited before MPI_Finalize() with status 0
Could you please help? |
||
October 24, 2010, 02:45 |
|
#4 |
Member
Basharat
Join Date: Feb 2010
Posts: 37
Rep Power: 16 |
OK.
First check connectivity between the machines, and make sure they use static IP addresses rather than dynamic ones. Then, if you know how to configure ssh, set it up for parallel computing. I suggest adding -ssh to your fluent command. Also set SELinux to permissive mode; it sometimes blocks connections it considers suspicious. Try these steps and let me know. Cheers
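As a rough sketch of the SELinux step: `getenforce` and `setenforce` are the standard SELinux utilities, and `setenforce 0` switches to permissive mode until the next reboot. The wrapper functions `selinux_mode` and `make_permissive` are hypothetical helpers written for this example, and the second one needs root.

```shell
#!/usr/bin/env bash
# Sketch only: report the SELinux mode and, if enforcing, relax it.

# Print Enforcing, Permissive or Disabled, or "unavailable" when the
# getenforce utility is not installed on this machine.
selinux_mode() {
    if command -v getenforce >/dev/null 2>&1; then
        getenforce
    else
        echo "unavailable"
    fi
}

# Switch to permissive mode for the current boot (requires root),
# then print the resulting mode.
make_permissive() {
    if [ "$(selinux_mode)" = "Enforcing" ]; then
        setenforce 0
    fi
    selinux_mode
}
```

Running `selinux_mode` on every node first shows whether SELinux can be involved at all; a permanent change would go into the SELinux configuration file rather than through `setenforce`.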
__________________
Rgds Martin |
|
October 24, 2010, 02:57 |
|
#5 | |
New Member
aryanet
Join Date: Oct 2010
Posts: 5
Rep Power: 16 |
I ran Fluent with the -ssh switch, but nothing changed. SELinux is also completely disabled. I'm really stuck... |
||
October 24, 2010, 06:40 |
|
#6 |
Member
Basharat
Join Date: Feb 2010
Posts: 37
Rep Power: 16 |
OK, then check whether Fluent is installed properly; sometimes the mpi folder is missing.
I haven't seen this kind of error before. There must be a mistake somewhere in the setup.
__________________
Rgds Martin |
|
October 24, 2010, 15:37 |
|
#7 |
New Member
Grzegorz Kondora
Join Date: Oct 2010
Posts: 13
Rep Power: 16 |
Check whether MPI is correctly installed. Run FLUENT with the -ssh option. Before that, do:
cd ~; cd .ssh;
ssh-keygen -t dsa; {blank passphrase - ENTER, ENTER};
cat id_dsa.pub >> authorized_keys2;
ssh 127.0.0.1; {confirm with yes};
ssh 127.0.1.1; {confirm with yes};
Then check that you can ssh to 127.0.0.1 and 127.0.1.1 without typing a password. In general: google -> "ssh without password". If MPI is still not working, try: fluent 2d -ssh -mpi=intel. Hope this helps. |
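The key-authorization step can be made repeatable with a small helper. This is a sketch under assumptions: `authorize_key` is a made-up function name, and it appends the public key only when it is not already present, so running it twice does not duplicate the entry or overwrite keys that were there before.

```shell
#!/usr/bin/env bash
# Sketch only: add a public key to an authorized-keys file exactly once.
# $1 = public key file, $2 = authorized-keys file (created if missing).
authorize_key() {
    local pub="$1" auth="$2"
    mkdir -p "$(dirname "$auth")"
    touch "$auth"
    # -q quiet, -x whole-line match, -F fixed string (no regex)
    if ! grep -qxF -- "$(cat "$pub")" "$auth"; then
        cat "$pub" >> "$auth"
    fi
    # sshd typically ignores key files that others can write to
    chmod 600 "$auth"
}
```

For the commands in the post this would be called as `authorize_key ~/.ssh/id_dsa.pub ~/.ssh/authorized_keys2`; on current OpenSSH installations the usual file name is `authorized_keys`.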
|
October 24, 2010, 15:45 |
|
#8 |
New Member
Grzegorz Kondora
Join Date: Oct 2010
Posts: 13
Rep Power: 16 |
Sorry, I didn't notice that you are using different machines. Instead of setting up passwordless ssh to localhost, try this: http://linuxproblem.org/art_9.html
|
|
October 24, 2010, 16:13 |
|
#9 |
New Member
aryanet
Join Date: Oct 2010
Posts: 5
Rep Power: 16 |
Well, I'm sure ssh is configured properly.
But thanks for your help. |
|
December 23, 2015, 13:35 |
|
#10 |
New Member
mohammad
Join Date: Mar 2014
Posts: 16
Rep Power: 12 |
Hi,
I am new to Linux (Ubuntu) and have a similar problem. When I try to start Fluent on my machine, I get this error:
999999: mpt_get_dot_address: warning : UNI - SERVER _ > 127.0.0.1 check your system network configuration!
999999: mpt_get_dot_address: warning : UNI - SERVER _ > 127.0.0.1 check your system network configuration!
starting / user/ansys_inc/v150/fluent/fluent15.0.0/(.....),
mpirun: rsh: command not found
If anyone can help me get past this problem, I would really appreciate it. Thanks in advance. |
|
February 6, 2016, 10:53 |
|
#11 | |
Member
vlg
Join Date: Jul 2011
Location: My home :)
Posts: 81
Rep Power: 18 |
Your problem is possibly solved by adding
Code:
-ssh
to your command:
Code:
fluent -ssh ....
To check whether rsh is installed:
Code:
which rsh
Using rsh on Ubuntu: on each computing node:
Code:
sudo apt-get install rsh-server
sudo apt-get install rsh-client
Code:
sudo apt-get install rsh-client
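The `which rsh` check can be extended to cover both remote-shell clients. A minimal sketch, with `pick_remote_shell` as a hypothetical helper name; it only looks at what is on the PATH and does not test whether logins actually work.

```shell
#!/usr/bin/env bash
# Sketch only: print the first remote-shell client found on PATH.
# Prints "ssh", "rsh" or "none"; returns non-zero when neither exists.
pick_remote_shell() {
    local cmd
    for cmd in ssh rsh; do
        if command -v "$cmd" >/dev/null 2>&1; then
            echo "$cmd"
            return 0
        fi
    done
    echo "none"
    return 1
}
```

If this prints `ssh`, starting Fluent with -ssh avoids the "rsh: command not found" error without installing anything; if it prints `none`, one of the clients has to be installed first.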
Last edited by villager; February 6, 2016 at 11:10. Reason: added info about rsh |
||
February 6, 2016, 11:05 |
|
#12 | |
Member
vlg
Join Date: Jul 2011
Location: My home :)
Posts: 81
Rep Power: 18 |
1) The first thing to try is running without the -cnf option.
Fluent will then spawn all the processes you request with the -t option on your current machine:
Code:
/data/Fluent.Inc/bin/fluent -g 3d -t2
E.g., your machines are machine1 and machine2. Check that you can log in to each of them:
Code:
ssh machine1
Code:
ssh machine2
Run via ssh with an explicit node list (for example, we want two processes on each machine):
Code:
/data/Fluent.Inc/bin/fluent -g 3d -t2 -ssh -cnf=machine1:2,machine2:2
You could use rsh instead of ssh everywhere, though. I haven't used it, but I think the procedure is almost the same.
Cheers, John.
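For more than a couple of machines, the node list can be generated rather than typed. The sketch below assumes the `host:count,host:count` form shown in this post is what -cnf accepts; `build_cnf` is an illustrative helper, not part of Fluent.

```shell
#!/usr/bin/env bash
# Sketch only: build "host1:N,host2:N,..." for the -cnf option.
# $1 = processes per host, remaining arguments = host names.
build_cnf() {
    local per_host="$1" list="" host
    shift
    for host in "$@"; do
        # ${list:+$list,} adds the comma only after the first entry
        list="${list:+$list,}$host:$per_host"
    done
    echo "$list"
}
```

For the two machines above, `build_cnf 2 machine1 machine2` gives the same string as in the command, and the result can be passed as `-cnf=$(build_cnf 2 machine1 machine2)`.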
|
||
February 7, 2016, 07:29 |
|
#13 | |
New Member
mohammad
Join Date: Mar 2014
Posts: 16
Rep Power: 12 |
Thank you Villager, my problem is solved.
|
||
June 21, 2016, 11:46 |
|
#14 |
New Member
Join Date: Aug 2015
Posts: 6
Rep Power: 11 |
Hello, I'm now having the same problem as you. How did you solve it? Could you please help me? Thank you very much!
|
|
June 21, 2016, 23:02 |
|
#16 |
New Member
Join Date: Aug 2015
Posts: 6
Rep Power: 11 |
Hi, I have the same problem with parallel connectivity. I checked ssh and rsh and stopped the firewall, but the problem still exists. Could you please help me solve it?
|
|
January 5, 2017, 07:33 |
check your hosts file
|
#17 | |
New Member
lv
Join Date: Jan 2017
Posts: 2
Rep Power: 0 |
The real public IP address and your hostname must be listed in the hosts file, which is in the /etc folder.
|
||
October 30, 2017, 22:04 |
|
#18 |
New Member
lv
Join Date: Jan 2017
Posts: 2
Rep Power: 0 |
In Ubuntu 16.04, the hosts file maps the hostname to 127.0.1.1 by default.
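Whether a machine's hostname points at a loopback address can be checked with a short script. This is a sketch that only reads a hosts file in the usual "address name [alias...]" layout; `lookup_host` and `maps_to_loopback` are made-up helper names. A loopback mapping would be consistent with the `-mport 127.0.0.1:127.0.0.1:...` argument in the logs earlier in the thread, though the thread does not confirm that this was the cause there.

```shell
#!/usr/bin/env bash
# Sketch only: inspect a hosts file, do not modify it.

# Print the address the hosts file ($1) assigns to the name ($2).
# Only the first matching line is used.
lookup_host() {
    awk -v name="$2" '
        { sub(/#.*/, "") }    # drop trailing comments
        { for (i = 2; i <= NF; i++) if ($i == name) { print $1; exit } }
    ' "$1"
}

# Succeed when the name maps to a loopback address (127.x.x.x).
maps_to_loopback() {
    case "$(lookup_host "$1" "$2")" in
        127.*) return 0 ;;
        *)     return 1 ;;
    esac
}
```

A typical call is `maps_to_loopback /etc/hosts "$(hostname)" && echo "hostname resolves to loopback"`. If that message appears, the line for the hostname would need the machine's real address instead, as suggested in the previous post.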
|
|
|
|