Discussion:
[OMPI users] mpirun hangs
Mota, Thyago
2018-08-15 14:55:39 UTC
Permalink
Hello.

I have openmpi 2.0.4 installed on a Cent OS 7. When I try to run "mpirun"
it *hangs*. Below is the output I get using the debug option:

$ mpirun -d

[elm:07778] procdir: /tmp/openmpi-sessions-***@elm_0/12011/0/0

[elm:07778] jobdir: /tmp/openmpi-sessions-***@elm_0/12011/0

[elm:07778] top: openmpi-sessions-***@elm_0

[elm:07778] tmp: /tmp

[elm:07778] sess_dir_cleanup: job session dir does not exist

[elm:07778] procdir: /tmp/openmpi-sessions-***@elm_0/12011/0/0

[elm:07778] jobdir: /tmp/openmpi-sessions-***@elm_0/12011/0

[elm:07778] top: openmpi-sessions-***@elm_0

[elm:07778] tmp: /tmp

If I use "strace mpirun" I get the following lines at the end.

mprotect(0x7f173c619000, 4096, PROT_READ) = 0

munmap(0x7f173cae5000, 2114000) = 0

munmap(0x7f173c8e0000, 2114264) = 0

munmap(0x7f173c3c3000, 2466952) = 0

futex(0x7f1741f1aa00, FUTEX_LOCK_PI, 1, NULL) = -1 ESRCH (No such process)

pause(

Any idea what could be causing the problem?

Thanks.

Thyago
Jeff Squyres (jsquyres) via users
2018-08-15 15:46:57 UTC
Permalink
There can be lots of reasons that this happens. Can you send all the information listed here?

https://www.open-mpi.org/community/help/
Post by Mota, Thyago
Hello.
$ mpirun -d
[elm:07778] tmp: /tmp
[elm:07778] sess_dir_cleanup: job session dir does not exist
[elm:07778] tmp: /tmp
If I use "strace mpirun" I get the following lines at the end.
mprotect(0x7f173c619000, 4096, PROT_READ) = 0
munmap(0x7f173cae5000, 2114000) = 0
munmap(0x7f173c8e0000, 2114264) = 0
munmap(0x7f173c3c3000, 2466952) = 0
futex(0x7f1741f1aa00, FUTEX_LOCK_PI, 1, NULL) = -1 ESRCH (No such process)
pause(
Any idea what could be causing the problem?
Thanks.
Thyago
_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users
--
Jeff Squyres
***@cisco.com
Mota, Thyago
2018-08-15 16:34:13 UTC
Permalink
Attached is the config.log and the ompi_info

Thanks.

On Wed, Aug 15, 2018 at 11:46 AM, Jeff Squyres (jsquyres) via users <
Post by Jeff Squyres (jsquyres) via users
There can be lots of reasons that this happens. Can you send all the
information listed here?
https://www.open-mpi.org/community/help/
Post by Mota, Thyago
Hello.
I have openmpi 2.0.4 installed on a Cent OS 7. When I try to run
$ mpirun -d
[elm:07778] tmp: /tmp
[elm:07778] sess_dir_cleanup: job session dir does not exist
[elm:07778] tmp: /tmp
If I use "strace mpirun" I get the following lines at the end.
mprotect(0x7f173c619000, 4096, PROT_READ) = 0
munmap(0x7f173cae5000, 2114000) = 0
munmap(0x7f173c8e0000, 2114264) = 0
munmap(0x7f173c3c3000, 2466952) = 0
futex(0x7f1741f1aa00, FUTEX_LOCK_PI, 1, NULL) = -1 ESRCH (No such
process)
Post by Mota, Thyago
pause(
Any idea what could be causing the problem?
Thanks.
Thyago
_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users
--
Jeff Squyres
_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users
--
Mota, Thyago
***@moravian.edu
Assistant Professor of Computer Science
Office: HSB 206
Phone: (610) 861-1403
Moravian College <http://www.moravian.edu/>
Jeff Squyres (jsquyres) via users
2018-08-15 17:11:33 UTC
Permalink
Sorry; I'm still not clear on exactly what you are trying to do, or how it is failing for you. Your copy-n-pasted output in the first mail appeared to be incorrect -- there was no command on the mpirun command line, and the strace output appears to be truncated.

Are you only running on a single node? Are you able to run hostname(1), or the MPI "hello world" or "ring" applications?

Can you upgrade to a newer version of Open MPI? 2.0.x is very old -- 3.1.0 is the most recent.
Post by Mota, Thyago
Attached is the config.log and the ompi_info
Thanks.
There can be lots of reasons that this happens. Can you send all the information listed here?
https://www.open-mpi.org/community/help/
Post by Mota, Thyago
Hello.
$ mpirun -d
[elm:07778] tmp: /tmp
[elm:07778] sess_dir_cleanup: job session dir does not exist
[elm:07778] tmp: /tmp
If I use "strace mpirun" I get the following lines at the end.
mprotect(0x7f173c619000, 4096, PROT_READ) = 0
munmap(0x7f173cae5000, 2114000) = 0
munmap(0x7f173c8e0000, 2114264) = 0
munmap(0x7f173c3c3000, 2466952) = 0
futex(0x7f1741f1aa00, FUTEX_LOCK_PI, 1, NULL) = -1 ESRCH (No such process)
pause(
Any idea what could be causing the problem?
Thanks.
Thyago
_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users
--
Jeff Squyres
_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users
--
Mota, Thyago
Assistant Professor of Computer Science
Office: HSB 206
Phone: (610) 861-1403
Moravian College
<ompi_info_all.tgz><config.log.tgz>_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users
--
Jeff Squyres
***@cisco.com
Loading...