Discussion:
[OMPI users] funny SIGSEGV in 'ompi_info'
Paul Kapinos
2016-11-14 12:28:27 UTC
Permalink
Dear developers,
also the following issue is defintely raised by a misconfiguration of Open MPI,
SIGSEGV's in 'ompi_info' isn'n a good thing, thus this one mail.

Just call:
$ export OMPI_MCA_mtl="^tcp,^ib"
$ ompi_info --param all all --level 9
... and take a look at the below core dump of 'ompi_info' like below one.

(yes we know that "^tcp,^ib" is a bad idea).

Have a nice day,

Paul Kapinos

P.S. Open MPI: 1.10.4 and 2.0.1 have the same behaviour

--------------------------------------------------------------------------
[lnm001:39957] *** Process received signal ***
[lnm001:39957] Signal: Segmentation fault (11)
[lnm001:39957] Signal code: Address not mapped (1)
[lnm001:39957] Failing at address: (nil)
[lnm001:39957] [ 0] /lib64/libpthread.so.0(+0xf100)[0x2b30f1a79100]
[lnm001:39957] [ 1]
/opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(+0x2f11f)[0x2b30f084911f]
[lnm001:39957] [ 2]
/opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(+0x2f265)[0x2b30f0849265]
[lnm001:39957] [ 3]
/opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(opal_info_show_mca_params+0x91)[0x2b30f0849031]
[lnm001:39957] [ 4]
/opt/MPI/openmpi-1.10.4/linux/intel_16.0.2.181/lib/libopen-pal.so.13(opal_info_do_params+0x1f4)[0x2b30f0848e84]
[lnm001:39957] [ 5] ompi_info[0x402643]
[lnm001:39957] [ 6] /lib64/libc.so.6(__libc_start_main+0xf5)[0x2b30f1ca7b15]
[lnm001:39957] [ 7] ompi_info[0x4022a9]
[lnm001:39957] *** End of error message ***
zsh: segmentation fault (core dumped) ompi_info --param all all --level 9
--------------------------------------------------------------------------
--
Dipl.-Inform. Paul Kapinos - High Performance Computing,
RWTH Aachen University, IT Center
Seffenter Weg 23, D 52074 Aachen (Germany)
Tel: +49 241/80-24915
Loading...