[OMPI users] segfault (shared memory initialization) after program ended
Oliver
2018-09-26 00:39:59 UTC
hi -

I have an application that consistently segfaults when I run it with
"mpirun --oversubscribe". The message below appears AFTER the application
has finished running. My environment: macOS with Open MPI 3.1.2.

Is this a problem with my application or with my environment? Any help would be appreciated.

thanks

Oliver

--------------------------------------------------------------------------
A system call failed during shared memory initialization that should
not have. It is likely that your MPI job will now either abort or
experience performance degradation.

Local host: pi.local
System call: unlink(2)
/var/folders/h2/ph7pgd4n3_z9v2pd0hk5nc6w0000gn/T//ompi.pi.501/pid.45364/1/vader_segment.pi.c1c00001.7
Error: No such file or directory (errno 2)
--------------------------------------------------------------------------
mpirun(45364,0x70000e1c9000) malloc: *** mach_vm_map(size=1125899906846720) failed (error code=3)
*** error: can't allocate region
*** set a breakpoint in malloc_error_break to debug
[pi:45364] *** Process received signal ***
[pi:45364] Signal: Segmentation fault: 11 (11)
[pi:45364] Signal code: Address not mapped (1)
[pi:45364] Failing at address: 0x0
[pi:45364] [ 0] 0 libsystem_platform.dylib 0x00007fff7d999f5a _sigtramp + 26
[pi:45364] [ 1] 0 ??? 0x000000002d595060 0x0 + 760828000
[pi:45364] [ 2] 0 mca_rml_oob.so 0x0000000103aeadaf orte_rml_oob_send_buffer_nb + 956
[pi:45364] [ 3] 0 libopen-rte.40.dylib 0x000000010357d0fa pmix_server_log_fn + 449
[pi:45364] [ 4] 0 mca_pmix_pmix2x.so 0x000000010394f6d6 server_log + 857
[pi:45364] [ 5] 0 mca_pmix_pmix2x.so 0x0000000103982d42 pmix_server_log + 1257
[pi:45364] [ 6] 0 mca_pmix_pmix2x.so 0x00000001039731e0 server_message_handler + 5032
[pi:45364] [ 7] 0 mca_pmix_pmix2x.so 0x00000001039a9822 pmix_ptl_base_process_msg + 723
[pi:45364] [ 8] 0 libevent-2.1.6.dylib 0x00000001036b6719 event_process_active_single_queue + 376
[pi:45364] [ 9] 0 libevent-2.1.6.dylib 0x00000001036b3cb3 event_base_loop + 1074
[pi:45364] [10] 0 mca_pmix_pmix2x.so 0x0000000103988ce7 progress_engine + 26
[pi:45364] [11] 0 libsystem_pthread.dylib 0x00007fff7d9a3661 _pthread_body + 340
[pi:45364] [12] 0 libsystem_pthread.dylib 0x00007fff7d9a350d _pthread_body + 0
[pi:45364] [13] 0 libsystem_pthread.dylib 0x00007fff7d9a2bf9 thread_start + 13
[pi:45364] *** End of error message ***
Segmentation fault: 11
--
Oliver