Eric Chamberland
2017-04-25 18:39:39 UTC
Hi,
just testing the 3.x branch... I launch:
mpirun -n 8 echo "hello"
and I get:
--------------------------------------------------------------------------
There are not enough slots available in the system to satisfy the 8 slots
that were requested by the application:
echo
Either request fewer slots for your application, or make more slots
available
for use.
--------------------------------------------------------------------------
I have to oversubscribe, so what do I have to do to bypass this
"limitation"?
Thanks,
Eric
configure log:
http://www.giref.ulaval.ca/~cmpgiref/ompi_3.x/2017.04.25.10h46m08s_config.log
http://www.giref.ulaval.ca/~cmpgiref/ompi_3.x/2017.04.25.10h46m08s_ompi_info_all.txt
here is the complete message:
[zorg:30036] [[INVALID],INVALID] plm:rsh_lookup on agent ssh : rsh path NULL
[zorg:30036] plm:base:set_hnp_name: initial bias 30036 nodename hash
810220270
[zorg:30036] plm:base:set_hnp_name: final jobfam 49136
[zorg:30036] [[49136,0],0] plm:rsh_setup on agent ssh : rsh path NULL
[zorg:30036] [[49136,0],0] plm:base:receive start comm
[zorg:30036] [[49136,0],0] plm:base:setup_job
[zorg:30036] [[49136,0],0] plm:base:setup_vm
[zorg:30036] [[49136,0],0] plm:base:setup_vm creating map
[zorg:30036] [[49136,0],0] setup:vm: working unmanaged allocation
[zorg:30036] [[49136,0],0] using default hostfile
/opt/openmpi-3.x_debug/etc/openmpi-default-hostfile
[zorg:30036] [[49136,0],0] plm:base:setup_vm only HNP in allocation
[zorg:30036] [[49136,0],0] plm:base:setting slots for node zorg by cores
[zorg:30036] [[49136,0],0] complete_setup on job [49136,1]
[zorg:30036] [[49136,0],0] plm:base:launch_apps for job [49136,1]
--------------------------------------------------------------------------
There are not enough slots available in the system to satisfy the 8 slots
that were requested by the application:
echo
Either request fewer slots for your application, or make more slots
available
for use.
--------------------------------------------------------------------------
[zorg:30036] [[49136,0],0] plm:base:orted_cmd sending orted_exit commands
[zorg:30036] [[49136,0],0] plm:base:receive stop comm
just testing the 3.x branch... I launch:
mpirun -n 8 echo "hello"
and I get:
--------------------------------------------------------------------------
There are not enough slots available in the system to satisfy the 8 slots
that were requested by the application:
echo
Either request fewer slots for your application, or make more slots
available
for use.
--------------------------------------------------------------------------
I have to oversubscribe, so what do I have to do to bypass this
"limitation"?
Thanks,
Eric
configure log:
http://www.giref.ulaval.ca/~cmpgiref/ompi_3.x/2017.04.25.10h46m08s_config.log
http://www.giref.ulaval.ca/~cmpgiref/ompi_3.x/2017.04.25.10h46m08s_ompi_info_all.txt
here is the complete message:
[zorg:30036] [[INVALID],INVALID] plm:rsh_lookup on agent ssh : rsh path NULL
[zorg:30036] plm:base:set_hnp_name: initial bias 30036 nodename hash
810220270
[zorg:30036] plm:base:set_hnp_name: final jobfam 49136
[zorg:30036] [[49136,0],0] plm:rsh_setup on agent ssh : rsh path NULL
[zorg:30036] [[49136,0],0] plm:base:receive start comm
[zorg:30036] [[49136,0],0] plm:base:setup_job
[zorg:30036] [[49136,0],0] plm:base:setup_vm
[zorg:30036] [[49136,0],0] plm:base:setup_vm creating map
[zorg:30036] [[49136,0],0] setup:vm: working unmanaged allocation
[zorg:30036] [[49136,0],0] using default hostfile
/opt/openmpi-3.x_debug/etc/openmpi-default-hostfile
[zorg:30036] [[49136,0],0] plm:base:setup_vm only HNP in allocation
[zorg:30036] [[49136,0],0] plm:base:setting slots for node zorg by cores
[zorg:30036] [[49136,0],0] complete_setup on job [49136,1]
[zorg:30036] [[49136,0],0] plm:base:launch_apps for job [49136,1]
--------------------------------------------------------------------------
There are not enough slots available in the system to satisfy the 8 slots
that were requested by the application:
echo
Either request fewer slots for your application, or make more slots
available
for use.
--------------------------------------------------------------------------
[zorg:30036] [[49136,0],0] plm:base:orted_cmd sending orted_exit commands
[zorg:30036] [[49136,0],0] plm:base:receive stop comm