Sebastian Antunez N.
2016-11-19 23:22:35 UTC
Hello Guys
I have a cluster of HPC and I update OFED, Firmware etc.
Post reboot and run mpirun -machinefile nodes8 -n 128
/home/HPL/run_hpl/xhpl show the following error
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
This usually is caused by:
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp
(--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
Before update I have version 1.6.4 and the cluster not show errors when I
run the mpirun
I changed the Enviroment Variables but persist the error.
Is possible ypur comment who resolved the issue.
Regards
Sebastian Antunez
I have a cluster of HPC and I update OFED, Firmware etc.
Post reboot and run mpirun -machinefile nodes8 -n 128
/home/HPL/run_hpl/xhpl show the following error
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
bash: /usr/mpi/gcc/openmpi-1.8.8/bin/orted: No such file or directory
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
This usually is caused by:
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp
(--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
Before update I have version 1.6.4 and the cluster not show errors when I
run the mpirun
I changed the Enviroment Variables but persist the error.
Is possible ypur comment who resolved the issue.
Regards
Sebastian Antunez