Discussion:
[OMPI users] Problem with 'orted: command not found'
Maciek Lewiński
2016-05-02 21:36:11 UTC
Permalink
Hi, I'm having problem with Open MPI version 1.10.2.
I've installed two virtual machines on VirtualBox, both are the same images
of Ubuntu 12.04 64bit.
Both have the same accounts, both have everything configured almost exactly
the same.
I have configured OMPI only with the --prefix to specify my location of
install folder which is /home/$USER/.openmpi.
Users on both machines are identical.

On both machines running mpirun on one of the example programs work
perfectly, on both machines I've added bin and lib to corresponding PATHs,
env command ran on master and through ssh on slave1 gives these results:
***@osboxes:~/cloud$ env | grep PATH
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/osboxes/.openmpi/lib
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/osboxes/.openmpi/bin

On both hosts I have installed ssh which works, I've enabled passwordless
connection, which again, works,

I have created an NFS folder in which I hold the hello_c program from
examples and hosts file that looks like this:
192.168.0.191 master
192.168.0.190 slave1

The same two lines are placed in /etc/hosts for ssh to work.

On both machines, running:
***@osboxes:~/cloud$ mpirun -np 1 ./hello_c
Hello, world, I am 0 of 1, (Open MPI v1.10.2, package: Open MPI
***@osboxes Distribution, ident: 1.10.2, repo rev:
v1.10.1-145-g799148f, Jan 21, 2016, 126)

Works, even running this command on slave1 through ssh from master works as
expected.

Yet when I try to execute the following command I get the error:
***@osboxes:~/cloud$ mpirun -np 2 --hostfile hosts ./hello_c
bash: orted: command not found
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
This usually is caused by:

* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default

* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.

* the inability to write startup files into /tmp
(--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.

* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.

* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
--------------------------------------------------------------------------

The same happens when I use -host master,slave1 instead of --hostfile
hosts. I'm slowly running out if ideas, I've tried anything I could find on
the internet or OMPI FAQ, nothing seems to work. What am I doing wrong?
Jeff Squyres (jsquyres)
2016-05-02 21:40:53 UTC
Permalink
The key is this error:

bash: orted: command not found

Meaning: you need to set your PATH and LD_LIBRARY_PATH properly for non-interactive logins. See https://www.open-mpi.org/faq/?category=running#adding-ompi-to-path.
Post by Maciek Lewiński
Hi, I'm having problem with Open MPI version 1.10.2.
I've installed two virtual machines on VirtualBox, both are the same images of Ubuntu 12.04 64bit.
Both have the same accounts, both have everything configured almost exactly the same.
I have configured OMPI only with the --prefix to specify my location of install folder which is /home/$USER/.openmpi.
Users on both machines are identical.
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/osboxes/.openmpi/lib
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/osboxes/.openmpi/bin
On both hosts I have installed ssh which works, I've enabled passwordless connection, which again, works,
192.168.0.191 master
192.168.0.190 slave1
The same two lines are placed in /etc/hosts for ssh to work.
Works, even running this command on slave1 through ssh from master works as expected.
bash: orted: command not found
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
--------------------------------------------------------------------------
The same happens when I use -host master,slave1 instead of --hostfile hosts. I'm slowly running out if ideas, I've tried anything I could find on the internet or OMPI FAQ, nothing seems to work. What am I doing wrong?
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29068.php
--
Jeff Squyres
***@cisco.com
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/
Maciek Lewiński
2016-05-02 22:14:45 UTC
Permalink
I already had correct paths in .bashrc:

export
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/$USER/.openmpi/bin

export
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/$USER/.openmpi/lib

I can run mpi normally from slave1 so I'm sure they work. I also exported
these paths in .profile just to be sure a moment before but it didn't work.
Still the same error.
Post by Maciek Lewiński
bash: orted: command not found
Meaning: you need to set your PATH and LD_LIBRARY_PATH properly for
non-interactive logins. See
https://www.open-mpi.org/faq/?category=running#adding-ompi-to-path.
Post by Maciek Lewiński
Hi, I'm having problem with Open MPI version 1.10.2.
I've installed two virtual machines on VirtualBox, both are the same
images of Ubuntu 12.04 64bit.
Post by Maciek Lewiński
Both have the same accounts, both have everything configured almost
exactly the same.
Post by Maciek Lewiński
I have configured OMPI only with the --prefix to specify my location of
install folder which is /home/$USER/.openmpi.
Post by Maciek Lewiński
Users on both machines are identical.
On both machines running mpirun on one of the example programs work
perfectly, on both machines I've added bin and lib to corresponding PATHs,
Post by Maciek Lewiński
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/osboxes/.openmpi/lib
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/osboxes/.openmpi/bin
Post by Maciek Lewiński
On both hosts I have installed ssh which works, I've enabled
passwordless connection, which again, works,
Post by Maciek Lewiński
I have created an NFS folder in which I hold the hello_c program from
192.168.0.191 master
192.168.0.190 slave1
The same two lines are placed in /etc/hosts for ssh to work.
Hello, world, I am 0 of 1, (Open MPI v1.10.2, package: Open MPI
v1.10.1-145-g799148f, Jan 21, 2016, 126)
Post by Maciek Lewiński
Works, even running this command on slave1 through ssh from master works
as expected.
Post by Maciek Lewiński
bash: orted: command not found
--------------------------------------------------------------------------
Post by Maciek Lewiński
ORTE was unable to reliably start one or more daemons.
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp
(--tmpdir/orte_tmpdir_base).
Post by Maciek Lewiński
Please check with your sys admin to determine the correct location to
use.
Post by Maciek Lewiński
* compilation of the orted with dynamic libraries when static are
required
Post by Maciek Lewiński
(e.g., on Cray). Please check your configure cmd line and consider
using
Post by Maciek Lewiński
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
--------------------------------------------------------------------------
Post by Maciek Lewiński
The same happens when I use -host master,slave1 instead of --hostfile
hosts. I'm slowly running out if ideas, I've tried anything I could find on
the internet or OMPI FAQ, nothing seems to work. What am I doing wrong?
Post by Maciek Lewiński
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29068.php
--
Jeff Squyres
http://www.cisco.com/web/about/doing_business/legal/cri/
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29069.php
Jeff Squyres (jsquyres)
2016-05-03 01:19:03 UTC
Permalink
Make sure you check that these paths are set for *non-interactive* logins.
export PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/$USER/.openmpi/bin
export LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/$USER/.openmpi/lib
I can run mpi normally from slave1 so I'm sure they work. I also exported these paths in .profile just to be sure a moment before but it didn't work. Still the same error.
bash: orted: command not found
Meaning: you need to set your PATH and LD_LIBRARY_PATH properly for non-interactive logins. See https://www.open-mpi.org/faq/?category=running#adding-ompi-to-path.
Post by Maciek Lewiński
Hi, I'm having problem with Open MPI version 1.10.2.
I've installed two virtual machines on VirtualBox, both are the same images of Ubuntu 12.04 64bit.
Both have the same accounts, both have everything configured almost exactly the same.
I have configured OMPI only with the --prefix to specify my location of install folder which is /home/$USER/.openmpi.
Users on both machines are identical.
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/osboxes/.openmpi/lib
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/osboxes/.openmpi/bin
On both hosts I have installed ssh which works, I've enabled passwordless connection, which again, works,
192.168.0.191 master
192.168.0.190 slave1
The same two lines are placed in /etc/hosts for ssh to work.
Works, even running this command on slave1 through ssh from master works as expected.
bash: orted: command not found
--------------------------------------------------------------------------
ORTE was unable to reliably start one or more daemons.
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
--------------------------------------------------------------------------
The same happens when I use -host master,slave1 instead of --hostfile hosts. I'm slowly running out if ideas, I've tried anything I could find on the internet or OMPI FAQ, nothing seems to work. What am I doing wrong?
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29068.php
--
Jeff Squyres
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29069.php
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29070.php
--
Jeff Squyres
***@cisco.com
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/
Maciek Lewiński
2016-05-03 09:27:40 UTC
Permalink
Thank you! I have set up my env paths at the end of the script and thanks
to you I just noticed that at the beginning of the bashrc script there's a
simple IF that returns when it's opened as non-interactive. I moved my
exports above it and it finally works. Again, thank you very much.
Post by Jeff Squyres (jsquyres)
Make sure you check that these paths are set for *non-interactive* logins.
Post by Maciek Lewiński
export
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/$USER/.openmpi/bin
Post by Maciek Lewiński
export
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/$USER/.openmpi/lib
Post by Maciek Lewiński
I can run mpi normally from slave1 so I'm sure they work. I also
exported these paths in .profile just to be sure a moment before but it
didn't work. Still the same error.
Post by Maciek Lewiński
bash: orted: command not found
Meaning: you need to set your PATH and LD_LIBRARY_PATH properly for
non-interactive logins. See
https://www.open-mpi.org/faq/?category=running#adding-ompi-to-path.
Post by Maciek Lewiński
Post by Maciek Lewiński
Hi, I'm having problem with Open MPI version 1.10.2.
I've installed two virtual machines on VirtualBox, both are the same
images of Ubuntu 12.04 64bit.
Post by Maciek Lewiński
Post by Maciek Lewiński
Both have the same accounts, both have everything configured almost
exactly the same.
Post by Maciek Lewiński
Post by Maciek Lewiński
I have configured OMPI only with the --prefix to specify my location
of install folder which is /home/$USER/.openmpi.
Post by Maciek Lewiński
Post by Maciek Lewiński
Users on both machines are identical.
On both machines running mpirun on one of the example programs work
perfectly, on both machines I've added bin and lib to corresponding PATHs,
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/osboxes/.openmpi/lib
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/osboxes/.openmpi/bin
Post by Maciek Lewiński
Post by Maciek Lewiński
On both hosts I have installed ssh which works, I've enabled
passwordless connection, which again, works,
Post by Maciek Lewiński
Post by Maciek Lewiński
I have created an NFS folder in which I hold the hello_c program from
192.168.0.191 master
192.168.0.190 slave1
The same two lines are placed in /etc/hosts for ssh to work.
Hello, world, I am 0 of 1, (Open MPI v1.10.2, package: Open MPI
v1.10.1-145-g799148f, Jan 21, 2016, 126)
Post by Maciek Lewiński
Post by Maciek Lewiński
Works, even running this command on slave1 through ssh from master
works as expected.
Post by Maciek Lewiński
Post by Maciek Lewiński
bash: orted: command not found
--------------------------------------------------------------------------
Post by Maciek Lewiński
Post by Maciek Lewiński
ORTE was unable to reliably start one or more daemons.
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp
(--tmpdir/orte_tmpdir_base).
Post by Maciek Lewiński
Post by Maciek Lewiński
Please check with your sys admin to determine the correct location
to use.
Post by Maciek Lewiński
Post by Maciek Lewiński
* compilation of the orted with dynamic libraries when static are
required
Post by Maciek Lewiński
Post by Maciek Lewiński
(e.g., on Cray). Please check your configure cmd line and consider
using
Post by Maciek Lewiński
Post by Maciek Lewiński
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
--------------------------------------------------------------------------
Post by Maciek Lewiński
Post by Maciek Lewiński
The same happens when I use -host master,slave1 instead of --hostfile
hosts. I'm slowly running out if ideas, I've tried anything I could find on
the internet or OMPI FAQ, nothing seems to work. What am I doing wrong?
Post by Maciek Lewiński
Post by Maciek Lewiński
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29068.php
Post by Maciek Lewiński
--
Jeff Squyres
http://www.cisco.com/web/about/doing_business/legal/cri/
Post by Maciek Lewiński
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29069.php
Post by Maciek Lewiński
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29070.php
--
Jeff Squyres
http://www.cisco.com/web/about/doing_business/legal/cri/
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29071.php
Gilles Gouaillardet
2016-05-03 01:23:25 UTC
Permalink
If OpenMPI is installed at the same path on every node, the easiest optin
is to re-configure with
--enable-mpirun-prefix-by-default
an other option is to use
`which mpirun` instead of mpirun
and yet an other option is to
mpirun --prefix=$USER/.openmpi

Cheers,

Gilles
Post by Maciek Lewiński
export
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/$USER/.openmpi/bin
export
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/$USER/.openmpi/lib
I can run mpi normally from slave1 so I'm sure they work. I also exported
these paths in .profile just to be sure a moment before but it didn't work.
Still the same error.
Post by Maciek Lewiński
bash: orted: command not found
Meaning: you need to set your PATH and LD_LIBRARY_PATH properly for
non-interactive logins. See
https://www.open-mpi.org/faq/?category=running#adding-ompi-to-path.
Post by Maciek Lewiński
Hi, I'm having problem with Open MPI version 1.10.2.
I've installed two virtual machines on VirtualBox, both are the same
images of Ubuntu 12.04 64bit.
Post by Maciek Lewiński
Both have the same accounts, both have everything configured almost
exactly the same.
Post by Maciek Lewiński
I have configured OMPI only with the --prefix to specify my location of
install folder which is /home/$USER/.openmpi.
Post by Maciek Lewiński
Users on both machines are identical.
On both machines running mpirun on one of the example programs work
perfectly, on both machines I've added bin and lib to corresponding PATHs,
LD_LIBRARY_PATH=:/usr/local/lib:/usr/local/lib:/home/osboxes/.openmpi/lib
PATH=/usr/lib/lightdm/lightdm:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/bin:/usr/local/bin:/home/osboxes/.openmpi/bin
Post by Maciek Lewiński
On both hosts I have installed ssh which works, I've enabled
passwordless connection, which again, works,
Post by Maciek Lewiński
I have created an NFS folder in which I hold the hello_c program from
192.168.0.191 master
192.168.0.190 slave1
The same two lines are placed in /etc/hosts for ssh to work.
Hello, world, I am 0 of 1, (Open MPI v1.10.2, package: Open MPI
v1.10.1-145-g799148f, Jan 21, 2016, 126)
Post by Maciek Lewiński
Works, even running this command on slave1 through ssh from master
works as expected.
Post by Maciek Lewiński
bash: orted: command not found
--------------------------------------------------------------------------
Post by Maciek Lewiński
ORTE was unable to reliably start one or more daemons.
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp
(--tmpdir/orte_tmpdir_base).
Post by Maciek Lewiński
Please check with your sys admin to determine the correct location to
use.
Post by Maciek Lewiński
* compilation of the orted with dynamic libraries when static are
required
Post by Maciek Lewiński
(e.g., on Cray). Please check your configure cmd line and consider
using
Post by Maciek Lewiński
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
--------------------------------------------------------------------------
Post by Maciek Lewiński
The same happens when I use -host master,slave1 instead of --hostfile
hosts. I'm slowly running out if ideas, I've tried anything I could find on
the internet or OMPI FAQ, nothing seems to work. What am I doing wrong?
Post by Maciek Lewiński
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29068.php
--
Jeff Squyres
http://www.cisco.com/web/about/doing_business/legal/cri/
_______________________________________________
users mailing list
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
http://www.open-mpi.org/community/lists/users/2016/05/29069.php
Loading...