[OMPI users] Exhausting QPs?

Nathan Hjelm

2018-03-14 04:21:02 UTC

Yalla works because MXM defaults to using unconnected datagrams (I donât think it uses RC unless you ask). Is this a fully connected algorithm? I ask because (3584 - 28) * 28 * 3 (default number of QPs/remote process in btl/openib) = 298704 > 262144. This is the problem with RC. Mellanox solved it by adding another protocol on mlx5 systems called DC. The openib btl does not (and probably never will) support DC. The recommended path is to use OpenUCX. That is effectively the replacement for ibverbs in the long run.

-Nathan

Post by Ben Menadue
Hi,
--------------------------------------------------------------------------
A process failed to create a queue pair. This usually means either
the device has run out of queue pairs (too many connections) or
there are insufficient resources available to allocate a queue pair
(out of memory). The latter can happen if either 1) insufficient
memory is available, or 2) no more physical memory can be registered
with the device.
http://www.open-mpi.org/faq/?category=openfabrics#ib-locked-pages
Local host: r3735
Local device: mlx5_0
Queue pair type: Reliable connected (RC)
--------------------------------------------------------------------------
[347071.005636] mlx5_core 0000:06:00.0: mlx5_cmd_check:727:(pid 31507): CREATE_QP(0x500) op_mod(0x0) failed, status bad resource(0x5), syndrome (0x65b500)
Iâm pretty sure 0x65b500 means "out of queue pairsâ.
Our HCAs support 262144 QPs, and while some of these will be used for e.g. IPoIB and Lustre, I wouldnât expect to be running out at such a low number of cores â and indeed, Iâve run much larger jobs without seeing this issue.
This is using the 1.10 series, with the ob1 PML with the openib BTL. If they use Yalla, it works fine, but it would still be good to get it working using the âstandardâ communication path, without needing the accelerators.
I was wondering if anyone seen this before, and if anyone had any suggestions for how to proceed?
Thanks,
Ben
_______________________________________________
users mailing list
https://lists.open-mpi.org/mailman/listinfo/users