Good to hear it finally works for you. KNEM only improves large-message transfers between MPI processes within the same node. It must have been the memlock limit you set in /etc/security/limits.conf that made it work. You can mark the question as answered. Thanks for the update!
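
In case it helps anyone else who lands here: below is a minimal sketch for checking which memlock limit a process actually sees. It only uses Python's standard `resource` module on Linux. The helper names `describe` and `memlock_limits` are made up for this example and are not part of KNEM or any MPI library. Run it the same way you launch your MPI job, since the limit is read from the current process and a shell started before limits.conf was edited may still show the old value.

```python
import resource


def describe(limit):
    """Render an rlimit value given in bytes in a readable form."""
    if limit == resource.RLIM_INFINITY:
        return "unlimited"
    return f"{limit // 1024} KiB"


def memlock_limits():
    """Return the (soft, hard) RLIMIT_MEMLOCK of the current process, in bytes."""
    return resource.getrlimit(resource.RLIMIT_MEMLOCK)


if __name__ == "__main__":
    soft, hard = memlock_limits()
    print(f"memlock soft={describe(soft)} hard={describe(hard)}")
```

If the soft limit printed here is still small, the new setting has probably not been picked up by that session yet.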