Channel: Mellanox Interconnect Community: Message List

Re: MXM ERROR failed to create send cq: Cannot allocate memory


Good morning, thanks a lot for your answer.

I spent the whole day yesterday trying to troubleshoot this problem. You are right: my system's locked-memory limit was only 64 KB, which is far too small.
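For anyone who wants to check the same thing, here is a minimal Python sketch that reads the locked-memory limit of the current process. The function name `memlock_limit_kb` is made up for illustration. The limit seen by a job started through the scheduler may differ from the one in a login shell, so it is probably best to run it the same way the failing job is run.

```python
import resource


def memlock_limit_kb():
    """Return (soft, hard) RLIMIT_MEMLOCK in KB; None means unlimited."""
    soft, hard = resource.getrlimit(resource.RLIMIT_MEMLOCK)

    def to_kb(value):
        # getrlimit reports bytes; RLIM_INFINITY marks "unlimited".
        return None if value == resource.RLIM_INFINITY else value // 1024

    return to_kb(soft), to_kb(hard)


def describe(value):
    """Format a limit for printing."""
    return "unlimited" if value is None else f"{value} KB"


if __name__ == "__main__":
    soft, hard = memlock_limit_kb()
    print(f"memlock soft limit: {describe(soft)}")
    print(f"memlock hard limit: {describe(hard)}")
```

A soft limit of 64 KB would correspond to the situation described in this post.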

 

I have solved these problems. I had to:

- add knem to the kernel modules that are loaded

- add a line for root to /etc/security/limits.conf. Apparently the MXM memory-related setup runs as root, even when a normal user launches the srun command. The file now contains:

  root - memlock unlimited
  * - memlock unlimited

- chmod 0666 /dev/knem (so that it is usable by anyone)
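
The knem and /dev/knem steps above can be verified with a short sketch like the one below. It assumes a Linux system where loaded modules are listed in /proc/modules; the function names `knem_loaded` and `device_world_rw` are made up for illustration and are not part of MXM or knem.

```python
import os
import stat


def knem_loaded(modules_path="/proc/modules"):
    """Return True if a module named 'knem' is listed in the modules file."""
    try:
        with open(modules_path) as f:
            # The first field of each line is the module name.
            return any(line.split()[0] == "knem" for line in f if line.strip())
    except OSError:
        return False


def device_world_rw(path="/dev/knem"):
    """Return True if path exists and 'other' has read and write permission."""
    try:
        mode = os.stat(path).st_mode
    except OSError:
        return False
    wanted = stat.S_IROTH | stat.S_IWOTH
    return (mode & wanted) == wanted


if __name__ == "__main__":
    print("knem module loaded:", knem_loaded())
    print("/dev/knem usable by anyone:", device_world_rw())
```

Both checks should report True on a node configured as described in the list above. The chmod does not necessarily persist across reboots, so rerunning the check after a restart may be worthwhile.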

 

So now my basic "hello world" program works. I still have other problems, but I think they are related to my programming and not to the cluster configuration.

 

Thanks a lot for your precious help.

Regards,

Andrea

