Re: Trunks, pvlans, infiniband world
Hi Daniel,you have no loops in this setup ...in any case, I suggest you have a look here: Designing an HPC Cluster with Mellanox InfiniBand Solutions And maybe start with this: Understanding Up/Down...
View ArticleRe: Windows: Network cable unplugged
I was having the same problem, but now the next question is:Where is the best place to run subnet manager? A standalone server? Server should have IB connected directly to it? What about running it...
View ArticleRe: Network topo for multi MPI clusters and one storage cluster?
Hello John,I looking at the pdf; design 1 should work. The independent clusters MPI traffic should remain local. Design 2 could be modified for storage high availability if you wanted to go this...
View ArticleRe: Network topo for multi MPI clusters and one storage cluster?
Hi Scot,Many thanks. We'll probably go with design #1 and use default SM as you suggest.-John
View ArticleRe: Network topo for multi MPI clusters and one storage cluster?
Please let me know how it works – should be ideal config. Regards,Scot SchultzDirector, HPC and Technical ComputingMellanox Technologies350 Oakmead Parkway, Suite 100, Sunnyvale CA, 94085Office:...
View ArticleRe: Trunks, pvlans, infiniband world
Hi, there will be 3x chassis at first with 16 blades in each Each blade will have a duel port 40g mezz card connecting to the chassis 40g switches to get external ssd San storage I'll be using srp...
View ArticleTEST - IGNORE : MLNX OFED 3.2 centos 7.2 with RT kernel error
Hello.I've updated my 7.1 centos to 7.2 and got a new kernelNext I compiled and installed RT kernel like in this article: How to build the CentOS 7 RT kernel - Hardware - WikiI need to say that I...
View ArticleConnectX-4 CX456A does not work with opensm
I have two servers each installed with a ConnectX-4 VPI 100Gb NIC (model:CX456A,two ports). The two ports are connected back to back using two copper cable. I have no problem when the two ports are set...
View ArticleRe: ConnectX-4 CX456A does not work with opensm
I also tried it with SB7700 IB switch. The configuration shows that the subnet manager is enabled:=================================================================SB7700-IB-100Gb [standalone: master]...
View ArticleRe: ConnectX-4 CX456A does not work with opensm
Hi Weijia, Can you please provide from the switch the following outputs: >show interface ib 1/1 transceiver>show interface ib 1/2 transceiver>show images Can you also change the second port to...
View ArticleRe: ConnectX-4 CX456A does not work with opensm
my 2c, the issue is not with the subnet manger, issue is that the physical link between the 2 servers (in the b2b setup) or between the servers to the switch (in the switch setup) is not linking up...
View ArticleRe: MHQH19B-XTR - MFE_NO_FLASH_DETECTED
Hi Francesco Ghini, Please open a case to Mellanox support by sending an email to support@mellanox.com.This issue cannot be fixed with a simple workaround and a support assistence is needed here. Best...
View ArticleRe: Can VMA and DPDK be used together?
Hi,These are completely two different products, one is a Network stack (VMA) which emulates RDMA over kernel sockets and the other one is a user splace accelerator software which accelerates processes...
View ArticleRe: ConnectX-4 CX456A does not work with opensm
Thank you Sophie, here is the result I get:SB7700-IB-100Gb [standalone: master] # show interface ib 1/1 transceiverIB1/1 state: Unknown cable. identifier : (0x11)...
View ArticleRe: ConnectX-4 CX456A does not work with opensm
Thank you Eddie for the thoughts, I'm sure the physical link is corerctly linked up because the Ethernet mode is working without touching the hardware.
View ArticleRe: ConnectX-4 CX456A does not work with opensm
I think you have ethernet cable that can't support IB mode. Could you check cable model in CLI?
View ArticleRe: Error in ipoib
Hi Sophie Naudin1. ofed_info | head -1MLNX_OFED_LINUX-3.1-1.0.3 (OFED-3.1-1.0.3):2. yum erase iptables - You're right, the firewallwasin the system. If I use IPoIB, I needrdmamodules in the system?
View Article