Hello John,
I looking at the pdf; design 1 should work. The independent clusters MPI traffic should remain local. Design 2 could be modified for storage high availability if you wanted to go this route, but is not needed otherwise.
Regarding the routing algorithm should just use out of the box SM; I don’t see any other option here to improve upon with this configuration.