A torus interconnect is a switch-less network topology for connecting processing nodes in a parallel computer system. interconnect. It is not limited to 8 nodes but can consist of any number of nodes in a similar rectilinear array.
Introduction
In geometry, a torus is created by revolving a circle about an axis coplanar to the circle. While this is a general definition in geometry, the topological properties of this type of shape describes the network topology in its essence.
Geometry illustration
The following images are 1D, and 2D torus. 1D torus is a simple circle, and 2D torus has the shape of doughnut. The animation below illustrates how a 2D torus is generated from a rectangle by connecting its two pairs of opposite edges. Here the concept of torus is used to describe essentially the beginning and ending of a sequence of nodes are connected, like a doughnut. To better illustrate the concept, and understand what the topology means in network interconnect, we give 3 examples of parallel interconnected nodes using torus topology. At one dimension, a torus topology is equivalent to a ring interconnect network, of a shape of a circle. At 2D, it's equivalent to a 2D mesh, but with extra connection at the edge nodes, which is the definition of 2D torus.
Torus network topology
We can generalize the rule from the figures above. Torus interconnect is a switch-less topology that can be seen as a mesh interconnect with nodes arranged in a rectilinear array of N = 2, 3, or more dimensions, with processors connected to their nearest neighbors, and corresponding processors on opposite edges of the array connected.Torus interconnect#cite note-1| In this lattice, each node has 2N connections. This topology got the name from the fact that the lattice formed in this way is topologically homogeneous to an N-dimensional torus.
Visualization
The first 3 dimensions of torus network topology are easier to visualize and are described below:
1D Torus: it is one dimension, n nodes are connected in closed loop with each node connected to its 2 nearest neighbors communication can take place in 2 directions, +x and −x. 1D torus is same as ring interconnection.
2D Torus: it is two dimension with degree of 4, the nodes are imagined laid out in a two-dimensional rectangular lattice of n rows and n columns, with each node connected to its 4 nearest neighbors, and corresponding nodes on opposite edges connected. The connection of opposite edges can be visualized by rolling the rectangular array into a "tube" to connect two opposite edges and then bending the "tube" into a torus to connect the other two. communication can take place in 4 directions, +x, −x, +y, and −y. The total nodes of 2D Torus is n2
3D Torus: it is three dimension, the nodes are imagined in a three-dimensional lattice in the shape of a rectangular prism, with each node connected with its 6 neighbors, with corresponding nodes on opposing faces of the array connected. Each edge consists of n nodes. communication can take place in 6 directions, +x, −x, +y, −y, +z, −z. Each edge of 3D Torus consist of n nodes. The total nodes of 3D Torus is n3
ND Torus: it can have N dimension, each node of N dimension torus has 2N neighbors, communication can take place in 2N directions. Each edge is consist of n nodes. Total nodes of this torus is nN. The main motivation of having higher dimension of torus is to achieve higher bandwidth, lower latency, and higher scalability.
Higher-dimensional arrays are difficult to visualize but we can see from above rule that each higher dimension adds another pair of nearest neighbor connections to each node.
Performance
A number of supercomputers on the TOP500 list use three-dimensional torus networks, e.g. IBM's Blue Gene/L and Blue Gene/P, and the Cray XT3. IBM's Blue Gene/Q uses a five-dimensional torus network. Fujitsu's K computer and the PRIMEHPC FX10 use a proprietary three-dimensional torus 3D mesh interconnect called Tofu.
Sandeep Palur and Dr. Ioan Raicu from Illinois Institute of Technology conducted experiments to simulate 3D torus performance. Their experiments ran on a computer with 250GB RAM, 48 cores and x86_64 architecture. The simulator they used was ROSS. They mainly focused on three aspects 1. Varying network size 2. Varying number of servers 3. Varying message size. They concluded that throughput decreases with the increase of servers and network size. Otherwise, throughput increases with the increase of message size.
6D Torus product performance
Limited developed a 6D torus computer model called "Tofu". In their model, a 6D torus can achieve 100 GB/s off-chip bandwidth, 12 times higher scalability than a 3D torus, and high fault tolerance. The model is used in the K computer and Fugaku.