[Product Innovation]
10-Gbit Ethernet Switching Processor Blasts Barriers To The Backbone
With 12 1-Gbit ports and one 10-Gbit port, this device places enterprise networks and ISPs firmly on top of the Internet backbone.
The incessant drive for faster, lower-cost connections to the Internet backbone has set the stage for a struggle between the local-area network (LAN) and the wide-area network (WAN) to achieve the heart and soul of the metropolitan-area network (MAN). On one side, the WAN wants to keep the installed base of OC-192-based protocols and systems intact. At the same time, enterprise networks and Internet service providers (ISPs) on the other side wish to leverage the high-volume, low-cost, well-known Ethernet technology that forms the basis of much of their infrastructure. Doing so allows them to migrate their Ethernet systems to 10-Gbit backbone connections at minimal cost, without having to drastically modify their existing network.
While the IEEE's High-Speed Study Group (HSSG) sorts through the myriad of political and technological issues involved in defining the physical links required, semiconductor manufacturers like Allayer Communications are hard at work readying silicon solutions for the back end. These ICs will provide the necessary network switching and processing that enterprises and ISPs are demanding. Such devices will allow the migration of 10/100-Mbit Ethernet to full 1-Gbit networks, which can then be connected directly to the Internet backbone over OC-192, or switched Ethernet over the MAN (at 10 Gbits/s). This will drastically improve ISP performance. It also will speed the delivery of popular, high-content multimedia web pages now being designed, while ensuring that the enterprise has a high-speed migration path.
The latest device from Allayer, the AL1032 switching processor, comprises 12 1-Gbit Ethernet ports with one 10-Gbit port. It's designed to connect Ethernet to the OC-192 optical backbone via the XGbitMII (XGMII), an already defined media-independent interface (Fig. 1). The AL1032 is the first device for multinode network switches to provide direct access to the Internet over an OC-192 optical connection. On-board flow-control ensures a steady 10-Gbit output, though in reality, the network traffic tends to be bursty. This means it isn't likely that all 12 ports will be at full throttle at any given moment.
Of course, the way that the AL1032 works sounds deceptively easy. Gigabit Ethernet is well defined, and the OC-192, 10-Gbit optical links to the backbone are readily available. It's just a matter of de-ciding which one to use. The problem, however, arises when multiple Gigabit Ethernet ports converge on a single chip to increase integration and, thus, save on cost and space. There, they must be processed, switched, and routed via a 10-Gbit port to the backbone.
The sheer enormity of the data coming in and going out has brought many chip designs to their knees, as a lot more than simply switching must take place. Packet processing occurs that requires packet classification via policy filters, packet identification, and tagging, while routing requires support for multicasting and virtual LANs. Other featureslink aggregation and control, support of network management functions, and class-of-service queuingmust be provided, too. All of this has to be done at wire speed and in real time if latency issues are to be avoided.
To date, a number of approaches have been tried. But for various reasons, none has succeeded. For example, one shared-memory approach achieves the high-speed output, but the interconnections between chips that allow scaling to multiple ports aren't defined. Another approach on the market has the required modularity to scale the ports, but it can't provide the high-speed link. Allayer leverages an advanced memory technology and policy engine, along with proprietary hashing algorithms for the lookup table, in order to achieve the modularity and high speed that's required.
How Does It Do That? Data coming into the AL1032's 12 ports is immediately stored in a receiver FIFO. Then, the required packet information is sent to the parser registers, which extract the address and class-of-service fields and forward them to the lookup engine (Fig. 1, again). The lookup engine does a comparison to the internal, 32-kbyte, media-access-control (MAC) address table. That's loaded at power up and updated in real time as the chip processes data.
The address table and the virtual-LAN (VLAN) database determine the destination port for a given frame. Multicast MAC addresses, including IP multicasts, can also be stored and searched. The device supports both port-based and tagged (802.1q and 802.3ac) VLAN lookup. It also supports 4-kbyte VLAN addresses with the 802.1's multiple spanning-tree option as well as the 802.1v VLAN classification by protocol and port, with flexible and programmable ingress and egress checking rules for VLAN processing.
VLAN support is key because it allows groups of users to be programmed to exclude multicast traffic from other ports. This improves efficiency as the network doesn't have the overhead and security concerns of having to listen to the traffic intended for other users.
To perform address lookup in real time at these speeds, Allayer had to come up with extremely efficient hashing algorithms. According to David Wong, director of marketing at Allayer, "Everyone can put this kind of speed into a chip. The question is, 'How fast can you do the lookup?'" He goes on to explain that "the first generation of switches out there were all fast enough to switch. The problem was that they couldn't read the table fast enough."
Approaches to date have used an external content-addressable memory (CAM). Typically, that's fast because it avoids the processing that hashing algorithms require. Given a fast-enough algorithm, however, that reduces an address into a short-enough key (such as 16 bits versus 64 bits), and the tradeoff in processing time falls in favor of hashing. In addition, it's important to note that the algorithms are processed in hardware, using state machines to improve performance.