• Networking (Thursday 10:30AM-Noon)
    Room A110/112
    Chair: Steven Lumetta, University of Illinois

    • Title: EMP: Zero-copy OS-bypass NIC-driven Gigabit Ethernet Message Passing
    • Authors:
      Piyush Shivam (Ohio State University)
      Pete Wyckoff (Ohio Supercomputer Center)
      Dhabaleswar Panda (Ohio State University)
    • Abstract:
      Modern interconnects like Myrinet and Gigabit Ethernet offer Gb/s speeds, which has put the onus of reducing communication latency on messaging software. This has led to the development of OS-bypass protocols, which remove the kernel from the critical path and hence reduce end-to-end latency. With the advent of programmable NICs, many aspects of protocol processing can be offloaded from user space to the NIC, leaving the host processor to dedicate more cycles to the application. Many host-offload messaging systems exist for Myrinet; however, nothing similar exists for Gigabit Ethernet. In this paper we propose Ethernet Message Passing (EMP), a completely new zero-copy, OS-bypass messaging layer for Gigabit Ethernet on Alteon NICs where the entire protocol processing is done at the NIC. This messaging system delivers very good performance (latency of 23 us, and throughput of 880 Mb/s). To the best of our knowledge, this is the first NIC-level implementation of a zero-copy message passing layer for Gigabit Ethernet.

    • Title: Design and implementation of FMPL, a fast message-passing library for remote memory operations
    • Authors:
      Osamu Tatebe (National Institute of Advanced Industrial Science and Technology)
      Umpei Nagashima (National Institute of Advanced Industrial Science and Technology)
      Satoshi Sekiguchi (National Institute of Advanced Industrial Science and Technology)
      Hisayoshi Kitabayashi (Hitachi Business Solution)
      Yoshiyuki Hayashida (Hitachi, Ltd.)
    • Abstract:
      A fast message-passing library, FMPL, has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization, and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8 usec, while MPI achieves 20 usec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.

    • Title: A New Routing Mechanism for Networks with Irregular Topology
    • Authors:
      V. Puente (University of Cantabria)
      J. A. Gregorio (University of Cantabria)
      R. Beivide (University of Cantabria)
      F. Vallejo (University of Cantabria)
      A. Ibañez (University of Cantabria)
    • Abstract:
      By selecting a pseudo-Hamiltonian cycle in any irregular network and applying a restricted packet injection mechanism to avoid exhausting storage resources, we have developed and tested a new fully adaptive routing algorithm. Our new routing mechanism outperforms the most relevant routing proposals for networks with irregular topology. In all the tested cases a significant improvement has been obtained. The largest gains were obtained for large networks. For a 512-node network, uniform traffic, and virtual cut-through flow control, our mechanism can outperform, in some cases, the classic up*/down* algorithm by almost a factor of 2.