We modeled in [1] how end-to-end latency varies with data passing semantics and scales with CPU, network, and memory speeds. We validated our model on computers with various CPU and memory speeds connected by the CreditNet ATM network at 155 Mbps.
This work validates the scaling model of [1] with respect to network speeds. We report end-to-end latencies over the CreditNet ATM network at 513 Mbps and compare them with those predicted by the scaling model from the measurements at 155 Mbps [1].
Figure 1 : End-to-end latency for AAL5 packets at 513 Mbps raw transmission rate, with early demultiplexing.
Table 1 shows the least-squares linear fit of the curves of Figure 1, along with the latencies estimated from Table 6 of [1] using the scaling model. The throughput for single 60 KB AAL5 packets predicted by the scaling model is 132, 348, or 384 Mbps for copy, emulated copy, or emulated share semantics, respectively. The corresponding measured throughputs were 136, 341, and 380 Mbps (Note that the latter two are comparable to the main memory copy bandwidth of the computers used, 351 Mbps).
---------------+---+--------------------------
Semantics | | Early demultiplexing
---------------+---+--------------------------
Copy | E | 0.0581 B + 141
| A | 0.0561 B + 129
---------------+---+--------------------------
Emulated copy | E | 0.0205 B + 153
| A | 0.0212 B + 139
---------------+---+--------------------------
Emulated share | E | 0.0186 B + 137
| A | 0.0192 B + 119
---------------+---+--------------------------
Table 1: Estimated (E) and actual (A) end-to-end latencies, in usec.
B is the data length in bytes.
Also note that emulated copy is transparently compatible with the copy semantics of APIs such as those of Unix and Windows NT, for application buffers of arbitrary alignment, location, and length, whereas Fbufs implement entirely different semantics, and the Solaris scheme only works for application buffers that are page-aligned and of length multiple of the page size.
In [2], we discuss the hardware support required for emulated copy in multiple-packet communication.
[2] J. Brustoloni and P. Steenkiste. ``Copy Emulation in Checksummed, Multiple-Packet Communication'', in Proc. INFOCOM'97, IEEE, April 1997. Also available from http://www.cs.cmu.edu/~jcb [3] H. J. Chu. ``Zero-Copy TCP in Solaris'', in Proc. Winter Tech. Conf., USENIX, Jan. 1996.
[4] P. Druschel and L. Peterson. ``Fbufs: A High-Bandwidth Cross-Domain Transfer Facility", in Proc. 14th SOSP, ACM, Dec. 1993, pp. 189-202.