Error Control for Large-Scale Multicast Applications

Introduction

The recent popularity of the MBONE has led to the deployment of multicast applications such as video-conferencing, distributed interactive simulation, and news distribution. Many of these applications require reliable data delivery, which is not provided by IP Multicast, necessitating higher level protocols that work on top of the network multicast service. Large scale applications (e.g., DIS) complicate the problem because of the large numbers of participants distributed over a wide geographical area, and the highly dynamic nature of their topology and population. These applications pose challenges that traditional error control schemes such as TCP cannot handle efficiently.

The challenges currently faced by reliable multicast within the context of IP Multicast include the following (depicted with the aid of Fig. 1):

Request implosion: this problem occurs when the loss of a packet triggers simultaneous requests from a large number of receivers, leading wasted bandwidth and multiple redundant requests arriving at all other members in the group. In Fig. 1, the loss of a packet triggers requests from receivers E4 through E7, which propagate to all memebers of the multicast group. This can become a serious problem as the group gets large. The challenge lies in limiting these requests to one.
Reply implosion: this problem occurs when in response to a request, many endpoints multicast the same reply. For example, in Fig. 1, following the requests from endpoints E4 through E7 the remaining receivers each multicast a reply. The consequences can be as bad as before, leading to reply implosion. The challenge lies in generating only one reply.
Exposure: In Fig. 1, loss affected only a subset of the receivers, namely E4 through E7. However, requests and replies were multicast to the entire group, exposing all members to a local fault. This is obviously undesirable in large groups dispersed over a wide geographical area. The challenge lies in allowing only affected endpoints to be exposed to recovery messages.

Currently, the most popular solutions to these problems are based on one of two approaches: (a) schemes employing randomized back-off timers, and (b) schemes employing a tree hierarchy. In the former, the general mechanism is as follows: when a loss is detected, each member starts a random timer before sending a request. The member which selected the shortest timer sends a request, which cancels all other requests. The reception of the request starts back-off timers among all the members which have the reply. A reply will be sent by the member with the shortest timer, which will cancel all other replies.

Randomized back-off schemes offer a tradeoff between implosion and latency: the longer the timeout interval the lower the implosion, but at the expense of higher latency. With large multicast groups, the latency penalty can be quite high. In addition, these schemes have no inherent ability to perform local recovery (to reduce exposure). Instead, they rely on limiting the scope of recovery messages using the TTL value in the IP header. However, this is a crude method of limiting exposure, as it limits messages to a radius, not a subtree. On the plus side, these schemes are very robust: a loss will be recovered even if only one member of the group has the requested data (but at the expense of exposure).

Schemes employing a tree hierarchy arrange group members into a tree. Requests are restricted to travel from children to their parents, and replies from parents to children. Thus requests and replies cannot propagate to the whole group, restricting implosion and eliminating exposure. However, for best results the tree hierarchy must follow the topology of the multicast tree created by the routing protocol. This is hard to do without access to routing information, thus some employ a static tree structure. In addition, whenever the group structure changes, as for example when members join or leave, the tree needs to be restructured, which is an expensive operation.

Thus, currently proposed solutions offer different tradeoffs: randomized back-off schemes offer good robustness, but trade latency for reducing implosion and rely on crude methods to limit exposure. Tree-based schemes offer good implosion control, low latency and low exposure; but tree formation is a problem, and do not adapt well to dynamic membership changes because they either use a static tree structure, or use complex operations to form a dynamic tree.

This leads to two additional challenges faced by reliable multicast:

Minimize recovery latency: this is the latency experienced by a member from the instant a loss is detected until a reply is received. The challenge lies in keeping this lower than the unicast delay to the source, since recovery can be done localy in most cases.
Adaptability to dynamic membership changes: this is a measure of how efficiency (in terms of loss of service, duplicate messages and added latency) of error recovery is affected by changes in the group topology and membership. The challenge lies in making recovery insensitive to such changes.

In this project, we address all the above 5 problems by following a different approach: we modify the IP Multicast service model to allow routers to offer a small set of new forwarding services to the members of a multicast group. Applications develop error control schemes that can leverage off these new services to provide reliability. The services do not impose a specific error control scheme. The implementation of these services at the routers eliminates the need for endpoints to learn about group topology. Additionally, it allows easy access to topology information, which leads to an efficient implementation. These forwarding services are conceptually simple, and while they do require some changes at the routers, they do not violate the end-to-end argument, and merge well with current IP routing and group management protocols.

Research Plan Solution Overview Infocom Paper Presentation People ARL