draft-irtf-iccrg-welzl-congestion-control-open-research-03.txt | draft-irtf-iccrg-welzl-congestion-control-open-research-04.txt | |||
---|---|---|---|---|
Network Working Group Michael Welzl | Network Working Group Michael Welzl | |||
Internet Draft Dimitri Papadimitriou | Internet Draft Dimitri Papadimitriou | |||
Document: draft-irtf-iccrg-welzl- Editors | Expires: November 16, 2009 Editors | |||
congestion-control-open-research-03.txt | ||||
Expires: October 16, 2009 Michael Scharf | ||||
Bob Briscoe | ||||
April 17, 2009 | ||||
Michael Scharf | ||||
Bob Briscoe | ||||
Open Research Issues in Internet Congestion Control | Open Research Issues in Internet Congestion Control | |||
draft-irtf-iccrg-welzl-congestion-control-open-research-03.txt | draft-irtf-iccrg-welzl-congestion-control-open-research-04.txt | |||
Status of this Memo | Status of this Memo | |||
This Internet-Draft is submitted to IETF in full conformance with the | This Internet-Draft is submitted to IETF in full conformance with the | |||
provisions of BCP 78 and BCP 79. | provisions of BCP 78 and BCP 79. | |||
Internet-Drafts are working documents of the Internet Engineering | Internet-Drafts are working documents of the Internet Engineering | |||
Task Force (IETF), its areas, and its working groups. Note that | Task Force (IETF), its areas, and its working groups. Note that | |||
other groups may also distribute working documents as Internet- | other groups may also distribute working documents as Internet- | |||
Drafts. | Drafts. | |||
skipping to change at page 2, line 13 | skipping to change at page 2, line 13 | |||
scale solutions can be confidently engineered and deployed. | scale solutions can be confidently engineered and deployed. | |||
Conventions used in this document | Conventions used in this document | |||
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", | The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", | |||
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this | "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this | |||
document are to be interpreted as described in RFC-2119 [i]. | document are to be interpreted as described in RFC-2119 [i]. | |||
Table of Contents | Table of Contents | |||
1. Introduction...................................................3 | 1. Introduction..................................................3 | |||
2. Global Challenges..............................................4 | 2. Global Challenges.............................................4 | |||
2.1 Heterogeneity..............................................4 | 2.1 Heterogeneity.............................................4 | |||
2.2 Stability..................................................6 | 2.2 Stability.................................................6 | |||
2.3 Fairness...................................................7 | 2.3 Fairness..................................................7 | |||
3. Detailed Challenges............................................9 | 3. Detailed Challenges...........................................9 | |||
3.1 Challenge 1: Network Support...............................9 | 3.1 Challenge 1: Network Support..............................9 | |||
3.2 Challenge 2: Corruption Loss..............................14 | 3.1.1 Performance and Robustness.........................12 | |||
3.3 Challenge 3: Packets Sizes................................16 | 3.1.2 Granularity of network component functions.........12 | |||
3.4 Challenge 4: Flow Startup.................................20 | 3.1.3 Information Acquisition............................13 | |||
3.5 Challenge 5: Multi-domain Congestion Control..............22 | 3.1.4 Feedback signaling.................................14 | |||
3.6 Challenge 6: Precedence for Elastic Traffic...............25 | 3.2 Challenge 2: Corruption Loss.............................14 | |||
3.7 Challenge 7: Misbehaving Senders and Receivers............26 | 3.3 Challenge 3: Packet Size.................................16 | |||
3.8 Other challenges..........................................27 | 3.4 Challenge 4: Flow Startup................................20 | |||
4. Security Considerations.......................................32 | 3.5 Challenge 5: Multi-domain Congestion Control.............22 | |||
5. Contributors..................................................32 | 3.5.1 Multi-domain Transport of Congestion Signals.......22 | |||
6. References....................................................32 | 3.5.2 Multi-domain Information Exchange..................23 | |||
6.1 Normative References.........................................32 | 3.5.3 Multi-domain Pseudowires...........................24 | |||
Acknowledgments...............................................40 | 3.6 Challenge 6: Precedence for Elastic Traffic..............25 | |||
3.7 Challenge 7: Misbehaving Senders and Receivers...........26 | ||||
3.8 Other Challenges.........................................27 | ||||
3.8.1 RTT Estimation.....................................27 | ||||
3.8.2 Malfunctioning Devices.............................29 | ||||
3.8.3 Dependence on RTT..................................30 | ||||
3.8.4 Congestion Control in Multi-layered Networks.......30 | ||||
3.8.5 Multipath End-to-end Congestion Control and Traffic | ||||
Engineering........................................31 | ||||
3.8.6 ALGs and Middleboxes...............................31 | ||||
4. Security Considerations......................................32 | ||||
5. References...................................................33 | ||||
5.1 Normative References.....................................33 | ||||
5.2 Informative References...................................35 | ||||
6. Acknowledgments..............................................40 | ||||
7. Author's Addresses...........................................41 | ||||
8. Contributors.................................................41 | ||||
Acknowledgments.................................................42 | ||||
1. Introduction | 1. Introduction | |||
This document describes some of the open research topics in the | This document describes some of the open research topics in the | |||
domain of Internet congestion control that are known today. We begin | domain of Internet congestion control that are known today. We begin | |||
by reviewing some proposed definitions of congestion and congestion | by reviewing some proposed definitions of congestion and congestion | |||
control based on current understandings. | control based on current understandings. | |||
Congestion can be defined as a state or condition that occurs when | Congestion can be defined as a state or condition that occurs when | |||
the network resources are overloaded resulting into impairments for | the network resources are overloaded resulting into impairments for | |||
network users as objectively measured by the probability of loss | network users as objectively measured by the probability of loss | |||
and/or of delay). The overload results in the reduction of utility in | and/or of delay. The overload results in the reduction of utility in | |||
networks that support both spatial and temporal multiplexing, but no | networks that support both spatial and temporal multiplexing, but no | |||
reservation [Keshav]. Congestion control is a (typically distributed) | reservation [Keshav07]. Congestion control is a (typically | |||
algorithm to share network resources among competing traffic sources. | distributed) algorithm to share network resources among competing | |||
traffic sources. | ||||
Two components of distributed congestion control have been defined in | Two components of distributed congestion control have been defined in | |||
the context of primal-dual modeling [Kelly98]. Primal congestion | the context of primal-dual modeling [Kelly98]. Primal congestion | |||
control refers to the algorithm executed by the traffic sources | control refers to the algorithm executed by the traffic sources | |||
algorithm for controlling their sending rates or window sizes. This | algorithm for controlling their sending rates or window sizes. This | |||
is normally a closed-loop control, where this operation depends on | is normally a closed-loop control, where this operation depends on | |||
feedback. TCP algorithms fall in this category. Dual congestion | feedback. TCP algorithms fall in this category. Dual congestion | |||
control is implemented by the routers through gathering information | control is implemented by the routers through gathering information | |||
about the traffic traversing them. A dual congestion control | about the traffic traversing them. A dual congestion control | |||
algorithm updates, implicitly or explicitly, a congestion measure or | algorithm updates, implicitly or explicitly, a congestion measure or | |||
congestion rate and sends it back, implicitly or explicitly, to the | congestion rate and sends it back, implicitly or explicitly, to the | |||
skipping to change at page 3, line 50 | skipping to change at page 4, line 5 | |||
successful over many years but have begun to reach their limits, as | successful over many years but have begun to reach their limits, as | |||
the heterogeneity of both the data link and physical layer and | the heterogeneity of both the data link and physical layer and | |||
applications are pulling TCP congestion control (which performs | applications are pulling TCP congestion control (which performs | |||
poorly as the bandwidth or delay increases) beyond its natural | poorly as the bandwidth or delay increases) beyond its natural | |||
operating regime. A side effect of these deficits is that there is an | operating regime. A side effect of these deficits is that there is an | |||
increasing share of hosts that use non-standardized congestion | increasing share of hosts that use non-standardized congestion | |||
control enhancements (for instance, many Linux distributions have | control enhancements (for instance, many Linux distributions have | |||
been shipped with "CUBIC" as default TCP congestion control | been shipped with "CUBIC" as default TCP congestion control | |||
mechanism). | mechanism). | |||
While the original Jacobson algorithm requires no congestion-related | While the original Van Jacobson algorithm requires no congestion- | |||
state in routers, more recent modifications have departed from the | related state in routers, more recent modifications have departed | |||
strict application of the end-to-end principle [Saltzer84] in order | from the strict application of the end-to-end principle [Saltzer84] | |||
to avoid congestion collapse. Active Queue Management (AQM) in | in order to avoid congestion collapse. Active Queue Management (AQM) | |||
routers, e.g., RED and its variants such as Weighted RED (WRED), | in routers, e.g., RED and its variants such as Weighted RED (WRED), | |||
Stabilized RED (SRED), Adaptive RED (ARED), xCHOKE [Pan00], RED with | Stabilized RED (SRED), Adaptive RED (ARED), xCHOKE [Pan00], RED with | |||
In/Out (RIO) [Clark98], improves performance by keeping queues small | In/Out (RIO) [Clark98], improves performance by keeping queues small | |||
(implicit feedback via dropped packets), while Explicit Congestion | (implicit feedback via dropped packets), while Explicit Congestion | |||
Notification (ECN) [Floyd94] [RFC3168] passes one bit of congestion | Notification (ECN) [Floyd94] [RFC3168] passes one bit of congestion | |||
information back to senders when an AQM would normally drop a packet. | information back to senders when an AQM would normally drop a packet. | |||
These measures do improve performance, but there is a limit to how | These measures do improve performance, but there is a limit to how | |||
much can be accomplished without more information from routers. The | much can be accomplished without more information from routers. The | |||
requirement of extreme scalability together with robustness has been | requirement of extreme scalability together with robustness has been | |||
a difficult hurdle to accelerating information flow. Primal-Dual | a difficult hurdle to accelerating information flow. Primal-Dual | |||
TCP/AQM distributed algorithm stability and equilibrium properties | TCP/AQM distributed algorithm stability and equilibrium properties | |||
skipping to change at page 5, line 42 | skipping to change at page 5, line 45 | |||
It is always possible to tune congestion control parameters based on | It is always possible to tune congestion control parameters based on | |||
some knowledge of the environment and the application scenario. | some knowledge of the environment and the application scenario. | |||
However, the interaction of multiple congestion control techniques | However, the interaction of multiple congestion control techniques | |||
interacting with each other is not yet well understood. The | interacting with each other is not yet well understood. The | |||
fundamental question is whether it is possible to define one | fundamental question is whether it is possible to define one | |||
congestion control mechanism that operates reasonably well in the | congestion control mechanism that operates reasonably well in the | |||
whole range of scenarios that exist in the Internet. Hence, it is an | whole range of scenarios that exist in the Internet. Hence, it is an | |||
important research question how new Internet congestion control | important research question how new Internet congestion control | |||
mechanisms would have to be designed, which maximum degree of | mechanisms would have to be designed, which maximum degree of | |||
dynamics they can efficiently handle, and whether they can keep the | dynamics they can efficiently handle, and whether they can keep the | |||
genererality of the existing end-to-end solutions. | generality of the existing end-to-end solutions. | |||
Some improvements to congestion control could be realized by simple | Some improvements to congestion control could be realized by simple | |||
changes of single functions in end-system or network components. | changes of single functions in end-system or network components. | |||
However, new mechanism(s) might also require a fundamental redesign | However, new mechanism(s) might also require a fundamental redesign | |||
of the overall network architecture, and they may even affect the | of the overall network architecture, and they may even affect the | |||
design of Internet applications. This can imply significant | design of Internet applications. This can imply significant | |||
interoperability and backward compatibility challenges and/or create | interoperability and backward compatibility challenges and/or create | |||
network accessibility obstacles. In particular, networks and/or | network accessibility obstacles. In particular, networks and/or | |||
applications that do not use or support a new congestion control | applications that do not use or support a new congestion control | |||
mechanism could be penalized by a significantly worse performance | mechanism could be penalized by a significantly worse performance | |||
skipping to change at page 6, line 37 | skipping to change at page 6, line 40 | |||
Control theoretic modeling of a realistic network can be quite | Control theoretic modeling of a realistic network can be quite | |||
difficult, especially when taking distinct packet sizes and | difficult, especially when taking distinct packet sizes and | |||
heterogeneous RTTs into account. It has therefore become common | heterogeneous RTTs into account. It has therefore become common | |||
practice to model simpler cases and to leave the more complicated | practice to model simpler cases and to leave the more complicated | |||
(realistic) situations for simulations. Clearly, if a mechanism is | (realistic) situations for simulations. Clearly, if a mechanism is | |||
not stable in a simple scenario, it is generally useless; this method | not stable in a simple scenario, it is generally useless; this method | |||
therefore helps to eliminate faulty congestion control candidates at | therefore helps to eliminate faulty congestion control candidates at | |||
an early stage. | an early stage. | |||
Some fundamental facts, which are known from control theory are | Some fundamental facts known from control theory are useful as | |||
useful as guidelines when designing a congestion control mechanism. | guidelines when designing a congestion control mechanism. For | |||
For instance, a controller should only be fed a system state that | instance, a controller should only be fed a system state that | |||
reflects its output. A (low-pass) filter function should be used in | reflects its output. A (low-pass) filter function should be used in | |||
order to pass only states to the controller that are expected to last | order to pass only states to the controller that are expected to last | |||
long enough for its action to be meaningful [Jain88]. Action should | long enough for its action to be meaningful [Jain88]. Action should | |||
be carried out whenever such feedback arrives, as it is a fundamental | be carried out whenever such feedback arrives, as it is a fundamental | |||
principle of control that the control frequency should be equal to | principle of control that the control frequency should be equal to | |||
the feedback frequency. Reacting faster leads to oscillations and | the feedback frequency. Reacting faster leads to oscillations and | |||
instability while reacting slower makes the system tardy [Jain90]. | instability while reacting slower makes the system tardy [Jain90]. | |||
TCP stability can be attributed to two key aspects which were | TCP stability can be attributed to two key aspects which were | |||
introduced in [Jacobson88]: the AIMD control law during congestion | introduced in [Jacobson88]: the AIMD control law during congestion | |||
skipping to change at page 7, line 20 | skipping to change at page 7, line 23 | |||
The reasoning in [Jacobson88] assumes all senders to be acting at the | The reasoning in [Jacobson88] assumes all senders to be acting at the | |||
same time. The stability of TCP under more realistic network | same time. The stability of TCP under more realistic network | |||
conditions has been investigated in a large number of ensuing works, | conditions has been investigated in a large number of ensuing works, | |||
leading to no clear conclusion that TCP would also be asymptotically | leading to no clear conclusion that TCP would also be asymptotically | |||
stable under arbitrary network conditions. On the other hand, | stable under arbitrary network conditions. On the other hand, | |||
research has concluded that stability can be assured with constraints | research has concluded that stability can be assured with constraints | |||
on dynamics that are less stringent than the "conservation of packets | on dynamics that are less stringent than the "conservation of packets | |||
principle". From control theory, only rate increase (not the target | principle". From control theory, only rate increase (not the target | |||
rate) needs to be inversely proportional to RTT (whereas window-based | rate) needs to be inversely proportional to RTT (whereas window-based | |||
control converges on a target rate inversely proportional to RTT). | control converges on a target rate inversely proportional to RTT). | |||
With rate-based control, high-speed congestion control converges on a | A congestion control mechanism can therefore converge on a rate that | |||
rate that is independent of RTT as long as its dynamics depends on | is independent of RTT as long as its dynamics depend on RTT (e.g. | |||
RTT (e.g. FAST TCP [Jin04]). | FAST TCP [Jin04]). | |||
However in the stability analysis of TCP and of these more modern | In the stability analysis of TCP and of these more modern controls, | |||
controls the stability impact of Slow Start (which can be significant | the impact of Slow Start on stability (which can be significant as | |||
as short-lived HTTP flows often never leave this phase) is not | short-lived HTTP flows often never leave this phase) is not entirely | |||
entirely clear. | clear. | |||
2.3 Fairness | 2.3 Fairness | |||
Recently, the way the Internet community reasons about fairness has | Recently, the way the Internet community reasons about fairness has | |||
been called into deep questioning [Bri07]. Much of the community has | been called into deep questioning [Bri07]. Much of the community has | |||
taken fairness to mean approximate equality between the rates of | taken fairness to mean approximate equality between the rates of | |||
flows (flow rate fairness) that experience equivalent path congestion | flows (flow rate fairness) that experience equivalent path congestion | |||
as with TCP [RFC2581] and TFRC [RFC3448]. [RFC3714] depicts the | as with TCP [RFC2581] and TFRC [RFC3448]. [RFC3714] depicts the | |||
resulting situation as "The Amorphous Problem of Fairness". | resulting situation as "The Amorphous Problem of Fairness". | |||
skipping to change at page 8, line 6 | skipping to change at page 8, line 7 | |||
In comparison, the debate between max-min, proportional and TCP | In comparison, the debate between max-min, proportional and TCP | |||
fairness is about mere details. These three all share the assumption | fairness is about mere details. These three all share the assumption | |||
that equal flow rates are desirable; they merely differ in the second | that equal flow rates are desirable; they merely differ in the second | |||
order issue of how to share out excess capacity in a network of many | order issue of how to share out excess capacity in a network of many | |||
bottlenecks. In contrast, cost fairness should lead to extremely | bottlenecks. In contrast, cost fairness should lead to extremely | |||
unequal flow rates by design. Equivalently, equal flow rates would | unequal flow rates by design. Equivalently, equal flow rates would | |||
typically be considered extremely unfair. | typically be considered extremely unfair. | |||
The two traditional approaches are not protocol options that can each | The two traditional approaches are not protocol options that can each | |||
be followed in different parts of an inter-network. They result in | be followed in different parts of an inter-network. They lead to | |||
research agendas and issues that are different in their respective | research agendas that are different in their respective objectives, | |||
objectives resulting in different set of open issues. | resulting in a different set of open issues. | |||
If we assume TCP-friendliness as a goal with flow rate as the metric, | If we assume TCP-friendliness as a goal with flow rate as the metric, | |||
open issues would be: | open issues would be: | |||
- Should rate fairness depend on the packet rate or the bit rate? | - Should rate fairness depend on the packet rate or the bit rate? | |||
- Should the flow rate depend on RTT (as in TCP) or should only flow | - Should the flow rate depend on RTT (as in TCP) or should only flow | |||
dynamics depend on RTT (e.g. as in Fast TCP [Jin04])? | dynamics depend on RTT (e.g. as in Fast TCP [Jin04])? | |||
- How to estimate whether a particular flow start strategy is fair, | - How to estimate whether a particular flow start strategy is fair, | |||
or whether a particular fast recovery strategy after a reduction in | or whether a particular fast recovery strategy after a reduction in | |||
rate due to congestion is fair? | rate due to congestion is fair? | |||
skipping to change at page 8, line 52 | skipping to change at page 9, line 4 | |||
- Which policy enforcement should be used by networks and what are | - Which policy enforcement should be used by networks and what are | |||
the interactions between application policy and network policy | the interactions between application policy and network policy | |||
enforcement? | enforcement? | |||
- How to design a new policy enforcement framework that will | - How to design a new policy enforcement framework that will | |||
appropriately compete with existing flows aiming for rate equality | appropriately compete with existing flows aiming for rate equality | |||
(e.g. TCP)? | (e.g. TCP)? | |||
The question of how to reason about fairness is a pre-requisite to | The question of how to reason about fairness is a pre-requisite to | |||
agreeing on the research agenda. If the relevant metric is flow-rate | agreeing on the research agenda. If the relevant metric is flow-rate | |||
it places constraints at protocol design-time, whereas if the metric | it places constraints at protocol design-time, whereas if the metric | |||
is congestion volume the constraints move to run-time, while design- | is the congestion volume the constraints move to run-time, while | |||
time constraints can be relaxed [Bri08]. However, that question does | design-time constraints can be relaxed [Bri08]. However, that | |||
not require more research in itself, it is merely a debate that needs | question does not require more research in itself, it is merely a | |||
to be resolved by studying existing research and by assessing how bad | debate that needs to be resolved by studying existing research and by | |||
fairness problems could become if they are not addressed rigorously, | assessing how bad fairness problems could become if they are not | |||
and whether we can rely on trust to maintain approximate fairness | addressed rigorously, and whether we can rely on trust to maintain | |||
without requiring policing complexity [Floyd08]. The latter points | approximate fairness without requiring policing complexity [Floyd08]. | |||
may themselves lead to additional research. However, it is also | The latter points may themselves lead to additional research. | |||
accepted that more research will not necessarily lead to convince | However, it is also accepted that more research will not necessarily | |||
either side to change their opinions. More debate would be needed. It | lead to convince either side to change their opinions. More debate | |||
seems also that if an architecture is built to support cost-fairness | would be needed. It seems also that if the architecture is built to | |||
then equal-costs result in flow-rate fairness as a degenerate case; | support cost-fairness then equal-costs result in flow-rate fairness | |||
that is, flow-rate fairness can be seen as a special case of cost- | as a degenerate case; that is, flow-rate fairness can be seen as a | |||
fairness. One can be used to build the other, but not vice-versa. | special case of cost-fairness. One can be used to build the other, | |||
but not vice-versa. | ||||
In the rest of this document, "fairness" means flow rate fairness. | ||||
3. Detailed Challenges | 3. Detailed Challenges | |||
3.1 Challenge 1: Network Support | 3.1 Challenge 1: Network Support | |||
This challenge is the most critical to get right. Changes to the | This challenge is perhaps the most critical to get right. Changes to | |||
balance of functions between the endpoints and network equipment | the balance of functions between the endpoints and network equipment | |||
could require a change to the per-datagram data plane interface | could require a change to the per-datagram data plane interface | |||
between the transport and network layers. Network equipment vendors | between the transport and network layers. Network equipment vendors | |||
need to be assured that any new interface is stable enough (on decade | need to be assured that any new interface is stable enough (on decade | |||
timescales) to build into firmware and hardware, and OS vendors will | timescales) to build into firmware and hardware, and OS vendors will | |||
not use a new interface unless it is likely to be widely deployed. | not use a new interface unless it is likely to be widely deployed. | |||
Network components can be involved in congestion control in two ways: | Network components can be involved in congestion control in two ways: | |||
first, they can implicitly optimize their functions, such as queue | first, they can implicitly optimize their functions, such as queue | |||
management and scheduling strategies, in order to support the | management and scheduling strategies, in order to support the | |||
operation of an end-to-end congestion control. Second, network | operation of an end-to-end congestion control. Second, network | |||
skipping to change at page 10, line 10 | skipping to change at page 10, line 16 | |||
network component that processes and stores packets. Various | network component that processes and stores packets. Various | |||
approaches have been proposed and also deployed, such as different | approaches have been proposed and also deployed, such as different | |||
AQM techniques. Even though these implicit techniques are known to | AQM techniques. Even though these implicit techniques are known to | |||
improve network performance during congestion phases, they are still | improve network performance during congestion phases, they are still | |||
only partly deployed in the Internet. This may be due to the fact | only partly deployed in the Internet. This may be due to the fact | |||
that finding optimal and robust parameterizations for these | that finding optimal and robust parameterizations for these | |||
mechanisms is a non-trivial problem. Indeed, the problem with various | mechanisms is a non-trivial problem. Indeed, the problem with various | |||
AQM schemes is the difficulty to identify correct values of the | AQM schemes is the difficulty to identify correct values of the | |||
parameter set that affects the performance of the queuing scheme (due | parameter set that affects the performance of the queuing scheme (due | |||
to variation in the number of sources, the capacity and the feedback | to variation in the number of sources, the capacity and the feedback | |||
delay) [Fioriu00] [Hollot01] [Zhang03]. Many AQM schemes (RED, REM, | delay) [Firoiu00] [Hollot01] [Zhang03]. Many AQM schemes (RED, REM, | |||
BLUE, PI-Controller but also Adaptive Virtual Queue (AVQ)) do not | BLUE, PI-Controller but also Adaptive Virtual Queue (AVQ)) do not | |||
define a systematic rule for setting their parameters. | define a systematic rule for setting their parameters. | |||
The second class of approaches uses explicit notification. By using | The second class of approaches uses explicit notification. By using | |||
explicit feedback from the network, connection endpoints can obtain | explicit feedback from the network, connection endpoints can obtain | |||
more accurate information about the current network characteristics | more accurate information about the current network characteristics | |||
on the path. This allows endpoints to make more precise decisions | on the path. This allows endpoints to make more precise decisions | |||
that can better prevent packet loss and that can also improve rate | that can better prevent packet loss and that can also improve rate | |||
equality among different flows. | equality among different flows. | |||
Explicit feedback techniques fall into three broad categories: | Explicit feedback techniques fall into three broad categories: | |||
o Explicit congestion feedback: whether one bit Explicit Congestion | - Explicit congestion feedback: one bit Explicit Congestion | |||
Notification (ECN) [RFC3168] or proposals for more than one bit | Notification (ECN) [RFC3168] or proposals for more than one bit | |||
[Xia05]; | [Xia05]; | |||
o Explicit per-datagram rate feedback: the eXplicit Control Protocol | - Explicit per-datagram rate feedback: the eXplicit Control Protocol | |||
(XCP) [Katabi02] [Falk07], the Rate Control Protocol (RCP) | (XCP) [Katabi02] [Falk07], the Rate Control Protocol (RCP) | |||
[Dukki05]; | [Dukki05]; | |||
o Explicit rate feedback: by in-band signaling, such as by Quick- | - Explicit rate feedback: by in-band signaling, such as by Quick- | |||
-Start [RFC4782], or by means of out-of-band signaling, e.g. | Start [RFC4782], or by means of out-of-band signaling, e.g. | |||
CADPC/PTP [Welzl03]. | CADPC/PTP [Welzl03]. | |||
Explicit router feedback can address some of the inherent | Explicit router feedback can address some of the inherent | |||
shortcomings of TCP. For instance, XCP was developed to overcome the | shortcomings of TCP. For instance, XCP was developed to overcome the | |||
inefficiency, unfairness and instability that TCP suffers from when | inefficiency, unfairness and instability that TCP suffers from when | |||
the per-flow bandwidth-delay product increases. By decoupling | the per-flow bandwidth-delay product increases. By decoupling | |||
resource utilization/congestion control from fairness control, XCP | resource utilization/congestion control from fairness control, XCP | |||
achieves fair bandwidth allocation, high utilization, a small | achieves fair bandwidth allocation, high utilization, a small | |||
standing queue size, and near-zero packet drops, with both steady and | standing queue size, and near-zero packet drops, with both steady and | |||
highly varying traffic. Importantly, XCP does not maintain any per- | highly varying traffic. Importantly, XCP does not maintain any per- | |||
skipping to change at page 11, line 18 | skipping to change at page 11, line 22 | |||
of flow completion times [Dukki06], taking an optimistic approach to | of flow completion times [Dukki06], taking an optimistic approach to | |||
flows likely to arrive in the next RTT and tolerating larger | flows likely to arrive in the next RTT and tolerating larger | |||
instantaneous queue sizes [Dukki05]. XCP on the other hand gives very | instantaneous queue sizes [Dukki05]. XCP on the other hand gives very | |||
poor flow completion times for short flows. | poor flow completion times for short flows. | |||
Both implicit and explicit router support should be considered in the | Both implicit and explicit router support should be considered in the | |||
context of the end-to-end argument [Saltzer84], which is one of the | context of the end-to-end argument [Saltzer84], which is one of the | |||
key design principles of the Internet. It suggests that functions | key design principles of the Internet. It suggests that functions | |||
that can be realized both in the end-systems and in the network | that can be realized both in the end-systems and in the network | |||
should be implemented in the end-systems. This principle ensures that | should be implemented in the end-systems. This principle ensures that | |||
the network provides a general service and that remains as simple as | the network provides a general service and that it remains as simple | |||
possible (any additional complexity is placed above the IP layer, | as possible (any additional complexity is placed above the IP layer, | |||
i.e., at the edges) so as to ensure evolvability, reliability and | i.e., at the edges) so as to ensure evolvability, reliability and | |||
robustness. Furthermore, the fate-sharing principle, enunciated by | robustness. Furthermore, the fate-sharing principle ([Clark88] | |||
Dave Clark in "Design Philosophy of the DARPA Internet Protocols", | "Design Philosophy of the DARPA Internet Protocols") mandates that an | |||
mandates that an end-to-end Internet protocol design should not rely | end-to-end Internet protocol design should not rely on the | |||
on the maintenance of any per-flow state (i.e., information about the | maintenance of any per-flow state (i.e., information about the state | |||
state of the end-to-end communication) inside the network [RFC1958] | of the end-to-end communication) inside the network and that the | |||
and that the network state (e.g. routing state) maintained by the | network state (e.g. routing state) maintained by the Internet shall | |||
Internet shall minimize its interaction with the states maintained at | minimize its interaction with the states maintained at the end- | |||
the end-points/hosts. | points/hosts [RFC1958]. | |||
However, as discussed for instance in [Moors02], congestion control | However, as discussed for instance in [Moors02], congestion control | |||
cannot be realized as a pure end-to-end function only. Congestion is | cannot be realized as a pure end-to-end function only. Congestion is | |||
an inherent network phenomenon and can only be resolved efficiently | an inherent network phenomenon and can only be resolved efficiently | |||
by some cooperation of end-systems and the network. Congestion | by some cooperation of end-systems and the network. Congestion | |||
control in today's Internet protocols follows the end-to-end design | control in today's Internet protocols follows the end-to-end design | |||
principle insofar as only minimal feedback from the network is used | principle insofar as only minimal feedback from the network is used, | |||
(e. g., packet loss and delay). The end-systems only decide how to | e.g., packet loss and delay. The end-systems only decide how to | |||
react and how to avoid congestion. The crux is that, on the one hand, | react and how to avoid congestion. The crux is that, on the one hand, | |||
there would be substantial benefit by further assistance from the | there would be substantial benefit by further assistance from the | |||
network, but, on the other hand, such router support could lead to | network, but, on the other hand, such network support could lead to | |||
duplication of functions, which might even harmfully interact with | duplication of functions, which might even harmfully interact with | |||
end-to-end protocol mechanisms. The different requirements of | end-to-end protocol mechanisms. The different requirements of | |||
applications (cf. the fairness discussion in Section 2.3) call for a | applications (cf. the fairness discussion in Section 2.3) call for a | |||
variety of different congestion control approaches, but putting such | variety of different congestion control approaches, but putting such | |||
per-flow behavior inside the network should be avoided, as such | per-flow behavior inside the network should be avoided, as such | |||
design would clearly be at odds with the end-to-end and fate sharing | design would clearly be at odds with the end-to-end and fate sharing | |||
design principles. | design principles. | |||
The end-to-end and fate sharing principles are generally regarded as | The end-to-end and fate sharing principles are generally regarded as | |||
the key ingredients for ensuring a scalable and survivable network | the key ingredients for ensuring a scalable and survivable network | |||
design. In order to ensure that new congestion control mechanisms are | design. In order to ensure that new congestion control mechanisms are | |||
scalable, violating these principles must therefore be avoided. | scalable, violating these principles must therefore be avoided. | |||
In general, network support of congestion control raises many issues | In general, network support of congestion control raises many issues | |||
that have not been completely solved yet. | that have not been completely solved yet. | |||
3.1.1 Performance and robustness | 3.1.1 Performance and Robustness | |||
Congestion control is subject to some tradeoffs: on one hand, it must | Congestion control is subject to some tradeoffs: on one hand, it must | |||
allow high link utilizations and fair resource sharing but on the | allow high link utilizations and fair resource sharing but on the | |||
other hand, the algorithms must also be robust in particular during | other hand, the algorithms must also be robust in particular during | |||
congestion phases. | congestion phases. | |||
Router support can help to improve performance but it can also result | Router support can help to improve performance but it can also result | |||
in additional complexity and more control loops. This requires a | in additional complexity and more control loops. This requires a | |||
careful design of the algorithms in order to ensure stability and | careful design of the algorithms in order to ensure stability and | |||
avoid e.g. oscillations. A further challenge is the fact that | avoid e.g. oscillations. A further challenge is the fact that | |||
skipping to change at page 12, line 45 | skipping to change at page 12, line 48 | |||
- What is the minimum support that is needed from the network in | - What is the minimum support that is needed from the network in | |||
order to achieve significantly better performance than with | order to achieve significantly better performance than with | |||
end-to-end mechanisms and the current IP header limitations that | end-to-end mechanisms and the current IP header limitations that | |||
provide at most unary ECN signals? | provide at most unary ECN signals? | |||
3.1.2 Granularity of network component functions | 3.1.2 Granularity of network component functions | |||
There are several degrees of freedom concerning the involvement of | There are several degrees of freedom concerning the involvement of | |||
network entities, ranging from some few additional functions in | network entities, ranging from some few additional functions in | |||
network management procedures on the one end, and additional per | network management procedures on the one end to additional per | |||
packet processing on the other end of the solution space. | packet processing on the other end of the solution space. | |||
Furthermore, different amounts of state can be kept in routers (no | Furthermore, different amounts of state can be kept in routers (no | |||
per-flow state, partial per-flow state, soft state, hard state). The | per-flow state, partial per-flow state, soft state, hard state). The | |||
additional router processing is a challenge for Internet scalability | additional router processing is a challenge for Internet scalability | |||
and could also increase end-to-end latencies. | and could also increase end-to-end latencies. | |||
There are many solutions that do not require per-flow state and thus | There are many solutions that do not require per-flow state and thus | |||
do not cause a large processing overhead. However, scalability issues | do not cause a large processing overhead. However, scalability issues | |||
could also be caused, for instance, by synchronization mechanisms for | could also be caused, for instance, by synchronization mechanisms for | |||
state information among parallel processing entities, which are e. g. | state information among parallel processing entities, which are e. g. | |||
used in high-speed router hardware designs. | used in high-speed router hardware designs. | |||
Open questions are: | Open questions are: | |||
- What granularity of router processing can be realized without | - What granularity of router processing can be realized without | |||
affecting Internet scalability? | affecting Internet scalability? | |||
- How can additional processing efforts be kept at a minimum? | - How can additional processing efforts be kept at a minimum? | |||
3.1.3 Information acquisition | 3.1.3 Information Acquisition | |||
In order to support congestion control, network components have to | In order to support congestion control, network components have to | |||
obtain at least a subset of the following information. Obtaining that | obtain at least a subset of the following information. Obtaining that | |||
information may result in complex tasks. | information may result in complex tasks. | |||
1. Capacity of (outgoing) links | 1. Capacity of (outgoing) links | |||
Link characteristics depend on the realization of lower protocol | Link characteristics depend on the realization of lower protocol | |||
layers. Routers operating at IP layer do not necessarily know the | layers. Routers operating at IP layer do not necessarily know the | |||
link layer network topology and link capacities, and these are not | link layer network topology and link capacities, and these are not | |||
always constant (e. g., on shared wireless links or bandwidth-on- | always constant (e. g., on shared wireless links or bandwidth-on- | |||
demand links). Depending on the network technology, there can be | demand links). Depending on the network technology, there can be | |||
queues or bottlenecks that are not directly visible at the IP layer. | queues or bottlenecks that are not directly visible at the IP | |||
networking layer. | ||||
Difficulties also arise when using IP-in-IP tunnels [RFC 2003] IPsec | Difficulties also arise when using IP-in-IP tunnels [RFC 2003] | |||
tunnels [RFC4301], IP encapsulated in L2TP [RFC2661], GRE [RFC1701], | IPsec tunnels [RFC4301], IP encapsulated in L2TP [RFC2661], GRE | |||
PPTP [RFC2637] or MPLS [RFC3031] [RFC3032] [RFC5129]. In these cases, | [RFC1701] [RFC2784], PPTP [RFC2637] or MPLS [RFC3031] [RFC3032] | |||
link information could be determined by cross-layer information | [RFC5129]. In these cases, link information could be determined by | |||
exchange, but this requires link layer technology specific | cross-layer information exchange, but this requires link layer | |||
interfaces. An alternative could be online measurements, but this can | technology specific interfaces. An alternative could be online | |||
cause significant additional network overhead. General guidelines for | measurements, but this can cause significant additional network | |||
encapsulation and decapsulation of explicit congestion information | overhead. General guidelines for encapsulation and decapsulation | |||
are currently in preparation [ECN-tunnel]. | of explicit congestion information are currently in preparation | |||
[ECN-tunnel]. | ||||
2. Traffic carried over (outgoing) links | 2. Traffic carried over (outgoing) links | |||
Accurate online measurement of data rates is challenging when traffic | Accurate online measurement of data rates is challenging when | |||
is bursty. For instance, measuring a "current link load" requires | traffic is bursty. For instance, measuring a "current link load" | |||
defining the right measurement interval/ sampling interval. This is a | requires defining the right measurement interval / sampling | |||
challenge for proposals that require knowledge e.g. about the current | interval. This is a challenge for proposals that require knowledge | |||
link utilization. | e.g. about the current link utilization. | |||
3. Internal buffer statistics | 3. Internal buffer statistics | |||
Some proposals use buffer statistics such as a virtual queue length | Some proposals use buffer statistics such as a virtual queue | |||
to trigger feedback. However, network components can include multiple | length to trigger feedback. However, network components can | |||
distributed buffer stages that make it difficult to obtain such | include multiple distributed buffer stages that make it difficult | |||
metrics. | to obtain such metrics. | |||
Open questions are: Can and should this information be made | Open questions are: | |||
available, e.g., by additional interfaces or protocols? | ||||
- Can and should this information be made available, e.g., by | ||||
additional interfaces or protocols? | ||||
3.1.4 Feedback signaling | 3.1.4 Feedback signaling | |||
Explicit notification mechanisms can be realized either by in-band | Explicit notification mechanisms can be realized either by in-band | |||
signaling (notifications piggybacked along with the data traffic) or | signaling (notifications piggybacked along with the data traffic) or | |||
by out-of-band signaling [Sarola07]. The latter case requires | by out-of-band signaling [Sarola07]. The latter case requires | |||
additional protocols and a secure binding between the signals and the | additional protocols and a secure binding between the signals and the | |||
packets they refer to. Out-of-band signaling can be further | packets they refer to. Out-of-band signaling can be further | |||
subdivided into path-coupled and path-decoupled approaches. | subdivided into path-coupled and path-decoupled approaches. | |||
skipping to change at page 15, line 35 | skipping to change at page 15, line 43 | |||
checksum only covers all the necessary header fields and this | checksum only covers all the necessary header fields and this | |||
checksum does not show an error, it is possible for errors to be | checksum does not show an error, it is possible for errors to be | |||
found in the payload using a second checksum. Such error detection is | found in the payload using a second checksum. Such error detection is | |||
possible with UDP-Lite and DCCP; it was found to work well over a | possible with UDP-Lite and DCCP; it was found to work well over a | |||
GPRS network in a study [Chester04] and poorly over a WiFi network in | GPRS network in a study [Chester04] and poorly over a WiFi network in | |||
another study [Rossi06] [Welzl08]. Note that, while UDP-Lite and DCCP | another study [Rossi06] [Welzl08]. Note that, while UDP-Lite and DCCP | |||
enable the detection of corruption, the specifications of these | enable the detection of corruption, the specifications of these | |||
protocols do not foresee any specific reaction to it for the time | protocols do not foresee any specific reaction to it for the time | |||
being. | being. | |||
The idea of having a transport endpoint detect and accordingly react | The idea of having a transport end-point detecting and accordingly | |||
(or not) to corruption poses a number of interesting questions | reacting (or not) to corruption poses a number of interesting | |||
regarding cross-layer interactions. As IP is designed to operate over | questions regarding cross-layer interactions. As IP is designed to | |||
arbitrary link layers, it is therefore difficult to design a | operate over arbitrary link layers, it is therefore difficult to | |||
congestion control mechanism on top of it, which appropriately reacts | design a congestion control mechanism on top of it, which | |||
to corruption - especially as the specific data link layers that are | appropriately reacts to corruption - especially as the specific data | |||
in use along an end-to-end path are typically unknown to entities at | link layers that are in use along an end-to-end path are typically | |||
the transport layer. | unknown to entities at the transport layer. | |||
While the IETF has not yet specified how a congestion control | While the IETF has not yet specified how a congestion control | |||
mechanism should react to corruption, proposals exist in the | mechanism should react to corruption, proposals exist in the | |||
literature. For instance, TCP Westwood sets the congestion window | literature. For instance, TCP Westwood sets the congestion window | |||
equal to the measured bandwidth at time of congestion in response to | equal to the measured bandwidth at the time of congestion in response | |||
three DupACKs or a timeout. This measurement is obtained by counting | to three DupACKs or a timeout. This measurement is obtained by | |||
and filtering the ACK rate. This setting provides a significant | counting and filtering the ACK rate. This setting provides a | |||
goodput improvement in noisy channels because the "blind" by half | significant goodput improvement in noisy channels because the "blind" | |||
window reduction of standard TCP is avoided, i.e. the window is not | by half window reduction of standard TCP is avoided, i.e. the window | |||
reduced by too much [Mascolo01]. | is not reduced by too much [Mascolo01]. | |||
Open questions concerning corruption loss include: | Open questions concerning corruption loss include: | |||
- How should corruption loss be detected? | - How should corruption loss be detected? | |||
- How should a source react when it is known that corruption has | - How should a source react when it is known that corruption has | |||
occurred? | occurred? | |||
- Can an ECN-capable flow infer that loss must be due to corruption | - Can an ECN-capable flow infer that loss must be due to corruption | |||
just from lack of explicit congestion notifications around a loss | just from lack of explicit congestion notifications around a loss | |||
episode [LT-TCP]? Or could this inference be dangerous given the | episode [LT-TCP]? Or could this inference be dangerous given the | |||
transport doesn't know whether queues on the path are all ECN- | transport does not know whether all queues on the path are ECN- | |||
capable? | capable or not? | |||
3.3 Challenge 3: Packets Sizes | 3.3 Challenge 3: Packet Size | |||
TCP does not take packet size into account when responding to losses | TCP does not take packet size into account when responding to losses | |||
or ECN. Over past years, the performance of TCP congestion avoidance | or ECN. Over past years, the performance of TCP congestion avoidance | |||
algorithms has been extensively studied. The well known "square root | algorithms has been extensively studied. The well known "square root | |||
formula" provides the performance of the TCP congestion avoidance | formula" provides the performance of the TCP congestion avoidance | |||
algorithm for TCP Reno [RFC2581]. [Padhye98] enhances the model to | algorithm for TCP Reno [RFC2581]. [Padhye98] enhances the model to | |||
account for timeouts, receiver window, and delayed ACKs. | account for timeouts, receiver window, and delayed ACKs. | |||
For the sake of the present discussion, we will assume that the TCP | For the sake of the present discussion, we will assume that the TCP | |||
throughput is expressed using the simplified formula. Using this | throughput is expressed using the simplified formula. Using this | |||
skipping to change at page 16, line 40 | skipping to change at page 16, line 48 | |||
inversely proportional to the RTT and the square root of the drop | inversely proportional to the RTT and the square root of the drop | |||
probability: | probability: | |||
S 1 | S 1 | |||
B ~ C --- ------- | B ~ C --- ------- | |||
RTT sqrt(p) | RTT sqrt(p) | |||
where, | where, | |||
S is the TCP segment size (in bytes) | S is the TCP segment size (in bytes) | |||
RTT is the end-to-end round trip time of the TCP connection | RTT is the end-to-end round trip time of the TCP | |||
(in seconds) | connection (in seconds) | |||
p is the packet drop probability | p is the packet drop probability | |||
Neglecting the fact that the TCP rate linearly depends on it, | Neglecting the fact that the TCP rate linearly depends on it, | |||
choosing the ideal packet size is a trade-off between high throughput | choosing the ideal packet size is a trade-off between high throughput | |||
(the larger a packet, the smaller the relative header overhead) and | (the larger a packet, the smaller the relative header overhead) and | |||
low delay (the smaller a packet, the shorter the time that is needed | low delay (the smaller a packet, the shorter the time that is needed | |||
until it is filled with data). Observing that TCP is not suited for | until it is filled with data). Observing that TCP is not optimal for | |||
applications such as streaming media (since reliable in-order | applications with streaming media (since reliable in-order delivery | |||
delivery and congestion control can cause arbitrarily long delays), | and congestion control can cause arbitrarily long delays), this | |||
this trade-off has not usually been considered for TCP applications, | trade-off has not usually been considered for TCP applications, and | |||
and the influence of the packet size on the sending rate is has not | the influence of the packet size on the sending rate is has not | |||
typically been seen as a significant issue, given there are still few | typically been seen as a significant issue, given there are still few | |||
paths through the Internet that support packets larger than the 1500B | paths through the Internet that support packets larger than the 1500B | |||
common with Ethernet. | common with Ethernet. | |||
The situation is already different for the Datagram Congestion | The situation is already different for the Datagram Congestion | |||
Control Protocol (DCCP) [RFC4340], which has been designed to enable | Control Protocol (DCCP) [RFC4340], which has been designed to enable | |||
unreliable but congestion-controlled datagram transmission, avoiding | unreliable but congestion-controlled datagram transmission, avoiding | |||
the arbitrary delays associated with TCP. DCCP is intended for | the arbitrary delays associated with TCP. DCCP is intended for | |||
applications such as streaming media that can benefit from control | applications such as streaming media that can benefit from control | |||
over the tradeoffs between delay and reliable in-order delivery. | over the tradeoffs between delay and reliable in-order delivery. | |||
DCCP provides for a choice of modular congestion control mechanisms. | DCCP provides for a choice of modular congestion control mechanisms. | |||
DCCP uses Congestion Control Identifiers (CCIDs) to specify the | DCCP uses Congestion Control Identifiers (CCIDs) to specify the | |||
congestion control mechanism. Three profiles are currently specified: | congestion control mechanism. Three profiles are currently specified: | |||
- DCCP Congestion Control ID 2 (CCID 2) [RFC4341]: | - DCCP Congestion Control ID 2 (CCID 2) [RFC4341]: | |||
TCP-like Congestion Control. CCID 2 sends data using a close | TCP-like Congestion Control. CCID 2 sends data using a close | |||
variant of TCP's congestion control mechanisms, incorporating a | approximation of TCP's congestion control, incorporating a | |||
variant of SACK [RFC2018, RFC3517]. CCID 2 is suitable for senders | variant of SACK [RFC2018, RFC3517]. CCID 2 is suitable for senders | |||
who can adapt to the abrupt changes in congestion window typical of | which can adapt to the abrupt changes in congestion window typical | |||
TCP's AIMD congestion control, and particularly useful for senders | of TCP's AIMD congestion control, and particularly useful for | |||
who would like to take advantage of the available bandwidth in an | senders which would like to take advantage of the available | |||
environment with rapidly changing conditions. | bandwidth in an environment with rapidly changing conditions. | |||
- DCCP Congestion Control ID 3 (CCID 3) [RFC4342]: | - DCCP Congestion Control ID 3 (CCID 3) [RFC4342]: | |||
TCP-Friendly Rate Control (TFRC) [RFC3448bis] is a congestion | TCP-Friendly Rate Control (TFRC) [RFC3448bis] is a congestion | |||
control mechanism designed for unicast flows operating in a best- | control mechanism designed for unicast flows operating in a best- | |||
effort Internet environment. It is reasonably fair when competing | effort Internet environment. It is reasonably fair when competing | |||
for bandwidth with TCP flows, but has a much lower variation of | for bandwidth with TCP flows, but has a much lower variation of | |||
throughput over time compared with TCP, making it more suitable for | throughput over time than TCP, making it more suitable for | |||
applications such as streaming media where a relatively smooth | applications such as streaming media where a relatively smooth | |||
sending rate is of importance. CCID 3 is appropriate for flows that | sending rate is of importance. CCID 3 is appropriate for flows that | |||
would prefer to minimize abrupt changes in the sending rate, | would prefer to minimize abrupt changes in the sending rate, | |||
including streaming media applications with small or moderate | including streaming media applications with small or moderate | |||
receiver buffering before playback. | receiver buffering before playback. | |||
- DCCP Congestion Control ID 4 [draft-ietf-ccid4-02.txt]: | - DCCP Congestion Control ID 4 [draft-ietf-ccid4-04.txt]: | |||
TFRC Small Packets (TFRC-SP) [RFC4828], a variant of TFRC | TFRC Small Packets (TFRC-SP) [RFC4828], a variant of the TFRC | |||
mechanism has been designed for applications that exchange small | mechanism has been designed for applications that exchange small | |||
packets. The objective of TFRC-SP is to achieve the same bandwidth | packets. The objective of TFRC-SP is to achieve the same bandwidth | |||
in bps (bits per second) as a TCP flow using packets of up to 1500 | in bps (bits per second) as a TCP flow using packets of up to 1500 | |||
bytes. TFRC-SP enforces a minimum interval of 10 ms between data | bytes. TFRC-SP enforces a minimum interval of 10 ms between data | |||
packets to prevent a single flow from sending small packets | packets to prevent a single flow from sending small packets | |||
arbitrarily frequently. TFRC is a congestion control mechanism for | arbitrarily frequently. CCID 4 has been designed to be used either | |||
unicast flows operating in a best-effort Internet environment, and | by applications that use a small fixed segment size, or by | |||
is designed for DCCP that controls the sending rate based on a | applications that change their sending rate by varying the segment | |||
stochastic Markov model for TCP Reno. CCID 4 has been designed to | size. Because CCID 4 is intended for applications that use a fixed | |||
be used either by applications that use a small fixed segment size, | small segment size, or that vary their segment size in response to | |||
or by applications that change their sending rate by varying the | congestion, the transmit rate derived from the TCP throughput | |||
segment size. Because CCID 4 is intended for applications that use | equation is reduced by a factor that accounts for the packet header | |||
a fixed small segment size, or that vary their segment size in | size, as specified in [RFC4828]. | |||
response to congestion, the transmit rate derived from the TCP | ||||
throughput equation is reduced by a factor that accounts for packet | ||||
header size, as specified in [RFC4828]. | ||||
The resulting open questions are: | The resulting open questions are: | |||
- How does TFRC-SP operate under various network conditions? | - How does TFRC-SP operate under various network conditions? | |||
- How to design congestion control so as to scale with packet | - How to design congestion control so as to scale with packet | |||
size (dependency of congestion algorithm on packet size)? | size (dependency of congestion algorithm on packet size)? | |||
Today, many network resources are designed so that packet processing | Today, many network resources are designed so that packet processing | |||
cannot be overloaded even for incoming loads at the maximum bit-rate | cannot be overloaded even for incoming loads at the maximum bit-rate | |||
of the line. If packet processing can handle sustained load r [packet | of the line. If packet processing can handle sustained load r [packet | |||
per second] and the minimum packet size is h [bit] (i.e. packet | per second] and the minimum packet size is h [bit] (i.e. packet | |||
headers with no payload), then a line rate of x [bit per second] will | headers with no payload), then a line rate of x [bit per second] will | |||
never be able to overload packet processing as long as x =< r.h. | never be able to overload packet processing as long as x =< r.h. | |||
However, realistic equipment is often designed to only cope with a | However, realistic equipment is often designed to only cope with a | |||
skipping to change at page 18, line 36 | skipping to change at page 18, line 43 | |||
Therefore, it is likely that most congestion seen on today's Internet | Therefore, it is likely that most congestion seen on today's Internet | |||
is due to an excess of bits rather than packets, although packet- | is due to an excess of bits rather than packets, although packet- | |||
congestion is not impossible for runs of small packets (e.g. TCP ACKs | congestion is not impossible for runs of small packets (e.g. TCP ACKs | |||
or DoS attacks with small UDP datagrams). | or DoS attacks with small UDP datagrams). | |||
This observation raises additional open issues: | This observation raises additional open issues: | |||
- Will bit congestion remain prevalent? | - Will bit congestion remain prevalent? | |||
Being able to assume that congestion is generally due to excess | Being able to assume that congestion is generally due to excess | |||
bits not excess packets is a useful simplifying assumption in the | bits, not excess packets is a useful simplifying assumption in the | |||
design of congestion control protocols. Can we rely on this | design of congestion control protocols. Can we rely on this | |||
assumption into the future? An alternative view of the future is | assumption for the future? An alternative view is that in-network | |||
that in-network processing will become commonplace, so that per- | processing will become commonplace, so that per-packet processing | |||
packet processing will be as likely to be the bottleneck as per-bit | will be as likely to be the bottleneck as per-bit transmission | |||
transmission [Shin08]. | [Shin08]. | |||
Over the last three decades, performance gains have mainly been | Over the last three decades, performance gains have mainly been | |||
through increased packet rates, not bigger packets. But if bigger | achieved through increased packet rates, not bigger packets. But if | |||
maximum segment sizes do become more prevalent, tiny segments (e.g. | bigger maximum segment sizes do become more prevalent, tiny | |||
ACKs) will not stop being widely used – leading to - a widening | segments (e.g. ACKs) will not stop being widely used - leading to a | |||
range of packet sizes. | widening range of packet sizes. | |||
The open question is thus whether or not packet processing rates | The open question is thus whether or not packet processing rates | |||
(r) will keep up with growth in transmission rates (x). A | (r) will keep up with growth in transmission rates (x). A | |||
superficial look at Moore's Law type trends would suggest that | superficial look at Moore's Law type trends would suggest that | |||
processing (r) will continue to outstrip growth in transmission | processing (r) will continue to outstrip growth in transmission | |||
(x). But predictions based on actual knowledge of technology | (x). But predictions based on actual knowledge of technology | |||
futures would be useful. Another open question is whether there are | futures would be useful. Another open question is whether there are | |||
likely to be more small packets in the average packet mix. If the | likely to be more small packets in the average packet mix. If the | |||
answers to either of these questions predict that packet congestion | answers to either of these questions predict that packet congestion | |||
could become prevalent, congestion control protocols will have to | could become prevalent, congestion control protocols will have to | |||
skipping to change at page 19, line 27 | skipping to change at page 19, line 35 | |||
congestion and packet congestion), and policing loss. | congestion and packet congestion), and policing loss. | |||
If congestion is due to excess bits, the bit rate should be | If congestion is due to excess bits, the bit rate should be | |||
reduced. If congestion is due to excess packets, the packet rate | reduced. If congestion is due to excess packets, the packet rate | |||
can be reduced without reducing the bit rate - by using larger | can be reduced without reducing the bit rate - by using larger | |||
packets. However, if the transport cannot tell which of these | packets. However, if the transport cannot tell which of these | |||
causes led to a specific drop, its only safe response is to reduce | causes led to a specific drop, its only safe response is to reduce | |||
the bit rate. This is why the Internet would be more complicated if | the bit rate. This is why the Internet would be more complicated if | |||
packet congestion were prevalent, as reducing the bit rate normally | packet congestion were prevalent, as reducing the bit rate normally | |||
also reduces the packet rate, while reducing the packet rate | also reduces the packet rate, while reducing the packet rate | |||
doesn't necessarily reduce the bit rate. | does not necessarily reduce the bit rate. | |||
Given distinguishing between transmission loss and congestion is | Given distinguishing between transmission loss and congestion is | |||
already an open issue (Section 3.2), if that problem is ever | already an open issue (Section 3.2), if that problem is ever | |||
solved, a further open issue would be whether to standardize a | solved, a further open issue would be whether to standardize a | |||
solution that distinguishes all the above causes of drop, not just | solution that distinguishes all the above causes of drop, not just | |||
two of them. | two of them. | |||
Nonetheless, even if we find a way for network equipment to | Nonetheless, even if we find a way for network equipment to | |||
explicitly distinguish which sort of drop has occurred, we will | explicitly distinguish which sort of drop has occurred, we will | |||
never be able to assume that such a smart AQM solution is deployed | never be able to assume that such a smart AQM solution is deployed | |||
at every congestible resource throughout the Internet - at every | at every congestible resource throughout the Internet - at every | |||
higher layer device like firewalls, proxies, servers and at every | higher layer device like firewalls, proxies, servers and at every | |||
lower layer device like low-end home hubs, DSLAMs, WLAN cards, | lower layer device like low-end home hubs, DSLAMs, WLAN cards, | |||
cellular base-stations and so on. Thus, transport protocols will | cellular base-stations and so on. Thus, transport protocols will | |||
always have to cope with drops due to unpredictable causes, so we | always have to cope with drops due to unpredictable causes, so we | |||
should always treat AQM smarts as an optimization, not a given. | should always treat, e.g., AQM as an optimization, not a given. | |||
- What does a congestion notification on a packet of a certain size | - What does a congestion notification on a packet of a certain size | |||
mean? | mean? | |||
The open issue here is whether a loss or explicit congestion mark | The open issue here is whether a loss or explicit congestion mark | |||
should be interpreted as a single congestion event irrespective of | should be interpreted as a single congestion event irrespective of | |||
the size of the packet lost or marked, or whether the strength of | the size of the packet lost or marked, or whether the strength of | |||
the congestion notification is weighted by the size of the packet. | the congestion notification is weighted by the size of the packet. | |||
This issue is discussed at length in [Bri08], along with other | This issue is discussed at length in [Bri08], along with other | |||
aspects of packet size and congestion control. | aspects of packet size and congestion control. | |||
skipping to change at page 20, line 38 | skipping to change at page 20, line 47 | |||
must then face the issue of how they should take account of packet | must then face the issue of how they should take account of packet | |||
size. If we determine that TCP was incorrect in not taking account | size. If we determine that TCP was incorrect in not taking account | |||
of packet size, even if we don't change TCP, we should not allow | of packet size, even if we don't change TCP, we should not allow | |||
new protocols to follow TCP's example in this respect. For example, | new protocols to follow TCP's example in this respect. For example, | |||
as explained here above, the small-packet variant of TCP-friendly | as explained here above, the small-packet variant of TCP-friendly | |||
rate control (TFRC-SP [RFC4828]) is an experimental protocol that | rate control (TFRC-SP [RFC4828]) is an experimental protocol that | |||
aims to take account of packet size. Whatever packet size it uses, | aims to take account of packet size. Whatever packet size it uses, | |||
it ensures its rate approximately equals that of a TCP using 1500B | it ensures its rate approximately equals that of a TCP using 1500B | |||
segments. This raises the further question of whether TCP with | segments. This raises the further question of whether TCP with | |||
1500B segments will be a suitable long-term gold standard, or | 1500B segments will be a suitable long-term gold standard, or | |||
whether we need a more thoroughgoing review of what it means for a | whether we need a more thorough review of what it means for a | |||
congestion control to scale with packet size. | congestion control to scale with packet size. | |||
3.4 Challenge 4: Flow Startup | 3.4 Challenge 4: Flow Startup | |||
The beginning of data transmissions imposes some further, unique | The beginning of data transmissions imposes some further, unique | |||
challenges: When a connection to a new destination is established, | challenges: when a connection to a new destination is established, | |||
the end-systems have hardly any information about the characteristics | the end-systems have hardly any information about the characteristics | |||
of the path in between and the available bandwidth. In this flow | of the path in between and the available bandwidth. In this flow | |||
startup situation there is no obvious choice how to start to send. A | startup situation there is no obvious choice how to start to send. A | |||
similar problem also occurs after relatively long idle times, since | similar problem also occurs after relatively long idle times, since | |||
the congestion control state then no longer reflects current | the congestion control state then no longer reflects current | |||
information about the state of the network (flow restart problem). | information about the state of the network (flow restart problem). | |||
Van Jacobson [Jacobson88] suggested using the slow-start mechanism | Van Jacobson [Jacobson88] suggested using the slow-start mechanism | |||
both for the flow startup and the flow restart, and this is today’s | both for the flow startup and the flow restart, and this is today's | |||
standard solution [RFC2581]. The slow-start algorithm starts with a | standard solution [RFC2581]. The slow-start algorithm starts with a | |||
small initial congestion window, which is exponentially increased as | small initial congestion window, which is exponentially increased as | |||
soon as acknowledgements arrive. However, the slow-start is not | soon as acknowledgements arrive. However, the slow-start is not | |||
optimal in many situations: First, it can take quite a long time | optimal in many situations: First, it can take quite a long time | |||
until a sender can fully utilize the available bandwidth on a path. | until a sender can fully utilize the available bandwidth on a path. | |||
Second, the exponential increase may be too aggressive and cause | Second, the exponential increase may be too aggressive and cause | |||
multiple packet loss if large congestion windows are reached (slow- | multiple packet loss if large congestion windows are reached (slow- | |||
start overshooting). Finally, the slow-start does not ensure that new | start overshooting). Finally, the slow-start does not ensure that new | |||
flows converge quickly to a reasonable share of resources, in | flows converge quickly to a reasonable share of resources, in | |||
particular if they compete with long-lived flows. This convergence | particular if they compete with long-lived flows. This convergence | |||
skipping to change at page 21, line 29 | skipping to change at page 21, line 35 | |||
The slow-start and its interaction with the congestion avoidance | The slow-start and its interaction with the congestion avoidance | |||
phase was largely designed by intuition [Jacobson88]. So far, little | phase was largely designed by intuition [Jacobson88]. So far, little | |||
theory has been developed to understand the flow startup problem and | theory has been developed to understand the flow startup problem and | |||
its implication on congestion control stability and fairness. There | its implication on congestion control stability and fairness. There | |||
is also no established methodology to evaluate whether new flow | is also no established methodology to evaluate whether new flow | |||
startup mechanisms are appropriate or not. | startup mechanisms are appropriate or not. | |||
As a consequence, it is a non-trivial task to address the | As a consequence, it is a non-trivial task to address the | |||
shortcomings of the slow-start algorithm. Several experimental | shortcomings of the slow-start algorithm. Several experimental | |||
enhancements have been proposed, such as the congestion window | enhancements have been proposed, such as congestion window validation | |||
validation [RFC2861] and the limited slow-start [RFC3742]. There are | [RFC2861] and limited slow-start [RFC3742]. There are also ongoing | |||
also ongoing research activities, focusing e.g. on bandwidth | research activities, focusing e.g. on bandwidth estimation | |||
estimation techniques, delay-based congestion control, or rate pacing | techniques, delay-based congestion control, or rate pacing | |||
mechanisms. However, any alternative end-to-end flow startup approach | mechanisms. However, any alternative end-to-end flow startup approach | |||
has to cope with the inherent problem that there is no or only few | has to cope with the inherent problem that there is no or only little | |||
information about the path at the beginning of a data transfer. This | information about the path at the beginning of a data transfer. This | |||
uncertainty could be reduced by more expressive feedback signaling | uncertainty could be reduced by more expressive feedback signaling | |||
(cf. Section 3.1). For instance, a source could learn the path | (cf. Section 3.1). For instance, a source could learn the path | |||
characteristics faster with the Quick-Start mechanism [RFC4782]. But, | characteristics faster with the Quick-Start mechanism [RFC4782]. But, | |||
even if the source knew exactly what rate it should aim for, it would | even if the source knew exactly what rate it should aim for, it would | |||
still not necessarily be safe to jump straight to that rate. The end- | still not necessarily be safe to jump straight to that rate. The end- | |||
system still doesn't know how much how a change in its own rate will | system still does not know how a change in its own rate will affect | |||
affect the path, which also might become congested in less than one | the path, which also might become congested in less than one RTT. | |||
RTT. Further research would be useful to understand the effect of | Further research would be useful to understand the effect of | |||
decreasing the uncertainty by explicit feedback separately from | decreasing the uncertainty by explicit feedback separately from | |||
control theoretic stability questions. Furthermore, the flow startup | control theoretic stability questions. Furthermore, the flow startup | |||
also raises fairness questions. For instance, it is unclear whether | also raises fairness questions. For instance, it is unclear whether | |||
it could be reasonable to use a faster startup when an end-system | it could be reasonable to use a faster startup when an end-system | |||
detects that a path is currently not congested. | detects that a path is currently not congested. | |||
In summary, there are several topics for further research concerning | In summary, there are several topics for further research concerning | |||
flow startups: | flow startups: | |||
- Better theoretical understanding of the design and evaluation of | - Better theoretical understanding of the design and evaluation of | |||
flow startup mechanisms, concerning their impact on congestion | flow startup mechanisms, concerning their impact on congestion | |||
risk, stability, and fairness | risk, stability, and fairness. | |||
- Evaluate whether it may be appropriate to allow more | - Evaluate whether it may be appropriate to allow alternative | |||
differentiated starting schemes, e. g., to allow higher initial | starting schemes, e.g., to allow higher initial rates under certain | |||
rates under certain constraints; this also requires refining | constraints; this also requires refining fairness for startup | |||
fairness for startup situations | situations. | |||
- Better theoretical models for the effects of decreasing | - Better theoretical models for the effects of decreasing | |||
uncertainty by additional network feedback, in particular if the | uncertainty by additional network feedback, in particular if the | |||
path characteristics are very dynamic. | path characteristics are very dynamic. | |||
3.5 Challenge 5: Multi-domain Congestion Control | 3.5 Challenge 5: Multi-domain Congestion Control | |||
Transport protocols such as TCP operate over the Internet that is | Transport protocols such as TCP operate over the Internet, which is | |||
divided into autonomous systems. These systems are characterized by | divided into autonomous systems. These systems are characterized by | |||
their heterogeneity as IP networks are realized by a multitude of | their heterogeneity as IP networks are realized by a multitude of | |||
technologies. The variety of conditions and their variations leads to | technologies. | |||
correlation effects between policers that regulate traffic against | ||||
certain conformance criteria. | 3.5.1 Multi-domain Transport of Congestion Signals | |||
The variety of conditions and their variations leads to correlation | ||||
effects between policers that regulate traffic against certain | ||||
conformance criteria. | ||||
With the advent of techniques allowing for early detection of | With the advent of techniques allowing for early detection of | |||
congestion, packet loss is no longer the sole metric of congestion. | congestion, packet loss is no longer the sole metric of congestion. | |||
ECN (Explicit Congestion Notification) marks packets - set by active | ECN (Explicit Congestion Notification) marks packets - set by active | |||
queue management techniques - to convey congestion information trying | queue management techniques - to convey congestion information trying | |||
to prevent packet losses (packet loss and the number of packets | to prevent packet losses (packet loss and the number of packets | |||
marked gives an indication of the level of congestion). Using TCP | marked gives an indication of the level of congestion). Using TCP | |||
ACKs to feed back that information allows the hosts to realign their | ACKs to feed back that information allows the hosts to realign their | |||
transmission rate and thus encourage them to efficiently use the | transmission rate and thus encourage them to efficiently use the | |||
network. In IP, ECN uses the two unused bits of the TOS field | network. In IP, ECN uses the two unused bits of the TOS field | |||
skipping to change at page 22, line 46 | skipping to change at page 23, line 9 | |||
ECN [RFC3168] is an example of a congestion feedback mechanism from | ECN [RFC3168] is an example of a congestion feedback mechanism from | |||
the network toward hosts. The congestion-based feedback scheme | the network toward hosts. The congestion-based feedback scheme | |||
however has limitations when applied on an inter-domain basis. | however has limitations when applied on an inter-domain basis. | |||
Indeed, Section 8 and 19 of RFC3168 details consequences/implication | Indeed, Section 8 and 19 of RFC3168 details consequences/implication | |||
of i) a network erasing CE introduced earlier on the path and ii) a | of i) a network erasing CE introduced earlier on the path and ii) a | |||
network changing Not-ECT to ECT. Both of which could allow an | network changing Not-ECT to ECT. Both of which could allow an | |||
attacking network to cause excess congestion in an upstream network, | attacking network to cause excess congestion in an upstream network, | |||
even if the transports were behaving correctly. There are since so | even if the transports were behaving correctly. There are since so | |||
far two possible solutions to problem i) the ECN nonce [RFC3540] and | far two possible solutions to problem i) the ECN nonce [RFC3540] and | |||
the re-ECN incentive system. Nevertheless, the absence of IPv6 header | the re-ECN incentive system. Nevertheless, the absence of an IPv6 | |||
checksum implies that corruption could be more impacting than in the | header checksum implies that corruption could be more impacting than | |||
IPv4 case. Fragmentation is another: the ECN-nonce cannot protect | in the IPv4 case. Fragmentation is another: the ECN-nonce cannot | |||
against misbehaving receivers that conceal marked fragments, so some | protect against misbehaving receivers that conceal marked fragments, | |||
protection is lost in situations where Path MTU discovery is | so some protection is lost in situations where Path MTU discovery is | |||
disabled. So, there is still room for improvement on the ECN | disabled. So, there is still room for improvement on the ECN | |||
mechanism to cope with ECN when operating in multi-domain networks. | mechanism when operating in multi-domain networks. | |||
Operational/deployment experience is nevertheless required to | Operational/deployment experience is nevertheless required to | |||
determine the extent of these problems. The second problem is mainly | determine the extent of these problems. The second problem is mainly | |||
related to deployment and usage practices and does not seem to result | related to deployment and usage practices and does not seem to result | |||
into any specific research challenge. | in any specific research challenge. | |||
Another solution in a multi-domain environment may be the TCP rate | Another solution in a multi-domain environment may be the TCP rate | |||
controller (TRC), a traffic conditioner which regulates the TCP flow | controller (TRC), a traffic conditioner which regulates the TCP flow | |||
at the ingress node in each domain by controlling packet drops and | at the ingress node in each domain by controlling packet drops and | |||
delays of the packets in a flow. The outgoing traffic from a TRC | delays of the packets in a flow. The outgoing traffic from a TRC | |||
controlled domain is shaped in such a way that no packets are dropped | controlled domain is shaped in such a way that no packets are dropped | |||
at the policer. However, the TRC depends on the end-to-end TCP model, | at the policer. However, the TRC depends on the end-to-end TCP model, | |||
and thus the diversity of TCP implementations is a general problem. | and thus the diversity of TCP implementations is a general problem. | |||
3.5.1 Multi-domain operations | 3.5.2 Multi-domain Information Exchange | |||
Security is a challenge for multi-domain network operation. At domain | Security is a challenge for multi-domain network operation. At domain | |||
boundaries, authentication and authorization issues can arise | boundaries, authentication and authorization issues can arise | |||
whenever congestion control information is exchanged. From this | whenever congestion control information is exchanged. From this | |||
perspective, the Internet does not have so far a single general | perspective, the Internet does not have so far a single general | |||
security architecture that could be used in all cases. Many | security architecture that could be used in all cases. Many | |||
autonomous systems also only exchange some limited amount of | autonomous systems also only exchange some limited amount of | |||
information about their internal state (topology hiding principle), | information about their internal state (topology hiding principle), | |||
even though having more precise information could be highly | even though having more precise information could be highly | |||
beneficial for congestion control. Indeed, prevent revealing internal | beneficial for congestion control. Indeed, prevent revealing internal | |||
network structure is highly sensitive in multi-domain network | network structure is highly sensitive in multi-domain network | |||
operations and thus also a concern when it comes to the deployability | operations and thus also a concern when it comes to the deployability | |||
of congestion control schemes. For instance, an RCP-like scheme could | of congestion control schemes. For instance, a network-assisted | |||
reveal more information about the internal network dimensioning than | congestion control scheme with explicit signaling could reveal more | |||
TCP does today. | information about the internal network dimensioning than TCP does | |||
today. | ||||
The future evolution of the Internet inter-domain operation has to | The future evolution of the Internet inter-domain operation has to | |||
show whether more multi-domain information exchange can be | show whether more multi-domain information exchange can be | |||
effectively realized. This is of particular importance for congestion | effectively realized. This is of particular importance for congestion | |||
control schemes that make use of explicit per-datagram rate feedback | control schemes that make use of explicit per-datagram rate feedback | |||
(e.g. RCP or XCP) or explicit rate feedback or that use in-band | (e.g. RCP or XCP) or explicit rate feedback or that use in-band | |||
congestion signaling (e.g. QuickStart) or out-of-band signaling (e.g. | congestion signaling (e.g. QuickStart) or out-of-band signaling (e.g. | |||
CADPC/PTP). Explicit signaling exchanges at the inter-domain level | CADPC/PTP). Explicit signaling exchanges at the inter-domain level | |||
that result in local domain triggers are currently absent from the | that result in local domain triggers are currently absent from the | |||
Internet. From this perspective, security means resulting from | Internet. From this perspective, security means resulting from | |||
limited trust between different administrative units result in policy | limited trust between different administrative units result in policy | |||
enforcement that exacerbates difficulty encountered when explicit | enforcement that exacerbates difficulty encountered when explicit | |||
feedback congestion control information is exchanged between domains. | feedback congestion control information is exchanged between domains. | |||
3.5.2 Multi-domain Pseudowires | 3.5.3 Multi-domain Pseudowires | |||
Extending pseudo-wires across multiple domains poses specific issues. | Extending pseudo-wires across multiple domains poses specific issues. | |||
Pseudowires (PW) may carry non-TCP data flows (e.g. TDM traffic) over | Pseudowires (PW) may carry non-TCP data flows (e.g. TDM traffic) over | |||
a multi-domain IP networks. Structure Agnostic TDM over Packet | a multi-domain IP network. Structure Agnostic TDM over Packet | |||
(SATOP) [RFC4553], Circuit Emulation over Packet Switched Networks | (SATOP) [RFC4553], Circuit Emulation over Packet Switched Networks | |||
(CESoPSN), TDM over IP, are not responsive to congestion control in a | (CESoPSN), TDM over IP, are not responsive to congestion control in a | |||
TCP-friendly manner as discussed by [RFC2914] (see also [RFC5033]). | TCP-friendly manner as discussed by [RFC2914] (see also [RFC5033]). | |||
Moreover, it is not possible to simply reduce the flow rate of a TDM | Moreover, it is not possible to simply reduce the flow rate of a TDM | |||
PW when facing packet loss. Indeed, providers can rate control | PW when facing packet loss. Providers can rate control corresponding | |||
corresponding incoming traffic but it may not be able to detect that | incoming traffic but they may not be able to detect that PW carry TDM | |||
a PW carries TDM traffic. This can be illustrated with the following | traffic. This can be illustrated with the following example. | |||
example. | ||||
........... ............ | ........... ............ | |||
. . . | . . . | |||
S1 --- E1 --- . . | S1 --- E1 --- . . | |||
. | . . | . | . . | |||
. === E5 === E7 --- | . === E5 === E7 --- | |||
. | . . | | . | . . | | |||
S2 --- E2 --- . . | | S2 --- E2 --- . . | | |||
. . . | | | . . . | | | |||
........... . | v | ........... . | v | |||
skipping to change at page 25, line 4 | skipping to change at page 25, line 18 | |||
The problem arises for transit provider P2 that is not able to detect | The problem arises for transit provider P2 that is not able to detect | |||
that IP packets are carrying constant-bit rate service traffic for | that IP packets are carrying constant-bit rate service traffic for | |||
which the only useful congestion control mechanism would rely on | which the only useful congestion control mechanism would rely on | |||
implicit or explicit admission control. | implicit or explicit admission control. | |||
Assuming P1 providers are rate limiting BE traffic, a transit P2 | Assuming P1 providers are rate limiting BE traffic, a transit P2 | |||
provider router R may be subject to serious congestion as all TDM PWs | provider router R may be subject to serious congestion as all TDM PWs | |||
cross the same router. TCP-friendly traffic (e.g. each flow within | cross the same router. TCP-friendly traffic (e.g. each flow within | |||
the PW) would follow TCP's AIMD algorithm of reducing the sending | the PW) would follow TCP's AIMD algorithm of reducing the sending | |||
rate in half in response to each packet drop. Nevertheless, the PWs | rate in half in response to each packet drop. Nevertheless, the PWs | |||
of TDM traffic could take all the available capacity while other more | carrying TDM traffic could take all the available capacity while | |||
TCP-friendly traffic reduced itself to nothing. Note that | other more TCP-friendly traffic reduced itself to nothing. Note that | |||
the situation may simply occur because S4 suddenly turns on | the situation may simply occur because S4 suddenly turns on | |||
additional TDM channels. | additional TDM channels. | |||
It is neither possible nor desirable to assume that edge routers will | It is neither possible nor desirable to assume that edge routers will | |||
soon have the ability to detect the responsiveness of the carried | soon have the ability to detect the responsiveness of the carried | |||
traffic, but it is still important for transit providers to be able | traffic, but it is still important for transit providers to be able | |||
to police a fair, robust, responsive and efficient congestion control | to police a fair, robust, responsive and efficient congestion control | |||
technique in order to avoid impacting congestion responsive Internet | technique in order to avoid impacting congestion responsive Internet | |||
traffic. | traffic. | |||
skipping to change at page 25, line 52 | skipping to change at page 26, line 18 | |||
The preferential treatment of higher precedence traffic with | The preferential treatment of higher precedence traffic with | |||
appropriate congestion control mechanisms is still an open issue that | appropriate congestion control mechanisms is still an open issue that | |||
may, depending on the proposed solution, impact both the host and the | may, depending on the proposed solution, impact both the host and the | |||
network precedence awareness, and thereby congestion control. | network precedence awareness, and thereby congestion control. | |||
[RFC2990] points out that the interactions between congestion control | [RFC2990] points out that the interactions between congestion control | |||
and DiffServ [RFC2475] have yet to be addressed, and this statement | and DiffServ [RFC2475] have yet to be addressed, and this statement | |||
is still valid at the time of writing. | is still valid at the time of writing. | |||
There is also still work to be performed regarding lower precedence | There is also still work to be performed regarding lower precedence | |||
traffic – data transfers which are useful, yet not important enough | traffic - data transfers which are useful, yet not important enough | |||
to significantly impair any other traffic. Examples of applications | to significantly impair any other traffic. Examples of applications | |||
that could make use of such traffic are web caches and web browsers | that could make use of such traffic are web caches and web browsers | |||
(e.g. for pre-fetching) as well as peer-to-peer applications. There | (e.g. for pre-fetching) as well as peer-to-peer applications. There | |||
are proposals for achieving low precedence on a pure end-to-end basis | are proposals for achieving low precedence on a pure end-to-end basis | |||
(e.g. TCP-LP [Kuzmanovic03]), and there is a specification for | (e.g. TCP-LP [Kuzmanovic03]), and there is a specification for | |||
achieving it via router mechanisms [RFC3662]. It seems, however, that | achieving it via router mechanisms [RFC3662]. It seems, however, that | |||
such traffic is still hardly used, and sending lower precedence data | such traffic is still hardly used, and sending lower precedence data | |||
is not yet a common service on the Internet. | is not yet a common service on the Internet. | |||
3.7 Challenge 7: Misbehaving Senders and Receivers | 3.7 Challenge 7: Misbehaving Senders and Receivers | |||
skipping to change at page 26, line 27 | skipping to change at page 26, line 42 | |||
interest to honestly return feedback about congestion on the path, | interest to honestly return feedback about congestion on the path, | |||
effectively requesting a slower transfer. It is not in the sender's | effectively requesting a slower transfer. It is not in the sender's | |||
interest to reduce its rate in response to congestion if it can rely | interest to reduce its rate in response to congestion if it can rely | |||
on others to do so. Additionally, networks may have strategic reasons | on others to do so. Additionally, networks may have strategic reasons | |||
to make other networks appear congested. | to make other networks appear congested. | |||
Numerous strategies to improve the congestion control have already | Numerous strategies to improve the congestion control have already | |||
been identified. The IETF has particularly focused on misbehaving TCP | been identified. The IETF has particularly focused on misbehaving TCP | |||
receivers that could confuse a compliant sender into assigning | receivers that could confuse a compliant sender into assigning | |||
excessive network and/or server resources to that receiver (e.g. | excessive network and/or server resources to that receiver (e.g. | |||
[Sav99], [RFC3540]). But, although such strategies are worryingly | [Savage99], [RFC3540]). But, although such strategies are worryingly | |||
powerful, they do not yet seem common (however, evidence of attack | powerful, they do not yet seem common (however, evidence of attack | |||
prevalence is itself a research requirement). | prevalence is itself a research requirement). | |||
A growing proportion of Internet traffic comes from applications | A growing proportion of Internet traffic comes from applications | |||
designed not to use congestion control at all, or worse, applications | designed not to use congestion control at all, or worse, applications | |||
that add more forward error correction the more losses they | that add more forward error correction the more losses they | |||
experience. Some believe the Internet was designed to allow such | experience. Some believe the Internet was designed to allow such | |||
freedom so it can hardly be called misbehavior. But others consider | freedom so it can hardly be called misbehavior. But others consider | |||
that it is misbehavior to abuse this freedom [RFC3714], given one | that it is misbehavior to abuse this freedom [RFC3714], given one | |||
person's freedom can constrain the freedom of others (congestion | person's freedom can constrain the freedom of others (congestion | |||
skipping to change at page 27, line 8 | skipping to change at page 27, line 22 | |||
Note that the problem is not just misbehavior driven by a self- | Note that the problem is not just misbehavior driven by a self- | |||
interested desire for more bandwidth. Indeed, congestion control may | interested desire for more bandwidth. Indeed, congestion control may | |||
be attacked by someone who makes no gain for themselves, other than | be attacked by someone who makes no gain for themselves, other than | |||
the satisfaction of harming others (see Security Considerations in | the satisfaction of harming others (see Security Considerations in | |||
Section 4). | Section 4). | |||
Open research questions resulting from these considerations are: | Open research questions resulting from these considerations are: | |||
- By design, new congestion control protocols need to enable one end | - By design, new congestion control protocols need to enable one end | |||
to check the other for protocol compliance. | to check the other for protocol compliance. Still, it is unclear | |||
- We need to provide congestion control primitives that satisfy more | how such mechanisms would have to be designed. | |||
demanding applications (smoother than TFRC, faster than high speed | ||||
TCPs), so that application developers and users do not turn off | - Which congestion control primitives could satisfy more demanding | |||
congestion control to get the rate they expect and need. | applications (smoother than TFRC, faster than high speed TCPs), so | |||
that application developers and users do not turn off congestion | ||||
control to get the rate they expect and need. | ||||
Note also that self-restraint is disappearing from the Internet. So, | Note also that self-restraint is disappearing from the Internet. So, | |||
it may no longer be sufficient to rely on developers/users | it may no longer be sufficient to rely on developers/users | |||
voluntarily submitting themselves to congestion control. As main | voluntarily submitting themselves to congestion control. As a | |||
consequence, mechanisms to enforce fairness (see Sections 2.3, 3.4, | consequence, mechanisms to enforce fairness (see Sections 2.3, 3.4, | |||
and 3.5) need to have more emphasis within the research agenda. | and 3.5) need to have more emphasis within the research agenda. | |||
3.8 Other challenges | 3.8 Other Challenges | |||
This section provides additional challenges and open research issues | This section provides additional challenges and open research issues | |||
that are not (at this point in time) deemed very large or of | that are not (at this point in time) deemed very large or of | |||
different nature compared to the main challenges depicted so far. | different nature compared to the main challenges depicted so far. | |||
Note that this section may be complemented in future release of this | 3.8.1 RTT Estimation | |||
document by topics discussed during the last ICCRG meeting, co- | ||||
located with PFLDNet 2008 International Workshop. Topics of interest | ||||
include multipath congestion control, and congestion control for | ||||
multimedia codecs that only support certain set of data rates. | ||||
3.8.1 RTT estimation | ||||
Several congestion control schemes have to precisely know the round- | Several congestion control schemes have to precisely know the round- | |||
trip time (RTT) of a path. The RTT is a measure of the current delay | trip time (RTT) of a path. The RTT is a measure of the current delay | |||
on a network. It is defined as the delay between the sending of a | on a network. It is defined as the delay between the sending of a | |||
packet and the reception of a corresponding response, if echoed back | packet and the reception of a corresponding response, if echoed back | |||
immediately by receiver upon receipt of the packet. This corresponds | immediately by the receiver upon receipt of the packet. This | |||
to the sum of the one-way delay of the packet and the (potentially | corresponds to the sum of the one-way delay of the packet and the | |||
different) one-way delay of the response. Furthermore, any RTT | (potentially different) one-way delay of the response. Furthermore, | |||
measurement also includes some additional delay due to the packet | any RTT measurement also includes some additional delay due to the | |||
processing in both end-systems. | packet processing in both end-systems. | |||
There are various techniques to measure the RTT: Active measurements | There are various techniques to measure the RTT: active measurements | |||
inject special probe packets to the network and then measure the | inject special probe packets to the network and then measure the | |||
response time, using e.g. ICMP. In contrast, passive measurements | response time, using e.g. ICMP. In contrast, passive measurements | |||
determine the RTT from ongoing communication processes, without | determine the RTT from ongoing communication processes, without | |||
sending additional packets. | sending additional packets. | |||
The connection endpoints of reliable transport protocols such as TCP, | The connection endpoints of reliable transport protocols such as TCP, | |||
SCTP, and DCCP, as well as several application protocols, keep track | SCTP, and DCCP, as well as several application protocols, keep track | |||
of the RTT in order to dynamically adjust protocol parameters such as | of the RTT in order to dynamically adjust protocol parameters such as | |||
the retransmission timeout (RTO). They can implicitly measure the RTT | the retransmission timeout (RTO). They can implicitly measure the RTT | |||
on the sender side by observing the time difference between the | on the sender side by observing the time difference between the | |||
skipping to change at page 28, line 18 | skipping to change at page 28, line 31 | |||
measurements from retransmitted segments [RFC2988]. Traditionally, | measurements from retransmitted segments [RFC2988]. Traditionally, | |||
TCP implementations take one RTT measurement at a time (i. e., about | TCP implementations take one RTT measurement at a time (i. e., about | |||
once per RTT). As alternative, the TCP timestamp option [RFC1323] | once per RTT). As alternative, the TCP timestamp option [RFC1323] | |||
allows more frequent explicit measurements, since a sender can safely | allows more frequent explicit measurements, since a sender can safely | |||
obtain an RTT sample from every received acknowledgment. In | obtain an RTT sample from every received acknowledgment. In | |||
principle, similar measurement mechanisms are used by protocols other | principle, similar measurement mechanisms are used by protocols other | |||
than TCP. | than TCP. | |||
Sometimes it would be beneficial to know the RTT not only at the | Sometimes it would be beneficial to know the RTT not only at the | |||
sender, but also at the receiver, e.g., to find the one-way variation | sender, but also at the receiver, e.g., to find the one-way variation | |||
in delay due to one-way congestion.. A passive receiver can deduce | in delay due to one-way congestion. A passive receiver can deduce | |||
some information about the RTT by analyzing the sequence numbers of | some information about the RTT by analyzing the sequence numbers of | |||
received segments. But this method is error-prone and only works if | received segments. But this method is error-prone and only works if | |||
the sender permanently sends data. Other network entities on the path | the sender permanently sends data. Other network entities on the path | |||
can apply similar heuristics in order to approximate the RTT of a | can apply similar heuristics in order to approximate the RTT of a | |||
connection, but this mechanism is protocol-specific and requires per- | connection, but this mechanism is protocol-specific and requires per- | |||
connection state. In the current Internet, there is no simple and | connection state. In the current Internet, there is no simple and | |||
safe solution to determine the RTT of a connection in network | safe solution to determine the RTT of a connection in network | |||
entities other than the sender. | entities other than the sender. | |||
As outlined earlier in this document, the round-trip time is | As outlined earlier in this document, the round-trip time is | |||
skipping to change at page 29, line 37 | skipping to change at page 29, line 50 | |||
values that may not be optimal for most Internet communication. | values that may not be optimal for most Internet communication. | |||
Still, the impact of more aggressive settings is not well | Still, the impact of more aggressive settings is not well | |||
understood. | understood. | |||
- Clock granularities: RTT estimation depends on the clock | - Clock granularities: RTT estimation depends on the clock | |||
granularities of the protocol stacks. Even though there is a trend | granularities of the protocol stacks. Even though there is a trend | |||
towards higher precision timers, the limited granularity | towards higher precision timers, the limited granularity | |||
(particularly on low cost devices) may still prevent highly | (particularly on low cost devices) may still prevent highly | |||
accurate RTT estimations. | accurate RTT estimations. | |||
3.8.2 Malfunctioning devices | 3.8.2 Malfunctioning Devices | |||
There is a long history of malfunctioning devices harming the | There is a long history of malfunctioning devices harming the | |||
deployment of new and potentially beneficial functionality in the | deployment of new and potentially beneficial functionality in the | |||
Internet. Sometimes, such devices drop packets or even crash | Internet. Sometimes, such devices drop packets or even crash | |||
completely when a certain mechanism is used, causing users to opt for | completely when a certain mechanism is used, causing users to opt for | |||
reliability instead of performance and disable the mechanism, or | reliability instead of performance and disable the mechanism, or | |||
operating system vendors to disable it by default. One well-known | operating system vendors to disable it by default. One well-known | |||
example is ECN, whose deployment was long hindered by malfunctioning | example is ECN, whose deployment was long hindered by malfunctioning | |||
firewalls and is still hindered by malfunctioning home-hubs, but | firewalls and is still hindered by malfunctioning home-hubs, but | |||
there are many other examples (e.g. the Window Scaling option of TCP) | there are many other examples (e.g. the Window Scaling option of TCP) | |||
[Thaler07]. | [Thaler07]. | |||
skipping to change at page 30, line 12 | skipping to change at page 30, line 26 | |||
of eventually seeing them deployed in the Internet, it would be | of eventually seeing them deployed in the Internet, it would be | |||
useful to collect information about failures caused by devices of | useful to collect information about failures caused by devices of | |||
this sort, analyze the reasons for these failures, and determine | this sort, analyze the reasons for these failures, and determine | |||
whether there are ways for such devices to do what they intend to do | whether there are ways for such devices to do what they intend to do | |||
without causing unintended failures. Recommendation for vendors of | without causing unintended failures. Recommendation for vendors of | |||
these devices could be derived from such an analysis. It would also | these devices could be derived from such an analysis. It would also | |||
be useful to see whether there are ways for failures caused by such | be useful to see whether there are ways for failures caused by such | |||
devices to become more visible to endpoints, or for those failures to | devices to become more visible to endpoints, or for those failures to | |||
become more visible to the maintainers of such devices. | become more visible to the maintainers of such devices. | |||
3.8.3. Dependence on RTT | 3.8.3 Dependence on RTT | |||
AIMD window algorithms that have the goal of packet conservation end | AIMD window algorithms that have the goal of packet conservation end | |||
up converging on a rate that is inversely proportional to RTT. | up converging on a rate that is inversely proportional to RTT. | |||
However, control theoretic approaches to stability have shown that | However, control theoretic approaches to stability have shown that | |||
only the increase in rate (acceleration) not the target rate needs to | only the increase in rate (acceleration) not the target rate needs to | |||
be inversely proportional to RTT. | be inversely proportional to RTT. | |||
It is possible to have more aggressive behaviors for some demanding | It is possible to have more aggressive behaviors for some demanding | |||
applications as long as they are part of a mix with less aggressive | applications as long as they are part of a mix with less aggressive | |||
transports [Key04]. This beneficial effect of transport type mixing | transports [Key04]. This beneficial effect of transport type mixing | |||
is probably how the Internet currently manages to remain stable even | is probably how the Internet currently manages to remain stable even | |||
in the presence of TCP slow start, which is more aggressive than the | in the presence of TCP slow start, which is more aggressive than the | |||
theory allows for stability. Research giving deeper insight into | theory allows for stability. Research giving deeper insight into | |||
these aspects would be very useful. | these aspects would be very useful. | |||
3.8.4. Congestion Control in Multi-layered Networks | 3.8.4 Congestion Control in Multi-layered Networks | |||
We often forget that a network of IP nodes is just as vulnerable to | A network of IP nodes is just as vulnerable to congestion in the | |||
congestion in the lower layers between IP-capable nodes as it is to | lower layers between IP-capable nodes as it is to congestion on the | |||
congestion on the IP-capable nodes themselves. As we develop | IP-capable nodes themselves. If network elements take a greater part | |||
techniques for network equipment to take a greater part in congestion | in congestion control (ECN, XCP, RCP, etc. - see Section 3.1), these | |||
control (ECN, XCP, RCP etc – see Section 3.1), we must not forget | techniques will either need to be deployed at lower layers as well, | |||
that these techniques will either need to be deployed at lower layers | or they will need to interwork with lower layer mechanisms. | |||
as well, or they will need to interwork with lower layer mechanisms. | ||||
[ECN-tunnel] gives guidelines on propagating ECN from lower layers | [ECN-tunnel] gives guidelines on propagating ECN from lower layers | |||
upwards, but to the authors' knowledge the layering problem has not | upwards but to the authors' knowledge the layering problem has not | |||
been addressed for explicit rate protocol proposals such as XCP & | been addressed for explicit rate protocol proposals such as XCP and | |||
RCP. Some issues are straightforward matters of interoperability | RCP. Some issues are straightforward matters of interoperability | |||
(e.g. how exactly to copy fields up the layers). While others are | (e.g. how exactly to copy fields up the layers) while others are | |||
less obvious (e.g. re-framing issues: if RCP were deployed in a lower | less obvious (e.g. re-framing issues: if RCP were deployed in a lower | |||
layer, how might multiple small RCP frames all with different rates | layer, how might multiple small RCP frames all with different rates | |||
in their headers be assembled into a larger IP-layer datagram?). | in their headers be assembled into a larger IP-layer datagram?). | |||
Multi-layer considerations also confound many mechanisms that aim to | Multi-layer considerations also confound many mechanisms that aim to | |||
discover whether every node on the path supports the new congestion | discover whether every node on the path supports the new congestion | |||
control protocol. For instance, some proposals maintain a secondary | control protocol. For instance, some proposals maintain a secondary | |||
TTL field parallel to that in the IP header. Any nodes that support | TTL field parallel to that in the IP header. Any nodes that support | |||
the new behavior update both TTL fields, whereas legacy IP nodes will | the new behavior update both TTL fields, whereas legacy IP nodes will | |||
only update the IP TTL field. This allows the endpoints to check | only update the IP TTL field. This allows the endpoints to check | |||
whether all IP nodes on the path support the new behavior, in which | whether all IP nodes on the path support the new behavior, in which | |||
case both TTLs will be equal at the receiver. But mechanisms like | case both TTLs will be equal at the receiver. But mechanisms like | |||
these overlook nodes at lower layers that might not support the new | these overlook nodes at lower layers that might not support the new | |||
behavior. | behavior. | |||
It should also be possible to include the issue of congestion control | A further related issue is congestion control across overlay networks | |||
across overlay networks of relays under the general area of multi- | of relays. | |||
layer congestion control. | ||||
3.8.5. Multipath End-to-end Congestion Control and Traffic Engineering | 3.8.5 Multipath End-to-end Congestion Control and Traffic Engineering | |||
Recent work has shown that multipath endpoint congestion control | Recent work has shown that multipath endpoint congestion control | |||
[Kelly05] offers considerable benefits in terms of resilience and | [Kelly05] offers considerable benefits in terms of resilience and | |||
resource usage efficiency. By pooling the resources on all paths, | resource usage efficiency. By pooling the resources on all paths, | |||
even nodes not using multiple paths benefit from those that are. | even nodes not using multiple paths benefit from those that are. | |||
Nowadays, there is considerable further research to do in this area, | Nowadays, there is considerable further research to do in this area, | |||
particularly to understand interactions with network operator | particularly to understand interactions with network operator | |||
controlled route provision and traffic engineering, and indeed | controlled route provision and traffic engineering, and indeed | |||
whether multipath congestion control can perform better traffic | whether multipath congestion control can perform better traffic | |||
engineering than the network itself, given the right incentives. | engineering than the network itself, given the right incentives. | |||
3.8.6 ALGs and Middleboxes | 3.8.6 ALGs and Middleboxes | |||
An increasing number of application layer gateways (ALG), | An increasing number of application layer gateways (ALG), | |||
middleboxes, and proxies (see Section 3.6 of [RFC2775]) are deployed | middleboxes, and proxies (see Section 3.6 of [RFC2775]) is deployed | |||
at domain boundaries to verify conformance but also filter traffic | at domain boundaries to verify conformance but also filter traffic | |||
and control flows to e.g. prevent among other information leaking | and control flows. One motivation is to prevent information beyond | |||
between autonomous systems beyond routing information. These systems | routing data leaking between autonomous systems. These systems split | |||
split up end-to-end TCP connections and prevent end-to-end congestion | up end-to-end TCP connections and prevent end-to-end congestion | |||
control. On the other side, transport over encrypted tunnels may not | control. Furthermore, transport over encrypted tunnels may not allow | |||
allow that other network entities to participate in congestion | that other network entities to participate in congestion control. | |||
control. | ||||
Basically, such systems disrupt the primal and dual congestion | Basically, such systems disrupt the primal and dual congestion | |||
control components whereas their effects have not been so far | control components. In particular, end-to-end congestion control may | |||
systematically studied. From this perspective, one shall account for | be replaced by flow-control backpressure mechanisms on the split | |||
two levels of interference: | connections. A large variety of ALGs and middleboxes uses such | |||
- The "transparent" case i.e. the end-point address from the sender | mechanisms to improve the performance of applications (Performance | |||
perspective is still the receiver (the destination IP address). For | Enhancing Proxies, Application Accelerators, etc.). However, the | |||
instance relay systems intercept payload but do not relay | implications of such mechanisms, which are often proprietary and not | |||
congestion control information. | documented, have not been studied systematically so far. | |||
- The "non-transparent" case is not a problem (back-to-back | ||||
connections) results in a lesser problem. Indeed, although these | There are two levels of interference: | |||
devices interfere with end-to-end network transparency, they | ||||
correctly terminating network, transport and application layer | - The "transparent" case, i.e. the end-point address from the sender | |||
protocols on both sides. | perspective is still visible to the receiver (the destination IP | |||
address). An example are relay systems that intercept payload but | ||||
do not relay congestion control information. Such middleboxes can | ||||
prevent the operation of end-to-end congestion control. | ||||
- The "non-transparent" case, which causes less problems. Although | ||||
these devices interfere with end-to-end network transparency, they | ||||
correctly terminate network, transport and application layer | ||||
protocols on both sides, which individually can be congestion | ||||
controlled. | ||||
4. Security Considerations | 4. Security Considerations | |||
Misbehavior may be driven by pure malice, or malice may in turn be | Misbehavior may be driven by pure malice, or malice may in turn be | |||
driven by wider selfish interests, e.g. using distributed denial of | driven by wider selfish interests, e.g. using distributed denial of | |||
service (DDoS) attacks to gain rewards by extortion [RFC4948]. DDoS | service (DDoS) attacks to gain rewards by extortion [RFC4948]. DDoS | |||
attacks are possible both because of vulnerabilities in operating | attacks are possible both because of vulnerabilities in operating | |||
systems and because the Internet delivers packets without requiring | systems and because the Internet delivers packets without requiring | |||
congestion control. | congestion control. | |||
skipping to change at page 32, line 32 | skipping to change at page 33, line 5 | |||
spoofing. But if mechanisms to enforce congestion control fairness | spoofing. But if mechanisms to enforce congestion control fairness | |||
were robust to both selfishness and malice [Bri06] they would also | were robust to both selfishness and malice [Bri06] they would also | |||
naturally mitigate denial of service, which can be considered (from | naturally mitigate denial of service, which can be considered (from | |||
the perspective of well-behaving Internet user) as a congestion | the perspective of well-behaving Internet user) as a congestion | |||
control enforcement problem. Even some denial of service attacks on | control enforcement problem. Even some denial of service attacks on | |||
hosts (rather than the network) could be considered as a congestion | hosts (rather than the network) could be considered as a congestion | |||
control enforcement issue at the higher layer. But clearly there are | control enforcement issue at the higher layer. But clearly there are | |||
also denial of service attacks that would not be solved by enforcing | also denial of service attacks that would not be solved by enforcing | |||
congestion control. | congestion control. | |||
5. Contributors | 5. References | |||
This document is the result of a collective effort to which the | ||||
following people have contributed: | ||||
Dimitri Papadimitriou <dimitri.papadimitriou@alcatel-lucent.be> | ||||
Michael Welzl <michael.welzl@uibk.ac.at> | ||||
Wesley Eddy <weddy@grc.nasa.gov> | ||||
Bela Berde <bela.berde@gmx.de> | ||||
Paulo Loureiro <loureiro.pjg@gmail.com> | ||||
Chris Christou <christou_chris@bah.com> | ||||
Michael Scharf <michael.scharf@ikr.uni-stuttgart.de> | ||||
6. References | ||||
6.1 Normative References | 5.1 Normative References | |||
[RFC791] Postel, J., "Internet Protocol", STD 5, RFC 791, | [RFC791] Postel, J., "Internet Protocol", STD 5, RFC 791, | |||
September 1981. | September 1981. | |||
[RFC793] Postel, J., "Transmission Control Protocol", STD 7, | [RFC793] Postel, J., "Transmission Control Protocol", STD 7, | |||
RFC793, September 1981. | RFC793, September 1981. | |||
[RFC896] Nagle, J., "Congestion Control in IP/TCP", RFC 896, | [RFC896] Nagle, J., "Congestion Control in IP/TCP", RFC 896, | |||
January 1984. | January 1984. | |||
[RFC1323] Jacobson, V., Braden, R., Borman, D., "TCP Extensions for | [RFC1323] Jacobson, V., Braden, R., and Borman, D., "TCP Extensions | |||
High Performance", RFC 1323, May 1992. | for High Performance", RFC 1323, May 1992. | |||
[RFC1701] Hanks, S., Li, T, Farinacci, D., and P. Traina, "Generic | ||||
Routing Encapsulation", RFC 1701, October 1994. | ||||
[RFC1958] Carpenter, B., Ed., "Architectural Principles of the | ||||
Internet", RFC 1958, June 1996. | ||||
[RFC1958] Carpenter, B., Ed., “Architectural Principles of the | ||||
[RFC2309] Braden, B., et al., "Recommendations on queue management | [RFC2309] Braden, B., et al., "Recommendations on queue management | |||
and congestion avoidance in the Internet", RFC 2309, | and congestion avoidance in the Internet", RFC 2309, | |||
April 1998. | April 1998. | |||
[RFC2003] Perkins, C., "IP Encapsulation within IP", RFC 1633, | [RFC2003] Perkins, C., "IP Encapsulation within IP", RFC 1633, | |||
October 1996. | October 1996. | |||
[RFC2474] Nichols, K., Blake, S. Baker, F. and D. Black, | [RFC2474] Nichols, K., Blake, S. Baker, F. and D. Black, | |||
"Definition of the Differentiated Services Field (DS | "Definition of the Differentiated Services Field (DS | |||
Field) in the IPv4 and IPv6 Headers", RFC 2474, December | Field) in the IPv4 and IPv6 Headers", RFC 2474, December | |||
skipping to change at page 33, line 34 | skipping to change at page 33, line 49 | |||
[RFC2475] Blake, S., Black, D., Carlson, M., Davies, E., Wang, Z. | [RFC2475] Blake, S., Black, D., Carlson, M., Davies, E., Wang, Z. | |||
and Weiss, W., "An Architecture for Differentiated | and Weiss, W., "An Architecture for Differentiated | |||
Services", RFC 2475, December 1998. | Services", RFC 2475, December 1998. | |||
[RFC2581] Allman, M., Paxson, V., and W. Stevens, "TCP Congestion | [RFC2581] Allman, M., Paxson, V., and W. Stevens, "TCP Congestion | |||
Control", RFC 2581, April 1999. | Control", RFC 2581, April 1999. | |||
[RFC2861] Handley, M., J. Padhye, J., and S., Floyd, "TCP | [RFC2861] Handley, M., J. Padhye, J., and S., Floyd, "TCP | |||
Congestion Window Validation", RFC 2861, June 2000. | Congestion Window Validation", RFC 2861, June 2000. | |||
[RFC2784] Farinacci, D., Li, T., Hanks, S., Meyer, D. and P. | ||||
Traina, "Generic Routing Encapsulation (GRE)", RFC 2784, | ||||
March 2000. | ||||
[RFC2914] Floyd, S., "Congestion Control Principles", BCP 41, | [RFC2914] Floyd, S., "Congestion Control Principles", BCP 41, | |||
RFC 2914, September 2000. | RFC 2914, September 2000. | |||
[RFC2988] Paxson, V. and Allman, M., "Computing TCP's | [RFC2988] Paxson, V. and Allman, M., "Computing TCP's | |||
Retransmission Timer", RFC 2988, Nov. 2000 | Retransmission Timer", RFC 2988, November 2000. | |||
[RFC2990] Huston, G., "Next Steps for the IP QoS Architecture", | [RFC2990] Huston, G., "Next Steps for the IP QoS Architecture", | |||
RFC 2990, November 2000. | RFC 2990, November 2000. | |||
[RFC3168] Ramakrishnan, K., Floyd, S., and D. Black, "The Addition | [RFC3168] Ramakrishnan, K., Floyd, S., and D. Black, "The Addition | |||
of Explicit Congestion Notification (ECN) to IP", | of Explicit Congestion Notification (ECN) to IP", | |||
RFC 3168, September 2001. | RFC 3168, September 2001. | |||
[RFC3448] Handley, M., Floyd, S., Padhye, J., and J. Widmer, "TCP | [RFC3448] Handley, M., Floyd, S., Padhye, J., and J. Widmer, "TCP | |||
Friendly Rate Control (TFRC): Protocol Specification", | Friendly Rate Control (TFRC): Protocol Specification", | |||
skipping to change at page 34, line 20 | skipping to change at page 34, line 35 | |||
Per-Domain Behavior for Differentiated Services", RFC | Per-Domain Behavior for Differentiated Services", RFC | |||
3662, December 2003. | 3662, December 2003. | |||
[RFC3714] Floyd, S., and J. Kempf, Eds. "IAB Concerns Regarding | [RFC3714] Floyd, S., and J. Kempf, Eds. "IAB Concerns Regarding | |||
Congestion Control for Voice Traffic in the Internet", | Congestion Control for Voice Traffic in the Internet", | |||
RFC 3714, March 2004. | RFC 3714, March 2004. | |||
[RFC3742] Floyd, S., "Limited Slow-Start for TCP with Large | [RFC3742] Floyd, S., "Limited Slow-Start for TCP with Large | |||
Congestion Windows", RFC 3742, March 2004. | Congestion Windows", RFC 3742, March 2004. | |||
[RFC3985] Bryant, S. and P. Pate, "Pseudo Wire Emulation Edge-to- | [RFC3985] Bryant, S., and P. Pate, "Pseudo Wire Emulation Edge-to- | |||
Edge (PWE3) Architecture", RFC 3985, March 2005. | Edge (PWE3) Architecture", RFC 3985, March 2005. | |||
[RFC4340] Kohler, E., Handley, M., and S. Floyd, "Datagram | [RFC4340] Kohler, E., Handley, M., and S. Floyd, "Datagram | |||
Congestion Control Protocol (DCCP)", RFC 4340, March | Congestion Control Protocol (DCCP)", RFC 4340, March | |||
2006. | 2006. | |||
[RFC4341] Floyd, S. and E. Kohler, "Profile for Datagram Congestion | [RFC4341] Floyd, S. and E. Kohler, "Profile for Datagram Congestion | |||
Control Protocol (DCCP) Congestion Control ID 2: TCP-like | Control Protocol (DCCP) Congestion Control ID 2: TCP-like | |||
Congestion Control", RFC 4341, March 2006. | Congestion Control", RFC 4341, March 2006. | |||
[RFC4342] Floyd, S., Kohler, E., and J. Padhye, "Profile for | [RFC4342] Floyd, S., Kohler, E., and J. Padhye, "Profile for | |||
Datagram Congestion Control Protocol (DCCP) Congestion | Datagram Congestion Control Protocol (DCCP) Congestion | |||
Control ID 3: TCP-Friendly Rate Control (TFRC)", RFC | Control ID 3: TCP-Friendly Rate Control (TFRC)", RFC | |||
4342, March 2006. | 4342, March 2006. | |||
[RFC4553] Vainshtein, A. and Y. Stein, "Structure-Agnostic Time | [RFC4553] Vainshtein, A., and Y. Stein, "Structure-Agnostic Time | |||
Division Multiplexing (TDM) over Packet (SAToP)", | Division Multiplexing (TDM) over Packet (SAToP)", | |||
RFC 4553, June 2006. | RFC 4553, June 2006. | |||
[RFC4614] Duke, M., R. Braden, R., Eddy, W., and Blanton, E., "A | [RFC4614] Duke, M., R. Braden, R., Eddy, W., and E. Blanton, "A | |||
Roadmap for Transmission Control Protocol (TCP) | Roadmap for Transmission Control Protocol (TCP) | |||
Specification Documents", RFC 4614, September 2006. | Specification Documents", RFC 4614, September 2006. | |||
[RFC4782] Floyd, S., Allman, M., Jain, A., and P. Sarolahti, | [RFC4782] Floyd, S., Allman, M., Jain, A., and P. Sarolahti, | |||
"Quick-Start for TCP and IP", RFC 4782, Jan. 2007. | "Quick-Start for TCP and IP", RFC 4782, January 2007. | |||
[RFC4948] Andersson, L., Davies, E., and L. Zhang, "Report from the | [RFC4948] Andersson, L., Davies, E., and L. Zhang, "Report from the | |||
IAB workshop on Unwanted Traffic March 9-10, 2006", RFC | IAB workshop on Unwanted Traffic March 9-10, 2006", RFC | |||
4948, August 2007. | 4948, August 2007. | |||
[RFC5033] Floyd, S., and M. Allman, "Specifying New Congestion | [RFC5033] Floyd, S., and M. Allman, "Specifying New Congestion | |||
Control Algorithms", RFC 5033, Aug. 2007. | Control Algorithms", RFC 5033, August 2007. | |||
[RFC5405] Eggert, L., and G. Fairhurst, "Unicast UDP Usage | [RFC5405] Eggert, L., and G. Fairhurst, "Unicast UDP Usage | |||
Guidelines for Application Designers, RFC 5405, November | Guidelines for Application Designers, RFC 5405, November | |||
2008. | 2008. | |||
[iccrg-rfcs]Welzl, M., and W. Eddy, "Congestion Control in the RFC | [ICCRG-RFCs] Welzl, M., and W. Eddy, "Congestion Control in the RFC | |||
Series", Internet Draft, work in Progress, October 2008. | Series", Internet Draft, work in Progress, October 2008. | |||
6.2 Informative References | 5.2 Informative References | |||
[Allman99] Allman, M. and V. Paxson, "On Estimating End-to-End | [Allman99] Allman, M., and V. Paxson, "On Estimating End-to-End | |||
Network Path Properties", Proceedings of ACM SIGCOMM'99, | Network Path Properties", Proceedings of ACM SIGCOMM'99, | |||
September 1999. | September 1999. | |||
[Andrew00] L. Andrew, B. Wydrowski and S. Low, "An Example of | [Andrew05] Andrew, L., Wydrowski, B., and S. Low, "An Example of | |||
Instability in XCP", Manuscript available at | Instability in XCP", Manuscript available at | |||
<http://netlab.caltech.edu/maxnet/XCP_instability.pdf> | <http://netlab.caltech.edu/maxnet/XCP_instability.pdf> | |||
[Ath01] Athuraliya, S., Low, S., Li, V., and Q. Yin, "REM: Active | [Ath01] Athuraliya, S., Low, S., Li, V., and Q. Yin, "REM: Active | |||
queue management", IEEE Network Magazine, vol.15, no.3, | Queue Management", IEEE Network Magazine, Vol.15, No.3, | |||
pp.48-53, May 2001. | pp.48-53, May 2001. | |||
[BALAN01] Balan, R. K., Lee, B.P., Kumar, K.R.R., Jacob, L., Seah, | [Balan01] Balan, R. K., Lee, B.P., Kumar, K.R.R., Jacob, L., Seah, | |||
W.K.G., and Ananda, A.L., "TCP HACK: TCP Header Checksum | W.K.G., and Ananda, A.L., "TCP HACK: TCP Header Checksum | |||
Option to Improve Performance over Lossy Links", | Option to Improve Performance over Lossy Links", | |||
Proceedings of IEEE INFOCOM'01, Anchorage (Alaska), USA, | Proceedings of IEEE INFOCOM'01, Anchorage (Alaska), USA, | |||
April 2001. | April 2001. | |||
[Bonald00] Bonald, T., May, M., and J.-C. Bolot, "Analytic | [Bonald00] Bonald, T., May, M., and J.-C. Bolot, "Analytic | |||
Evaluation of RED Performance," Proceedings of IEEE | Evaluation of RED Performance," Proceedings of IEEE | |||
INFOCOM'00, Tel Aviv, Israel, March 2000. | INFOCOM'00, Tel Aviv, Israel, March 2000. | |||
[Bri08] Briscoe, B., Moncaster, T. and L. Burness, "Problem | [Bri08] Briscoe, B., Moncaster, T. and L. Burness, "Problem | |||
skipping to change at page 36, line 8 | skipping to change at page 36, line 23 | |||
Workshop on the Economics of Securing the Information | Workshop on the Economics of Securing the Information | |||
Infrastructure, October 2006. | Infrastructure, October 2006. | |||
<http://wesii.econinfosec.org/draft.php?paper_id=19> | <http://wesii.econinfosec.org/draft.php?paper_id=19> | |||
[Bryant08] Bryant, S., Davie, B., Martini, L., and E. Rosen, | [Bryant08] Bryant, S., Davie, B., Martini, L., and E. Rosen, | |||
"Pseudowire Congestion Control Framework", Work in | "Pseudowire Congestion Control Framework", Work in | |||
Progress, draft-ietf-pwe3-congestion-frmwk-01.txt, May | Progress, draft-ietf-pwe3-congestion-frmwk-01.txt, May | |||
2008. | 2008. | |||
[Chester04] Chesterfield, J., Chakravorty, R., Banerjee, S., | [Chester04] Chesterfield, J., Chakravorty, R., Banerjee, S., | |||
Rodriguez, P., Pratt, I. and Crowcroft, J., "Transport | Rodriguez, P., Pratt, I., and Crowcroft, J., "Transport | |||
level optimisations for streaming media over wide-area | level optimisations for streaming media over wide-area | |||
wireless networks", WIOPT'04, March 2004. | wireless networks", WIOPT'04, March 2004. | |||
[Chiu89] Chiu, D. M., and R. Jain, "Analysis of the increase and | [Chiu89] Chiu, D. M., and R. Jain, "Analysis of the increase and | |||
decrease algorithms for congestion avoidance in computer | decrease algorithms for congestion avoidance in computer | |||
networks", Computer Networks and ISDN Systems, vol.17, | networks", Computer Networks and ISDN Systems, Vol.17, | |||
pp.1-14, 1989. | pp.1-14, 1989. | |||
[Clark98] Clark, D. and W. Fang, "Explicit Allocation of Best- | [Clark88] Clark, D., "The design philosophy of the DARPA internet | |||
Effort Packet Delivery Service," IEEE/ACM Transactions on | protocols", ACM SIGCOMM Computer Communication Review, | |||
Networking, vol.6, no.4, pp.362-373, August 1998. | Vol.18, No.4, pp.106-114, August 1988. | |||
[Clark98] Clark, D., and W. Fang, "Explicit Allocation of Best- | ||||
Effort Packet Delivery Service," IEEE/ACM Transactions | ||||
on Networking, Vol.6, No.4, pp.362-373, August 1998. | ||||
[Dukki05] Dukkipati, N., Kobayashi, M., Zhang-Shen, R. and N., | [Dukki05] Dukkipati, N., Kobayashi, M., Zhang-Shen, R. and N., | |||
McKeown, "Processor Sharing Flows in the Internet", | McKeown, "Processor Sharing Flows in the Internet", | |||
Proceedings of International Workshop on QoS (IWQoS'05), | Proceedings of International Workshop on QoS (IWQoS'05), | |||
June 2005. | June 2005. | |||
[Dukki06] Dukkipati, N. and N. McKeown, "Why Flow-Completion Time | [Dukki06] Dukkipati, N. and N. McKeown, "Why Flow-Completion Time | |||
is the Right Metric for Congestion Control", ACM SIGCOMM | is the Right Metric for Congestion Control", ACM SIGCOMM | |||
Computer Communication Review, Vol.36, No.1, January | Computer Communication Review, Vol.36, No.1, January | |||
2006. | 2006. | |||
[ECN-tunnel]Briscoe, B., "Layered Encapsulation of Congestion | [ECN-tunnel]Briscoe, B., "Layered Encapsulation of Congestion | |||
Notification", draft-briscoe-tsvwg-ecn-tunnel, Work in | Notification", draft-briscoe-tsvwg-ecn-tunnel, Work in | |||
progress. | progress. | |||
[ECODE] "ECODE Project", European Commission Seventh Framework | ||||
Program Contract Number: INFSO-ICT-223936 | ||||
<http://www.ecode-project.eu> | ||||
[Falk07] Falk, A., et al., "Specification for the Explicit Control | [Falk07] Falk, A., et al., "Specification for the Explicit Control | |||
Protocol (XCP)", Work in Progress, draft-falk-xcp-spec- | Protocol (XCP)", Work in Progress, draft-falk-xcp-spec- | |||
03.txt, July 2007. | 03.txt, July 2007. | |||
[Firoiu00] Firoiu, V., and M. Borden, "A Study of Active Queue | [Firoiu00] Firoiu, V., and M. Borden, "A Study of Active Queue | |||
Management for Congestion Control," Proceedings of IEEE | Management for Congestion Control," Proceedings of IEEE | |||
INFOCOM'00, Tel Aviv, Israel, March 2000. | INFOCOM'00, Tel Aviv, Israel, March 2000. | |||
[Floyd93] Floyd, S., and V. Jacobson, "Random early detection | [Floyd93] Floyd, S., and V. Jacobson, "Random early detection | |||
gateways for congestion avoidance," IEEE/ACM Transactions | gateways for congestion avoidance," IEEE/ACM Transactions | |||
on Networking, vol.1, no.4, pp.397-413, August 1993. | on Networking, Vol.1, No.4, pp.397-413, August 1993. | |||
[Floyd94] Floyd, S., "TCP and Explicit Congestion Notification", | [Floyd94] Floyd, S., "TCP and Explicit Congestion Notification", | |||
ACM Computer Communication Review, vol.24, no.5, pp.10- | ACM Computer Communication Review, Vol.24, No.5, pp.10- | |||
23, October 1994. | 23, October 1994. | |||
[Floyd08] Floyd, S., and M. Allman, "Comments on the Usefulness of | [Floyd08] Floyd, S., and M. Allman, "Comments on the Usefulness of | |||
Simple Best-Effort Traffic", RFC 5290, July 2008. | Simple Best-Effort Traffic", RFC 5290, July 2008. | |||
[Hollot01] Hollot, C., Misra, V., Towsley, D., and W.-B. Gong, "A | [Hollot01] Hollot, C., Misra, V., Towsley, D., and W.-B. Gong, "A | |||
Control Theoretic Analysis of RED," Proceedings of IEEE | Control Theoretic Analysis of RED," Proceedings of IEEE | |||
INFOCOM'01, Anchorage, Alaska, April 2001. | INFOCOM'01, Anchorage, Alaska, April 2001. | |||
[Jacobson88]Jacobson, V., "Congestion Avoidance and Control", | [Jacobson88]Jacobson, V., "Congestion Avoidance and Control", | |||
skipping to change at page 37, line 28 | skipping to change at page 37, line 51 | |||
[Jain90] Jain, R., "Congestion Control in Computer Networks: | [Jain90] Jain, R., "Congestion Control in Computer Networks: | |||
Trends and Issues", IEEE Network, pp. 24-30, May 1990. | Trends and Issues", IEEE Network, pp. 24-30, May 1990. | |||
[Jin04] Jin, Ch., Wei, D.X., and S. Low, "FAST TCP: Motivation, | [Jin04] Jin, Ch., Wei, D.X., and S. Low, "FAST TCP: Motivation, | |||
Architecture, Algorithms, Performance," Proceedings of | Architecture, Algorithms, Performance," Proceedings of | |||
IEEE INFOCOM'04, Hong-Kong, China, March 2004. | IEEE INFOCOM'04, Hong-Kong, China, March 2004. | |||
[Katabi02] Katabi, D., M. Handley, and C. Rohr, "Internet Congestion | [Katabi02] Katabi, D., M. Handley, and C. Rohr, "Internet Congestion | |||
Control for Future High Bandwidth-Delay Product | Control for Future High Bandwidth-Delay Product | |||
Environments", Proceedings of ACM SIGCOMM'02 Symposium, | Environments", Proceedings of ACM SIGCOMM'02 Symposium, | |||
pp. 89-102, August 2002. | August 2002. | |||
[Kelly98] Kelly, F., Maulloo, A., and D. Tan, "Rate control in | [Kelly98] Kelly, F., Maulloo, A., and D. Tan, "Rate control in | |||
communication networks: shadow prices, proportional | communication networks: shadow prices, proportional | |||
fairness, and stability," Journal of the Operational | fairness, and stability," Journal of the Operational | |||
Research Society, vol.49, pp. 237–252, 1998. | Research Society, Vol.49, pp.237-252, 1998. | |||
[Kelly05] Kelly, F., and Th. Voice, "Stability of end-to-end | [Kelly05] Kelly, F., and Th. Voice, "Stability of end-to-end | |||
algorithms for joint routing and rate control", ACM | algorithms for joint routing and rate control", ACM | |||
SIGCOMM Computer Communication Review, Vol.35, No.2, pp. | SIGCOMM Computer Communication Review, Vol.35, No.2, pp. | |||
5-12, April 2005. | 5-12, April 2005. | |||
[Keshav] Keshav, S., "What is congestion and what is congestion | [Keshav07] Keshav, S., "What is congestion and what is congestion | |||
control", Presentation at IRTF ICCRG Workshop, PFLDNet | control", Presentation at IRTF ICCRG Workshop, PFLDNet | |||
2007, Los Angeles (California), USA, February 2007. | 2007, Los Angeles (California), USA, February 2007. | |||
[Key04] Key, P., Massoulié, L., Bain, A., and F. Kelly, "Fair | [Key04] Key, P., Massoulie, L., Bain, A., and F. Kelly, "Fair | |||
Internet Traffic Integration: Network Flow Models and | Internet Traffic Integration: Network Flow Models and | |||
Analysis", Annales des Télécommunications, Vol.59, No.11- | Analysis", Annales des Telecommunications, Vol.59, No.11- | |||
12, pp. 1338-1352, November-December 2004. | 12, pp. 1338-1352, November-December 2004. | |||
[Krishnan04] Krishnan, R., Sterbenz, J., Eddy, W., Partridge, C., and | [Krishnan04] Krishnan, R., Sterbenz, J., Eddy, W., Partridge, C., and | |||
M. Allman, "Explicit Transport Error Notification (ETEN) | M. Allman, "Explicit Transport Error Notification (ETEN) | |||
for Error-Prone Wireless and Satellite Networks", | for Error-Prone Wireless and Satellite Networks", | |||
Computer Networks, vol.46, no.3, October 2004. | Computer Networks, Vol.46, No.3, October 2004. | |||
[Kuzmanovic03] Kuzmanovic, A., and E. W. Knightly, "TCP-LP: A | [Kuzmanovic03] Kuzmanovic, A., and E. W. Knightly, "TCP-LP: A | |||
Distributed Algorithm for Low Priority Data Transfer", | Distributed Algorithm for Low Priority Data Transfer", | |||
Proceedings of IEEE INFOCOM'03, San Francisco | Proceedings of IEEE INFOCOM'03, San Francisco | |||
(California), USA, April 2003. | (California), USA, April 2003. | |||
[Low05] Low, S., L. Andrew, L., and B. Wydrowski, "Understanding | [Low05] Low, S., Andrew, L., and B. Wydrowski, "Understanding | |||
XCP: equilibrium and fairness", Proceedings of IEEE | XCP: equilibrium and fairness", Proceedings of IEEE | |||
INFOCOM'05, Miami (Florida), USA, March 2005. | INFOCOM'05, Miami (Florida), USA, March 2005. | |||
[Low03.2] Low, S., Paganini, F., Wang, J., and J. Doyle, "Linear | [Low03.2] Low, S., Paganini, F., Wang, J., and J. Doyle, "Linear | |||
stability of TCP/RED and a scalable control", Computer | stability of TCP/RED and a scalable control", Computer | |||
Networks Journal, vol.43, no.5, pp.633-647, December | Networks Journal, Vol.43, No.5, pp.633-647, December | |||
2003. | 2003. | |||
[Low03.1] Low, S., "A duality model of TCP and queue management | [Low03.1] Low, S., "A duality model of TCP and queue management | |||
algorithms", IEEE/ACM Transactions on Networking, vol.11, | algorithms", IEEE/ACM Transactions on Networking, Vol.11, | |||
no.4, pp.525–536, August 2003. | No.4, pp.525-536, August 2003. | |||
[Low02] Low, S., Paganini, F., Wang, J., Adlakha, S., and J.C. | [Low02] Low, S., Paganini, F., Wang, J., Adlakha, S., and J.C. | |||
Doyle, "Dynamics of TCP/RED and a Scalable Control", | Doyle, "Dynamics of TCP/RED and a Scalable Control", | |||
Proceedings of IEEE INFOCOM'02, New York (New-Jersey), | Proceedings of IEEE INFOCOM'02, New York (New-Jersey), | |||
USA, June 2002. | USA, June 2002. | |||
[LT-TCP] Tickoo, O., Subramanian, V., Kalyanaraman, S., and K.K. | [LT-TCP] Tickoo, O., Subramanian, V., Kalyanaraman, S., and K.K. | |||
Ramakrishnan, "LT-TCP: End-to-End Framework to Improve | Ramakrishnan, "LT-TCP: End-to-End Framework to Improve | |||
TCP Performance over Networks with Lossy Channels", | TCP Performance over Networks with Lossy Channels", | |||
Proceedings of International Workshop on QoS (IWQoS), | Proceedings of International Workshop on QoS (IWQoS), | |||
June 2005. | June 2005. | |||
[Mascolo01] Mascolo, S., Casetti, Cl., Gerla M., Sanadidi, M.Y., and | [Mascolo01] Mascolo, S., Casetti, Cl., Gerla M., Sanadidi, M.Y., and | |||
R. Wang, "TCP westwood: Bandwidth estimation for enhanced | R. Wang, "TCP westwood: Bandwidth estimation for enhanced | |||
transport over wireless links", Proceedings of MOBICOM | transport over wireless links", Proceedings of MOBICOM | |||
2001, pp.287-297, 2001. | 2001. | |||
[Moors02] Moors, T., "A critical review of "End-to-end arguments in | [Moors02] Moors, T., "A critical review of "End-to-end arguments in | |||
system design", Proceedings of IEEE International | system design", Proceedings of IEEE International | |||
Conference on Communications (ICC), Apr./May 2002. | Conference on Communications (ICC), April/May 2002. | |||
[MKMV95] MacKie-Mason, J., and H. Varian, "Pricing Congestible | [MKMV95] MacKie-Mason, J., and H. Varian, "Pricing Congestible | |||
Network Resources", IEEE Journal on Selected Areas in | Network Resources", IEEE Journal on Selected Areas in | |||
Communications, 'Advances in the Fundamentals of | Communications, 'Advances in the Fundamentals of | |||
Networking', Vol.13, No.7, pp.1141-1149, 1995, <http:// | Networking', Vol.13, No.7, pp.1141-1149, 1995. | |||
www.sims.berkeley.edu/~hal/Papers/ | ||||
pricing-congestible.pdf>. | ||||
[Padhye98] Padhye, J., Firoiu, V., Towsley, D., and J. Kurose, | [Padhye98] Padhye, J., Firoiu, V., Towsley, D., and J. Kurose, | |||
"Modeling TCP Throughput: A Simple Model and Its | "Modeling TCP Throughput: A Simple Model and Its | |||
Empirical Validation", University of Massachusetts | Empirical Validation", University of Massachusetts | |||
(UMass), CMPSCI Tech Report TR98-008, February 1998. | (UMass), CMPSCI Tech. Report TR98-008, February 1998. | |||
[Pan00] Pan, R., Prabhakar, B., and K. Psounis, "CHOKe: a | [Pan00] Pan, R., Prabhakar, B., and K. Psounis, "CHOKe: a | |||
stateless AQM scheme for approximating fair bandwidth | stateless AQM scheme for approximating fair bandwidth | |||
allocation", In Proceedings of IEEE INFOCOM'00, Tel Aviv, | allocation", In Proceedings of IEEE INFOCOM'00, Tel Aviv, | |||
Israel, March 2000. | Israel, March 2000. | |||
[Rossi06] Rossi, M., "Evaluating TCP with Corruption Notification | [Rossi06] Rossi, M., "Evaluating TCP with Corruption Notification | |||
in an IEEE 802.11 Wireless LAN", master thesis, | in an IEEE 802.11 Wireless LAN", master thesis, | |||
University of Innsbruck, November 2006. Available from | University of Innsbruck, November 2006. Available from | |||
http://www.welzl.at/research/projects/corruption/ | http://www.welzl.at/research/projects/corruption/ | |||
[Sarola02] Sarolahti, P. and A. Kuznetsov, "Congestion Control in | [Sarola02] Sarolahti, P., and A. Kuznetsov, "Congestion Control in | |||
Linux TCP", Proceedings of USENIX Annual Technical | Linux TCP", Proceedings of USENIX Annual Technical | |||
Conference, June 2002. | Conference, June 2002. | |||
[Sarola07] Sarolahti, P., Floyd, S., and M. Kojo, "Transport-layer | [Sarola07] Sarolahti, P., Floyd, S., and M. Kojo, "Transport-layer | |||
Considerations for Explicit Cross-layer Indications", | Considerations for Explicit Cross-layer Indications", | |||
Work in Progress, draft-sarolahti-tsvwg-crosslayer- | Work in Progress, draft-sarolahti-tsvwg-crosslayer- | |||
01.txt, March 2007. | 01.txt, March 2007. | |||
[Savage99] Savage, S., Wetherall, D., and T. Anderson, "TCP | [Savage99] Savage, S., Wetherall, D., and T. Anderson, "TCP | |||
Congestion Control with a Misbehaving Receiver," ACM | Congestion Control with a Misbehaving Receiver," ACM | |||
SIGCOMM Computer Communication Review, 1999. | SIGCOMM Computer Communication Review, 1999. | |||
[Saltzer84] Saltzer, J., Reed, D., and D. Clark, "End-to-end | [Saltzer84] Saltzer, J., Reed, D., and D. Clark, "End-to-end | |||
arguments in system design", ACM Transactions on Computer | arguments in system design", ACM Transactions on Computer | |||
Systems, Vol.2, No.4, November 1984. | Systems, Vol.2, No.4, November 1984. | |||
[Shin08] Shin, M., Chong, S., and I., Rhee, "Dual-Resource TCP/AQM | [Shin08] Shin, M., Chong, S., and I. Rhee, "Dual-Resource TCP/AQM | |||
for Processing-Constrained Networks", IEEE/ACM | for Processing-Constrained Networks", IEEE/ACM | |||
Transactions on Networking, Vol.16, No.2, pp. 435—449, | Transactions on Networking, Vol.16, No.2, pp.435-449, | |||
April 2008. | April 2008. | |||
[Thaler07] Thaler, D., Sridhara, M., and D. Bansal, "Implementation | [Thaler07] Thaler, D., Sridhara, M., and D. Bansal, "Implementation | |||
Report on Experiences with Various TCP RFCs", | Report on Experiences with Various TCP RFCs", | |||
presentation to the IETF Transport Area, | Presentation to the IETF Transport Area, | |||
<http://www.ietf.org/proceedings/07mar/slides/tsvarea- | <http://www.ietf.org/proceedings/07mar/slides/tsvarea-3/> | |||
3/>, March 2007. | March 2007. | |||
[TRILOGY] "Trilogy Project", European Commission Seventh Framework | [TRILOGY] "Trilogy Project", European Commission Seventh Framework | |||
Program Contract Number: INFSO-ICT-216372 | Program Contract Number: INFSO-ICT-216372 | |||
<http://www.trilogy-project.org> | <http://www.trilogy-project.org> | |||
[Welzl03] Welzl, M., "Scalable Performance Signalling and | [Welzl03] Welzl, M., "Scalable Performance Signalling and | |||
Congestion Avoidance", Springer (ISBN 1-4020-7570-7), | Congestion Avoidance", Springer (ISBN 1-4020-7570-7), | |||
August 2003. | August 2003. | |||
[Welzl08] Welzl, M., Rossi, M., Fumagalli, A., and M. Tacca, | [Welzl08] Welzl, M., Rossi, M., Fumagalli, A., and M. Tacca, | |||
"TCP/IP over IEEE 802.11b WLAN: the Challenge of | "TCP/IP over IEEE 802.11b WLAN: the Challenge of | |||
Harnessing Known-Corrupt Data", Proceedings of IEEE ICC | Harnessing Known-Corrupt Data", Proceedings of IEEE ICC | |||
2008, Beijing, China, May 2008. | 2008, Beijing, China, May 2008. | |||
[Xia05] Xia, Y., Subramanian, L., Stoica, I., and S. | [Xia05] Xia, Y., Subramanian, L., Stoica, I., and S. | |||
Kalyanaraman, "One more bit is enough", Proceedings of | Kalyanaraman, "One more bit is enough", Proceedings of | |||
ACM SIGCOMM'05, and ACM Computer Communication Review, | ACM SIGCOMM'05, and ACM SIGCOMM Computer Communication | |||
Vol.35, No.4, pp. 37-48, 2005. | Review, Vol.35, No.4, pp.37-48, 2005. | |||
[Zhang03] Zhang, H., Hollot, C., Towsley, D., and V. Misra. "A | [Zhang03] Zhang, H., Hollot, C., Towsley, D., and V. Misra, "A | |||
Self-Tuning Structure for Adaptation in TCP/AQM | Self-Tuning Structure for Adaptation in TCP/AQM | |||
Networks", ACM SIGMETRICS’03, San Diego (California), | Networks", ACM SIGMETRICS'03, San Diego (California), | |||
USA, June 2003. | USA, June 2003. | |||
Acknowledgments | 6. Acknowledgments | |||
The authors would like to thank the following people whose feedback | The authors would like to thank the following people whose feedback | |||
and comments contributed to this document: Keith Moore, Jan | and comments contributed to this document: Keith Moore, Jan | |||
Vandenabeele. | Vandenabeele. | |||
Dimitri Papadimitriou's contribution was partly funded by [ECODE], a | ||||
research project supported by the European Commission. | ||||
Larry Dunn (his comments at the Manchester ICCRG and discussions with | Larry Dunn (his comments at the Manchester ICCRG and discussions with | |||
him helped with the section on packet-congestibility). Bob Briscoe's | him helped with the section on packet-congestibility). | |||
contribution was partly funded by [TRILOGY], a research project | ||||
supported by the European Commission. | ||||
Author's Addresses | Bob Briscoe's contribution was partly funded by [TRILOGY], a research | |||
project supported by the European Commission. | ||||
7. Author's Addresses | ||||
Michael Welzl | Michael Welzl | |||
University of Innsbruck | University of Oslo, Department of Informatics | |||
Technikerstr 21a | PO Box 1080 Blindern | |||
A-6020 Innsbruck, Austria | N-0316 Oslo, Norway | |||
Phone: +43 (512) 507-6110 | Phone: +47 22 85 24 20 | |||
Email: michael.welzl@uibk.ac.at | Email: michawe@ifi.uio.no | |||
Dimitri Papadimitriou | Dimitri Papadimitriou | |||
Alcatel-Lucent | Alcatel-Lucent | |||
Copernicuslaan, 50 | Copernicuslaan, 50 | |||
B-2018 Antwerpen, Belgium | 2018 Antwerpen, Belgium | |||
Phone : +32 3 240 8491 | Phone : +32 3 240 8491 | |||
Email: dimitri.papadimitriou@alcatel-lucent.be | Email: dimitri.papadimitriou@alcatel-lucent.be | |||
Michael Scharf | Michael Scharf | |||
University of Stuttgart | University of Stuttgart | |||
Pfaffenwaldring 47 | Pfaffenwaldring 47 | |||
D-70569 Stuttgart | D-70569 Stuttgart, Germany | |||
Germany | ||||
Phone: +49 711 685 69006 | Phone: +49 711 685 69006 | |||
Email: michael.scharf@ikr.uni-stuttgart.de | Email: michael.scharf@ikr.uni-stuttgart.de | |||
Bob Briscoe | Bob Briscoe | |||
BT & UCL | BT & UCL | |||
B54/77, Adastral Park | B54/77, Adastral Park | |||
Martlesham Heath | Martlesham Heath | |||
Ipswich IP5 3RE | Ipswich IP5 3RE, UK | |||
UK | ||||
Email: bob.briscoe@bt.com | Email: bob.briscoe@bt.com | |||
8. Contributors | ||||
The following additional people have contributed to this document: | ||||
- Wesley Eddy <weddy@grc.nasa.gov> | ||||
- Bela Berde <bela.berde@gmx.de> | ||||
- Paulo Loureiro <loureiro.pjg@gmail.com> | ||||
- Chris Christou <christou_chris@bah.com> | ||||
Full Copyright Statement | Full Copyright Statement | |||
Copyright (c) 2009 IETF Trust and the persons identified as the | Copyright (c) 2009 IETF Trust and the persons identified as the | |||
document authors. All rights reserved. | document authors. All rights reserved. | |||
This document is subject to BCP 78 and the IETF Trust's Legal | This document is subject to BCP 78 and the IETF Trust's Legal | |||
Provisions Relating to IETF Documents in effect on the date of | Provisions Relating to IETF Documents in effect on the date of | |||
publication of this document (http://trustee.ietf.org/license-info). | publication of this document (http://trustee.ietf.org/license-info). | |||
Please review these documents carefully, as they describe your rights | Please review these documents carefully, as they describe your rights | |||
and restrictions with respect to this document. | and restrictions with respect to this document. | |||
Acknowledgment | Acknowledgments | |||
Funding for the RFC Editor function is provided by the IETF | Funding for the RFC Editor function is provided by the IETF | |||
Administrative Support Activity (IASA). | Administrative Support Activity (IASA). | |||
End of changes. 142 change blocks. | ||||
340 lines changed or deleted | 381 lines changed or added | |||
This html diff was produced by rfcdiff 1.35. The latest version is available from http://tools.ietf.org/tools/rfcdiff/ |