TOC 
Congestion and Pre CongestionT. Moncaster
Internet-DraftBT
Intended status: Standards TrackB. Briscoe
Expires: April 17, 2009BT & UCL
 M. Menth
 University of Wuerzburg
 October 14, 2008


Baseline Encoding and Transport of Pre-Congestion Information
draft-ietf-pcn-baseline-encoding-01

Status of This Memo

By submitting this Internet-Draft, each author represents that any applicable patent or other IPR claims of which he or she is aware have been or will be disclosed, and any of which he or she becomes aware will be disclosed, in accordance with Section 6 of BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as “work in progress.”

The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.

The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.

This Internet-Draft will expire on April 17, 2009.

Abstract

Pre-congestion notification (PCN) provides information to support admission control and flow termination in order to protect the Quality of Service of inelastic flows. It does this by marking packets when traffic load on a link is approaching or has exceeded a threshold below the physical link rate. This document specifies how such marks are to be encoded into the IP header. The baseline encoding described here provides for only two PCN encoding states. It is designed to be easily extended to provide more encoding states but such schemes will be described in other documents.



Table of Contents

1.  Introduction
2.  Requirements notation
3.  Terminology
4.  Encoding two PCN States in IP
    4.1.  Rationale for Encoding
    4.2.  PCN-Compatible DiffServ Codepoints
5.  Rules for Experimental Encoding Schemes
6.  Backwards Compatability
7.  IANA Considerations
8.  Security Considerations
9.  Conclusions
10.  Acknowledgements
11.  Comments Solicited
12.  References
    12.1.  Normative References
    12.2.  Informative References
Appendix A.  Tunnelling Constraints
Appendix B.  PCN Node Behaviours
Appendix C.  Deployment Scenarios for PCN Using Baseline Encoding




 TOC 

1.  Introduction

Pre-congestion notification (PCN) provides information to support admission control and flow termination in order to protect the quality of service (QoS) of inelastic flows. This is achieved by marking packets according to the level of pre-congestion at nodes within a PCN-domain. These markings are evaluated by the egress nodes of the PCN-domain. [pcn‑arch] (Eardley, P., “Pre-Congestion Notification (PCN) Architecture,” September 2008.) describes how PCN packet markings can be used to assure the QoS of inelastic flows within a single DiffServ domain.

This document specifies how these PCN marks are encoded into the IP header. It also describes how packets are identified as belonging to a PCN flow. Some deployment models require two PCN encoding states, others require more. The baseline encoding described here only provides for two PCN encoding states. An extension of the baseline encoding described in [PCN‑3‑enc‑state] (Moncaster, T., Briscoe, B., and M. Menth, “A three state extended PCN encoding scheme,” June 2008.) provides for three PCN encoding states. Other extensions have also been suggested all of which can build on the baseline encoding. In order to ensure backward compatibility any alternative encoding schemes that claim complinace with PCN standards MUST extend this baseline scheme.

Changes from previous drafts (to be removed by the RFC Editor):

From -00 to -01:
Added section on restrictions for extension encoding schemes.
Included table in Appendix showing encoding transitions at different PCN nodes.
Checked for consistency of terminology.
Minor language changes for clarity.
Changes from previous filename
Filename changed from draft-moncaster-pcn-baseline-encoding.
Terminology changed for clarity (PCN-compatible DSCP and PCN-enabled packet).
Minor changes throughout.
Modified meaning of ECT(1) state to EXP.
Moved text relevant to behaviour of nodes into appendix for later transfer to new document on edge behaviours.
From draft-moncaster -01 to -02:
Minor changes throughout including tightening up language to remain consistent with the PCN Architecture terminology
From draft-moncaster -00 to -01:
Change of title from "Encoding and Transport of (Pre-)Congestion Information from within a DiffServ Domain to the Egress"
Extensive changes to Introduction and abstract.
Added a section on the implications of re-using a DSCP.
Added appendix listing possible operator scenarios for using this baseline encoding.
Minor changes throughout.



 TOC 

2.  Requirements notation

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119] (Bradner, S., “Key words for use in RFCs to Indicate Requirement Levels,” March 1997.).



 TOC 

3.  Terminology

The following terms are used in this document:

In addition the document uses the terminology defined in [pcn‑arch] (Eardley, P., “Pre-Congestion Notification (PCN) Architecture,” September 2008.).



 TOC 

4.  Encoding two PCN States in IP

The PCN encoding states are defined using a combination of the DSCP and ECN fields within the IP header. The baseline PCN encoding closely follows the semantics of ECN [RFC3168]. It allows the encoding of two PCN states: Not-Marked and PCN-Marked. It also allows for traffic that is not PCN capable to be marked as such (not-PCN). Given the scarcity of codepoints within the IP header the baseline encoding leaves one codepoint free for experimental use. The following table defines how to encode these states in IP:



ECN codepointnot-ECT (00)ECT(0) (10)ECT(1) (01)CE (11)
DSCP n not-PCN NM EXP PM

Where DSCP n is a PCN-compatible DiffServ codepoint (see Section 4.2 (PCN-Compatible DiffServ Codepoints)) and EXP means available for Experimental use.

 Table 1: Encoding PCN in IP 

The following rules apply to all PCN traffic:



 TOC 

4.1.  Rationale for Encoding

The exact choice of encoding was dictated by the constraints imposed by existing IETF RFCs, in particular [RFC3168] (Ramakrishnan, K., Floyd, S., and D. Black, “The Addition of Explicit Congestion Notification (ECN) to IP,” September 2001.) and [RFC4774] (Floyd, S., “Specifying Alternate Semantics for the Explicit Congestion Notification (ECN) Field,” November 2006.). One of the tightest constraints was the need for any PCN encoding to survive being tunnelled through either an IP in IP tunnel or an IPSec Tunnel. Appendix A (Tunnelling Constraints) explains this in detail. The main effect of this constraint is that any PCN marking has to carry the 11 codepoint in the ECN field. If the packet is being tunneled then only the 11 codepoint gets copied into the inner header upon decapsulation. An additional constraint is the need to minimise the use of DiffServ codepoints as there is a limited supply of standards track codepoints remaining. Section 4.2 (PCN-Compatible DiffServ Codepoints) explains how we have minimised this still further by reusing pre-existing Diffserv codepoint(s) such that non-PCN traffic can still be distinguished from PCN traffic. There are a number of factors that were considered before deciding to set 10 as the NM state. These included similarity to ECN, presence of tunnels within the domain, leakage into and out of PCN-domain and incremental deployment.

The encoding scheme above seems to meet all these constraints and ends up looking very similar to ECN. This is perhaps not surprising given the similarity in architectural intent between PCN and ECN.



 TOC 

4.2.  PCN-Compatible DiffServ Codepoints

Equipment complying with the baseline PCN encoding MUST allow PCN to be enabled for certain Diffserv codepoints. This document defines the term "PCN-compatible Diffserv codepoint" for such a DSCP. Enabling PCN for a DSCP switches on PCN marking behaviour for packets with that DSCP, but only if those packets also have their ECN field set to indicate a codepoint other than not-PCN.

Enabling PCN marking behaviour disables any other marking behaviour (e.g. enabling PCN disables the default ECN marking behaviour introduced in [RFC3168] (Ramakrishnan, K., Floyd, S., and D. Black, “The Addition of Explicit Congestion Notification (ECN) to IP,” September 2001.)). All traffic scheduling and conditioning behaviours are discussed in [pcn‑marking‑behaviour] (Eardley, P., “Marking behaviour of PCN-nodes,” October 2008.).



 TOC 

5.  Rules for Experimental Encoding Schemes

Any experimental encoding scheme MUST follow these rules to ensure backward compatibility with this baseline scheme:



 TOC 

6.  Backwards Compatability

BCP 124 [RFC4774] (Floyd, S., “Specifying Alternate Semantics for the Explicit Congestion Notification (ECN) Field,” November 2006.) gives guidelines for specifying alternative semantics for the ECN field. It sets out a number of factors to be taken into consideration. It also suggests various techniques to allow the co-existence of default ECN and alternative ECN semantics. The baseline encoding specified in this document defines PCN-compatible DiffServ codepoints as no longer supporting the default ECN semantics. As such this document is compatible with BCP 124. It should be noted that this baseline encoding blocks end-to-end ECN except where mechanisms are put in place to tunnel such traffic across the PCN-domain.



 TOC 

7.  IANA Considerations

This document makes no request to IANA.



 TOC 

8.  Security Considerations

Packets claim entitlement to be PCN marked by carrying a PCN-Compatible DSCP and a PCN-Enabled ECN codepoint. This encoding document is intended to stand independently of the architecture used to determine whether specific packets are authorised to be PCN marked, which will be described in a future separate document on PCN edge-node behaviour (see Appendix B (PCN Node Behaviours)).

The PCN working group has initially been chartered to only consider a PCN-domain to be entirely under the control of one operator, or a set of operators who trust each other [PCN‑charter] (IETF, “IETF Charter for Congestion and Pre-Congestion Notification Working Group,” .). However there is a requirement to keep inter-domain scenarios in mind when defining the PCN encoding. One way to extend to multiple domains would be to concatenate PCN-domains and use PCN-boundary-nodes back to back at borders. Then any one domain's security against its neighbours would be described as part of the proposed edge-node behaviour document.

One proposal on the table allows one to extend PCN across multiple domains without PCN-boundary-nodes back-to-back at borders [re‑PCN] (Briscoe, B., “Emulating Border Flow Policing using Re-ECN on Bulk Data,” July 2007.). It is believed that the encoding described here would be compatible with the security framework described there.



 TOC 

9.  Conclusions

This document defines the baseline PCN encoding utilising a combination of a PCN-enabled DSCP and the ECN field in the IP header. This baseline encoding allows the existence of two PCN encoding states, not-Marked and PCN-Marked. It also allows for the co-existence of competing traffic within the same DSCP so long as theat traffic doesn't require end-to-end ECN support. The encoding scheme is conformant with [RFC4774] (Floyd, S., “Specifying Alternate Semantics for the Explicit Congestion Notification (ECN) Field,” November 2006.).



 TOC 

10.  Acknowledgements

This document builds extensively on work done in the PCN working group by Kwok Ho Chan, Georgios Karagiannis, Philip Eardley, Anna Charny, Joe Babiarz and others. Thanks to Ruediger Geib for providing detailed comments on this document.



 TOC 

11.  Comments Solicited

Comments and questions are encouraged and very welcome. They can be addressed to the IETF congestion and pre-congestion working group mailing list <pcn@ietf.org>, and/or to the authors.



 TOC 

12.  References



 TOC 

12.1. Normative References

[RFC2119] Bradner, S., “Key words for use in RFCs to Indicate Requirement Levels,” BCP 14, RFC 2119, March 1997 (TXT, HTML, XML).
[RFC4774] Floyd, S., “Specifying Alternate Semantics for the Explicit Congestion Notification (ECN) Field,” BCP 124, RFC 4774, November 2006 (TXT).
[pcn-arch] Eardley, P., “Pre-Congestion Notification (PCN) Architecture,” draft-ietf-pcn-architecture-07 (work in progress), September 2008 (TXT).


 TOC 

12.2. Informative References

[PCN-3-enc-state] Moncaster, T., Briscoe, B., and M. Menth, “A three state extended PCN encoding scheme,” draft-moncaster-pcn-3-state-encoding-00 (work in progress), June 2008 (TXT).
[PCN-charter] IETF, “IETF Charter for Congestion and Pre-Congestion Notification Working Group.”
[RFC3168] Ramakrishnan, K., Floyd, S., and D. Black, “The Addition of Explicit Congestion Notification (ECN) to IP,” RFC 3168, September 2001 (TXT).
[RFC4301] Kent, S. and K. Seo, “Security Architecture for the Internet Protocol,” RFC 4301, December 2005 (TXT).
[ecn-tunnelling] Briscoe, B., “Layered Encapsulation of Congestion Notification,” draft-briscoe-tsvwg-ecn-tunnel-01 (work in progress), July 2008 (TXT).
[pcn-marking-behaviour] Eardley, P., “Marking behaviour of PCN-nodes,” draft-ietf-pcn-marking-behaviour-00 (work in progress), October 2008 (TXT).
[re-PCN] Briscoe, B., “Emulating Border Flow Policing using Re-ECN on Bulk Data,” draft-briscoe-re-pcn-border-cheat-00 (work in progress), July 2007 (TXT).
[voice-admit] Baker, F., Polk, J., and M. Dolly, “DSCPs for Capacity-Admitted Traffic,” draft-ietf-tsvwg-admitted-realtime-dscp-04 (work in progress), February 2008 (TXT).


 TOC 

Appendix A.  Tunnelling Constraints

The rules that govern the behaviour of the ECN field for IP-in-IP tunnels were defined in [RFC3168] (Ramakrishnan, K., Floyd, S., and D. Black, “The Addition of Explicit Congestion Notification (ECN) to IP,” September 2001.). This allowed for two tunnel modes. The limited functionality mode sets the outer header to not-ECT, regardless of the value of the inner header, in other words disabling ECN within the tunnel. The full functionality mode copies the inner ECN field into the outer header if the inner header is not-ECT or either of the 2 ECT codepoints. If the inner header is CE then the outer header is set to ECT(0). On decapsulation, if the CE codepoint is set on the outer header then this is copied into the inner header. Otherwise the inner header is left unchanged. The stated reason for blocking CE from being copied to the outer header was to prevent this from being used as a covert channel through IPSec tunnels.

The IPSec protocol [RFC4301] (Kent, S. and K. Seo, “Security Architecture for the Internet Protocol,” December 2005.) changed the ECN tunnelling rule to allow IPSec tunnels to simply copy the inner header into the outer header. On decapsulation the outer header is discarded and the ECN field is only copied down if it is set to CE.

Because of the possible existence of tunnels, only CE (11) can be used as a PCN marking as it is the only mark that will always survive decapsulation. However there is a need for caution with all tunneling within the PCN-domain. RFC3168 full functionality IP in IP tunnels are expected to set the ECN field to ECT(0) if the inner ECN field is set to CE. This leads to the possibility that some packets within the PCN-domain that have already been marked may have that mark concealed further into the domain. This is undesirable for many PCN schemes and thus the PCN working group needs to decide whether to advise against the use of full functionality RFC3168 IP in IP tunnels within a PCN-domain ot to support the ongoing work within the Transport Area to rationalise the behaviour of IP in IP tunnels in respect to the ECN field and bring them in line with the behaviour of IPSec tunnels [ecn‑tunnelling] (Briscoe, B., “Layered Encapsulation of Congestion Notification,” July 2008.).



 TOC 

Appendix B.  PCN Node Behaviours

The following table of valid and invalid transitions, while necessary for the correct functioning of PCN they is not strictly part of the encoding scheme. The PCN working group needs to decide whether to include this in this baseline encoding or whether to transfer it to an alternative document.



PCN node typeCodepoint inValid codepoint outInvalid codepoint out
ingress Any NM or Not-PCN PM
------------ ----------- ---------------- -----------------
interior NM NM or PM not-PCN
------------ ----------- ---------------- -----------------
interior Not-PCN Not-PCN Any other codepoint
------------ ----------- ---------------- -----------------
egress Any 00 Any other codepoint*

* Except where the egress node knows that other marks may be safely exposed outside the PCN-domain (e.g. [PCN‑3‑enc‑state] (Moncaster, T., Briscoe, B., and M. Menth, “A three state extended PCN encoding scheme,” June 2008.)).

 Table 2: Valid and Invalid Transitions at PCN nodes 

It is also necessary to define a safe behaviour for baseline-compliant nodes to follow should they unexpectedly encounter a packet carrying the EXP (01) codepoint. The obvious safe behaviour would be to treat this as if it were a NM packet but to raise an alarm at a higher layer to check why the packet was there. An alternative safe approach is to treat it as a not-PCN packet but this might jeopardise partial deployment of any future experimental encoding scheme.



 TOC 

Appendix C.  Deployment Scenarios for PCN Using Baseline Encoding

This appendix illustrates possible PCN deployment scenarios where the baseline encoding can be used and also explain a case for which baseline encoding is not sufficient. {Note this appendix is provided for information only}.

  1. An operator may wish to use PCN-based admission control only. To that end, threshold marking based on admissible rates might be used as the only PCN metering and marking algorithm. As a consequence, the PM marks on the packets are interpreted as meaning the ingress should stop admitting new traffic.
  2. An operator may wish to use PCN-based flow termination only. To that end, excess rate marking based on supportable rates might be used as the only PCN metering and marking algorithm. As a consequence, the PM marks on the packets are interpreted as meaning the ingress should start terminating appropriate flows.
  3. An operator may wish to use both PCN-based admission control and flow termination. To that end, excess rate marking based on admissible rates might be used as the only PCN metering and marking algorithm. The level of marks will be used to determine when the ingress should stop admitting new traffic and whether the ingress should terminate any flows.
  4. An operator may wish to implement admission control based on threshold marking at admissible rates and flow termination based on excess rate marking at supportable rates because these methods are believed to work better with small ingress-egress aggregates. Then two different markings are needed. Such a deployment scenario is not supported by the PCN baseline encoding.


 TOC 

Authors' Addresses

  Toby Moncaster
  BT
  B54/70, Adastral Park
  Martlesham Heath
  Ipswich IP5 3RE
  UK
Phone:  +44 1473 648734
EMail:  toby.moncaster@bt.com
  
  Bob Briscoe
  BT & UCL
  B54/77, Adastral Park
  Martlesham Heath
  Ipswich IP5 3RE
  UK
Phone:  +44 1473 645196
EMail:  bob.briscoe@bt.com
  
  Michael Menth
  University of Wuerzburg
  room B206, Institute of Computer Science
  Am Hubland
  Wuerzburg D-97074
  Germany
Phone:  +49 931 888 6644
EMail:  menth@informatik.uni-wuerzburg.de


 TOC 

Full Copyright Statement

Intellectual Property

Acknowledgement