<?xml version="1.0" encoding="iso-8859-1"?>
<?rfc toc="yes"?>
<?rfc symrefs="yes" ?>
<?rfc sortrefs="yes" ?>
<?rfc compact="yes" ?>
<?rfc subcompact="no" ?>

<rfc category="std" ipr="trust200902" docName="draft-ietf-nvo3-bfd-geneve-13" consensus="true" submissionType="IETF">

<front>
        <title abbrev="BFD for Geneve"> BFD for Geneve </title>

  <author fullname="Xiao Min" initials="X" surname="Min">
      <organization>ZTE Corp.</organization>
     <address>
       <postal>
         <street/>

         <!-- Reorder these if your country does things differently -->

         <city>Nanjing</city>

         <region/>

         <code/>

         <country>China</country>
       </postal>

       <phone>+86 18061680168</phone>

       <email>xiao.min2@zte.com.cn</email>

       <!-- uri and facsimile elements may also be added -->
     </address>
    </author>

  <author fullname="Greg Mirsky" initials="G" surname="Mirsky">
      <organization>Ericsson</organization>
     <address>
       <postal>
         <street></street>

         <!-- Reorder these if your country does things differently -->

         <city></city>

         <region></region>

         <code></code>

         <country>United States of America</country>
       </postal>

       <phone></phone>

       <email>gregimirsky@gmail.com</email>

       <!-- uri and facsimile elements may also be added -->
     </address>
    </author>

  <author fullname="Santosh Pallagatti" initials="S" surname="Pallagatti">
      <organization>VMware</organization>
     <address>
       <postal>
         <street/>

         <!-- Reorder these if your country does things differently -->

         <city></city>

         <region/>

         <code/>

         <country>India</country>
       </postal>

       <phone></phone>

       <email>santosh.pallagatti@gmail.com</email>

       <!-- uri and facsimile elements may also be added -->
     </address>
    </author>
	
  <author fullname="Jeff Tantsura" initials="J" surname="Tantsura">
      <organization>Nvidia</organization>
     <address>
       <postal>
         <street/>

         <!-- Reorder these if your country does things differently -->

         <city></city>

         <region/>

         <code/>

         <country>United States of America</country>
       </postal>

       <phone></phone>

       <email>jefftant.ietf@gmail.com</email>

       <!-- uri and facsimile elements may also be added -->
     </address>
    </author>
	   
  <author fullname="Sam Aldrin" initials="S" surname="Aldrin">
      <organization>Google</organization>
     <address>
       <postal>
         <street/>

         <!-- Reorder these if your country does things differently -->

         <city></city>

         <region/>

         <code/>

         <country>United States of America</country>
       </postal>

       <phone></phone>

       <email>aldrin.ietf@gmail.com</email>
       <!-- uri and facsimile elements may also be added -->
     </address>
    </author>
	
    <date year="2023"/>
  
    <area>Routing</area>
    <workgroup>NVO3 Working Group</workgroup>

    <keyword>Request for Comments</keyword>
    <keyword>RFC</keyword>
    <keyword>Internet Draft</keyword>
    <keyword>I-D</keyword>

    <abstract>
  <t> This document describes the use of the Bidirectional Forwarding Detection (BFD) protocol in point-to-point 
  Generic Network Virtualization Encapsulation (Geneve) unicast tunnels used to make up an overlay network. </t>
    </abstract>
    
</front>
  
<middle>

  <section title="Introduction">
  
  <t> "Generic Network Virtualization Encapsulation" (Geneve) <xref target="RFC8926"/> provides an encapsulation scheme 
  that allows building an overlay network of tunnels by decoupling the address space of the attached virtual hosts from 
  that of the network. </t>
  
  <t> This document describes the use of Bidirectional Forwarding Detection (BFD) protocol <xref target="RFC5880"/>
  to enable monitoring the continuity of the path between two Geneve tunnel endpoints, which may be a NVE (Network 
  Virtualization Edge) or another device acting as a Geneve tunnel endpoint. Specifically, the asynchronous mode of BFD, 
  as defined in <xref target="RFC5880"/>, is used to monitor a P2P Geneve tunnel. The support for BFD Echo function is outside 
  the scope of this document. For simplicity, NVE is used to represent the Geneve tunnel endpoint. 
  TS (Tenant System) is used to represent the physical or virtual device attached to a Geneve tunnel endpoint from the 
  outside. VAP (Virtual Access Point) is the NVE side of the interface between the NVE and the TS, and a VAP is a 
  logical network port (virtual or physical) into a specific virtual network. For detailed definitions and descriptions 
  of NVE, TS and VAP, please refer to <xref target="RFC7365"/> and <xref target="RFC8014"/>. </t>
  
  <t> The use cases and the deployment of BFD for Geneve are mostly consistent with what's described in Section 1 and 3 of 
  <xref target="RFC8971"/> ("Bidirectional Forwarding Detection (BFD) for Virtual eXtensible Local Area Network (VXLAN)"). 
  One exception is on the usage of Management VNI, which is described in <xref target="I-D.ietf-nvo3-geneve-oam"/> and outside 
  the scope of this document. </t>
  
  <t> As specified in Section 4.2 of <xref target="RFC8926"/>, Geneve MUST be used with congestion-controlled traffic or 
  within a traffic-managed controlled environment (TMCE) to avoid congestion, that requirement applies to BFD traffic too. 
  Specifically, considering the complexity and immaturity of BFD congestion control mechanism, BFD for Geneve MUST be used 
  within a TMCE unless BFD is really congestion controlled. As an alternative to a real congestion control, an operator of 
  a TMCE deploying BFD for Geneve is required to provision the rates at which BFD is transmitted to avoid congestion and 
  false failure detection. </t>
  
  </section>
  
  <section title="Conventions Used in This Document">
   
    <section title="Abbreviations">
    <t> BFD: Bidirectional Forwarding Detection</t>
    <t> FCS: Frame Check Sequence</t>
    <t> Geneve: Generic Network Virtualization Encapsulation</t>
    <t> NVE: Network Virtualization Edge</t>
    <t> TMCE: Traffic-Managed Controlled Environment</t>
    <t> TS: Tenant System</t>
    <t> VAP: Virtual Access Point</t>
	<t> VNI: Virtual Network Identifier</t>
	<t> VXLAN: Virtual eXtensible Local Area Network</t>
    </section>
       
    <section title="Requirements Language">
	<t> The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",   "SHOULD", "SHOULD NOT", "RECOMMENDED", 
	"NOT RECOMMENDED", "MAY", and   "OPTIONAL" in this document are to be interpreted as described in BCP 14 
	<xref target="RFC2119"/> <xref target="RFC8174"/> when, and only when, they appear in all capitals, as shown here.</t>
    </section>
	
  </section>

  <section title="BFD Packet Transmission over Geneve Tunnel">

  <t> Since the Geneve data packet payload may be either an Ethernet frame or an IP packet, this document defines two 
  formats of BFD packet encapsulation in Geneve. The BFD session is originated and terminated at the VAP of an NVE. The selection 
  of the BFD packet encapsulation is based on how the VAP encapsulates the data packets. If the payload is 
  IP, then BFD over IP is carried in the payload. If the payload is Ethernet, then BFD over IP over Ethernet is carried in 
  the payload, in the same manner as BFD over IP in the IP payload case, regardless of what the Ethernet payload might normally carry.</t>
  
  </section> 
  
  <section anchor="ethernet-ip-encaps-section" title="BFD Encapsulation With Inner Ethernet/IP/UDP Header">
	
  <t> If the VAP that originates the BFD packets is used to encapsulate Ethernet data frames, then the BFD packets are 
  encapsulated in Geneve as described below. The Geneve packet formats over IPv4 and IPv6 are defined in Section 3.1 and 
  3.2 of <xref target="RFC8926"/> respectively. The Outer IP/UDP and Geneve headers are encoded by the sender 
  as defined in <xref target="RFC8926"/>. Note that the outer IP header and the inner IP header may not be of the same 
  address family. In other words, an outer IPv6 header accompanied by an inner IPv4 header and an outer IPv4 header accompanied 
  by an inner IPv6 header are both possible. </t>
	
     <figure anchor="Figure_1" title="Geneve Encapsulation of BFD Control Packet With the Inner Ethernet/IP/UDP Header">
     <artwork align="left"><![CDATA[
 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                      Outer Ethernet Header                    ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        Outer IPvX Header                      ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        Outer UDP Header                       ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                          Geneve Header                        ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                      Inner Ethernet Header                    ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        Inner IPvX Header                      ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                         Inner UDP Header                      ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        BFD Control Packet                     ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                        Outer Ethernet FCS                     |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     ]]></artwork>
     </figure>
	 
     <t> The BFD packet MUST be carried inside the inner Ethernet frame of the Geneve packet.
	      The inner Ethernet frame carrying the BFD Control packet has the following format:
		   <list>
			   <t>Inner Ethernet Header:
				   <list>
					   <t>Destination MAC: MAC address of a VAP of the terminating NVE.</t>
					   <t>Source MAC: MAC address of a VAP of the originating NVE.</t>
				   </list>
			   </t>
			   
			   <t>IP Header:
				   <list>
					   <t>Source IP: IP address of a VAP of the originating NVE. If the VAP of the originating NVE 
					   has no IP address, then the IP address 0.0.0.0 for IPv4 or ::/128 for IPv6 MUST be used.</t>
					   <t>Destination IP: IP address of a VAP of the terminating NVE. If the VAP of the terminating 
					   NVE has no IP address, then the IP address 127.0.0.1 for IPv4 or ::1/128 for IPv6 MUST be used.</t>
					   <t>TTL or Hop Limit: MUST be set to 255 in accordance with <xref target="RFC5881"/> that specifies the IPv4/IPv6 single-hop BFD.</t>
				   </list>
			   </t>

		   <t> The fields of the UDP header and the BFD Control packet are encoded as specified in <xref target="RFC5881"/>.</t>
		   </list>
	 </t>
	 
     <t> When the BFD packets are encapsulated in Geneve in this way, the Geneve header defined in <xref target="RFC8926"/> 
	 follows the value set below.</t>	
	 
    <t> 
	<list>
     <t> Opt Len field MUST be set consistent with the Geneve specification <xref target="RFC8926"/> depending on whether 
	 or not Geneve options are present in the frame. The use of Geneve options with BFD is beyond the scope of this document.</t>
	 
	 <t> O bit MUST be set to 1, which indicates this packet contains a control message.</t>
	 
     <t> C bit MUST be set to 0, which indicates there isn't any critical option.</t>
	 
     <t> Protocol Type field MUST be set to 0x6558 (Ethernet frame).</t>
	 
     <t> Virtual Network Identifier (VNI) field MUST be set to the VNI number that the originating VAP is mapped to.</t>
	</list>
    </t> 

    <section title="Demultiplexing BFD packet when payload is Ethernet"> 
    
    <t> Once a packet is received, the NVE validates the packet as described in <xref target="RFC8926"/>. When the
	payload is Ethernet, the Protocol Type field equals 0x6558. The Destination MAC address of the inner Ethernet frame 
	matches the MAC address of a VAP which is mapped to the same VNI as the received VNI. Then the Destination IP, the UDP 
	destination port and the TTL or Hop Limit of the inner IP packet MUST be validated to determine whether the received 
	packet can be processed by BFD, i.e., the three field values of the inner IP packet MUST be in compliance with what's 
	defined in Section 4 of this document, as well as Section 4 of <xref target="RFC5881"/>. If the validation fails, the 
	received packet MUST NOT be processed by BFD.</t>
      
    <t> In BFD over Geneve, a BFD session is originated and terminated at a VAP. Usually one NVE owns 
    multiple VAPs. Since multiple BFD sessions may be running between two NVEs, there needs to be a mechanism 
    for demultiplexing received BFD packets to the proper session. Furthermore, due to the fact that 
    <xref target="RFC8014"/> allows for N-to-1 mapping between VAP and VNI at one NVE, multiple BFD sessions 
    between two NVEs for the same VNI are allowed. Also note that a BFD session can only be established between 
    two VAPs that are mapped to the same VNI and use the same way to encapsulate data packets. </t>
    
    <t> If the BFD packet is received with Your Discriminator equals to 0, then the BFD session SHOULD be identified using 
	the VNI number and the inner Ethernet/IP header. The inner Ethernet/IP header stands for the source MAC, the source IP, 
	the destination MAC, and the destination IP. An implementation MAY use the inner UDP port source number to aid in 
	demultiplexing incoming BFD Control packets. If it fails to identify the BFD session, the incoming BFD Control packets 
	MUST be dropped, and an exception event indicating the failure should be reported to the management.</t>
    
    <t> If the BFD packet is received with non-zero Your Discriminator, then the BFD session MUST be demultiplexed 
    only with Your Discriminator as the key. </t>
	  
    </section>
	
  </section>
  
  <section anchor="ip-encaps-section" title="BFD Encapsulation With Inner IP/UDP Header">
	
  <t> If the VAP that originates the BFD packets is used to encapsulate IP data packets, then the BFD packets are 
  encapsulated in Geneve as described below. The Geneve packet formats over IPv4 and IPv6 are defined in Section 3.1 and 
  3.2 of <xref target="RFC8926"/> respectively. The Outer IP/UDP and Geneve headers are encoded by the sender 
  as defined in <xref target="RFC8926"/>. Note that the outer IP header and the inner IP header may not be of the same 
  address family. In other words, an outer IPv6 header accompanied by an inner IPv4 header and an outer IPv4 header accompanied 
  by an inner IPv6 header are both possible. </t>
	
     <figure anchor="Figure_2" title="Geneve Encapsulation of BFD Control Packet With the Inner IP/UDP Header">
     <artwork align="left"><![CDATA[
 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                         Ethernet Header                       ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        Outer IPvX Header                      ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        Outer UDP Header                       ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                          Geneve Header                        ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        Inner IPvX Header                      ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                         Inner UDP Header                      ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
~                        BFD Control Packet                     ~
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                               FCS                             |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     ]]></artwork>
     </figure>
	 
     <t> The BFD packet MUST be carried inside the inner IP packet of the Geneve packet.
     The inner IP packet carrying the BFD Control packet has the following format:
		   <list>
			   <t>Inner IP header:
				   <list>
					   <t>Source IP: IP address of a VAP of the originating NVE.</t>
					   <t>Destination IP: IP address of a VAP of the terminating NVE.</t>
					   <t>TTL or Hop Limit: MUST be set to 255 in accordance with <xref target="RFC5881"/> that specifies the IPv4/IPv6 single-hop BFD.</t>
				   </list>
			   </t>

		   <t> The fields of the UDP header and the BFD Control packet are encoded as specified in <xref target="RFC5881"/>.</t>

		   </list>
	 </t>
	 
     <t> When the BFD packets are encapsulated in Geneve in this way, the Geneve header defined in <xref target="RFC8926"/> 
	 follows the value set below.</t>	

    <t> 
	<list>
     <t> Opt Len field MUST be set consistent with the Geneve specification <xref target="RFC8926"/> depending on whether 
	 or not Geneve options are present in the frame. The use of Geneve options with BFD is beyond the scope of this document.</t>
	 
	 <t> O bit MUST be set to 1, which indicates this packet contains a control message.</t>
	 
     <t> C bit MUST be set to 0, which indicates there isn't any critical option.</t>
	 
     <t> Protocol Type field MUST be set to 0x0800 (IPv4) or 0x86DD (IPv6), depending on the address family 
	 of the inner IP packet.</t>
	 
     <t> Virtual Network Identifier (VNI) field MUST be set to the VNI number that the originating VAP is mapped to.</t>
	</list>
    </t> 

    <section title="Demultiplexing BFD packet when payload is IP"> 
    
    <t> Once a packet is received, the NVE validates the packet as described in <xref target="RFC8926"/>. When the
	payload is IP, the Protocol Type field equals 0x0800 or 0x86DD. The Destination IP address of the inner IP packet matches 
	the IP address of a VAP which is mapped to the same VNI as the received VNI. Then the UDP destination port and the TTL or Hop 
	Limit of the inner IP packet MUST be validated to determine whether the received packet can be processed by BFD, i.e., 
	the two field values of the inner IP packet MUST be in compliance with what's defined in Section 5 of this document, as well 
	as Section 4 of <xref target="RFC5881"/>. If the validation fails, the received packet MUST NOT be processed by BFD.</t>
    
    <t> If the BFD packet is received with Your Discriminator equals to 0, then the BFD session SHOULD be identified using the VNI 
	number and the inner IP header. The inner IP header stands for the source IP and the destination IP. An implementation MAY 
	use the inner UDP port source number to aid in demultiplexing incoming BFD Control packets. If it fails to identify the BFD session, 
	the incoming BFD Control packets MUST be dropped, and an exception event indicating the failure should be reported to the management.</t>
    
    <t> If the BFD packet is received with non-zero Your Discriminator, then the BFD session MUST be demultiplexed 
    only with Your Discriminator as the key. </t>
	  
    </section>
	
  </section>

  <section title="Security Considerations">
  
  <t> Security issues discussed in <xref target="RFC8926"/> and <xref target="RFC5880"/> apply to this document. Particularly, 
  the BFD is an application that is run at the two Geneve tunnel endpoints. The IP underlay network and/or the Geneve option 
  can provide security between the peers, which are subject to the issue of overload described below. The BFD introduces no 
  security vulnerabilities when run in this manner. Considering Geneve does not have any inherent security mechanisms, BFD 
  authentication as specified in <xref target="RFC5880"/> is RECOMMENDED to be utilized.</t>
   
  <t> This document supports establishing multiple BFD sessions between the same pair of NVEs, each BFD session over 
  a pair of VAPs residing in the same pair of NVEs, there SHOULD be a mechanism to control the maximum number of such 
  sessions that can be active at the same time. Particularly, assuming an example that each NVE of the pair of NVEs has N VAPs 
  using Ethernet as the payload, then there could be N squared BFD sessions running between the pair of NVEs. Considering N 
  could be a high number, the N squared BFD sessions could result in overload of the NVE. In this case, it's recommended 
  that N BFD sessions covering all N VAPs are run for the pair of NVEs. Generally speaking, the number of BFD sessions is 
  supposed to be enough as long as all VAPs of the pair of NVEs are covered.</t>
  
  </section>
  
  <section title="IANA Considerations"> 
  <t> This document has no IANA action requested.</t>
  </section>

  <section title="Acknowledgements">
  <t> The authors would like to acknowledge Reshad Rahman, Jeffrey Haas, and Matthew Bocci for their guidance 
  on this work.</t>
  <t> The authors would like to acknowledge David Black for his explanation on the mapping relation between VAP 
  and VNI.</t>
  <t> The authors would like to acknowledge Stewart Bryant, Anoop Ghanwani, Jeffrey Haas, Reshad Rahman, Matthew 
  Bocci, Andrew Alston, Magnus Westerlund, Paul Kyzivat, Sheng Jiang, Carl Wallace, Roman Danyliw, John Scudder, 
  Donald Eastlake, Eric Vyncke, Zaheduzzaman Sarker, and Lars Eggert for their thorough review and very helpful 
  comments.</t>
  </section>  
  
</middle>
  
<back>

    <references title="Normative References">
     <?rfc include="reference.RFC.5880"?>
     <?rfc include="reference.RFC.5881"?>
     <?rfc include="reference.RFC.2119"?>
     <?rfc include="reference.RFC.8174"?>
     <?rfc include="reference.RFC.8926"?>
    </references>
	
    <references title="Informative References">
     <?rfc include="reference.RFC.7365"?>
     <?rfc include="reference.RFC.8014"?>
     <?rfc include="reference.RFC.8971"?>
     <?rfc include="reference.I-D.ietf-nvo3-geneve-oam"?>
    </references>
	
</back>
</rfc>
