<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE rfc [
  <!ENTITY nbsp    "&#160;">
  <!ENTITY zwsp   "&#8203;">
  <!ENTITY nbhy   "&#8209;">
  <!ENTITY wj     "&#8288;">
]>
<?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
<!-- generated by https://github.com/cabo/kramdown-rfc version 1.7.24 (Ruby 3.2.3) -->
<rfc xmlns:xi="http://www.w3.org/2001/XInclude" ipr="trust200902" docName="draft-zhao-anima-automatic-congestion-relief-00" category="std" consensus="true" submissionType="IETF" version="3">
  <!-- xml2rfc v2v3 conversion 3.28.0 -->
  <front>
    <title abbrev="Automatic Network Congestion Relief">Automatic Network Congestion Relief</title>
    <seriesInfo name="Internet-Draft" value="draft-zhao-anima-automatic-congestion-relief-00"/>
    <author initials="J." surname="Zhao" fullname="Jing Zhao" role="editor">
      <organization>China Unicom</organization>
      <address>
        <postal>
          <city>Beijing</city>
          <country>China</country>
        </postal>
        <email>zhaoj501@chinaunicom.cn</email>
      </address>
    </author>
    <author initials="S." surname="Zhang" fullname="Shuai Zhang" role="editor">
      <organization>China Unicom</organization>
      <address>
        <postal>
          <city>Beijing</city>
          <country>China</country>
        </postal>
        <email>zhangs633@chinaunicom.cn</email>
      </address>
    </author>
    <date year="2025" month="March" day="03"/>
    <area>Operations and Management Area</area>
    <workgroup>anima</workgroup>
    <keyword>Internet-Draft</keyword>
    <abstract>
      <?line 45?>
<t>This document introduces an automatic congestion relief mechanism based on intelligent traffic analysis and dynamic regulation. In the event of congestion caused by fiber optic failures, it can respond intelligently and self-heal in real time, ensuring the stable operation of the network.</t>
    </abstract>
  </front>
  <middle>
    <?line 47?>

<section anchor="intro">
      <name>Introduction</name>
      <t>Nowadays, fiber optic failures occur frequently, leading to network congestion and becoming a common pain point for operators. These issues necessitate dedicated staff to perform daily traffic inspections and manually adjust configurations on an hourly basis, which significantly increases the difficulty of network maintenance.</t>
      <t>This draft introduces an automatic congestion relief mechanism based on intelligent traffic analysis and auto-regulation. In the event of congestion caused by fiber optic failures, it can intelligently respond to congestion and initiate real-time self-healing processes, solving the network congestion and maintenance challenges faced by operators due to fiber optic failures, and ensuring the stable operation of the network.</t>
    </section>
    <section anchor="automatic-network-congestion-relief">
      <name>Automatic Network Congestion Relief</name>
      <t>This second-level congestion relief mechanism is automated through the intelligent module within the device. Leveraging intelligent traffic analysis, it precisely calculates the volume of traffic requiring redistribution. Subsequently, it redirects this traffic to paired devices via inter-device protocol announcements and the automatic adjustment of routing priorities.</t>
      <figure anchor="fig-1">
        <name>Mechanism Framework Description</name>
        <artwork><![CDATA[
   +-+-+-+-+-+-+-+-+-+-+       +-+-+-+-+-+-+-+-+-+-+-+     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+  
   | Traffic Modeling  |------>| Traffic Monitoring  |---->| Intelligent policy generation |
   +-+-+-+-+-+-+-+-+-+-+       +-+-+-+-+-+-+-+-+-+-+-+     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
                                                                         |
                                                                         |
                               +-+-+-+-+-+-+-+-+-+-+-+     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+  
                               |   Policy Reversion  |<----|      Policy Regulation      |
                               +-+-+-+-+-+-+-+-+-+-+-+     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+      

                                          
]]></artwork>
      </figure>
      <section anchor="step1-traffic-modeling">
        <name>Step1: Traffic Modeling</name>
        <t>The forwarding chip of the device performs real-time traffic sorting using full-flow data and identifies the top N traffic flows.</t>
        <t>The intelligent component of the AI chip subscribes to the BGP RIB-out (Routing Information Base-outbound) and employs intelligent flow recognition algorithms to perform AI-based traffic modeling. This modeling approach, considering factors such as historical traffic patterns and flow behavior, provides a solid basis for subsequent traffic detection and regulation.</t>
        <t>The intelligent flow feature statistics cover multiple dimensions, arranged in a logical order from macroscopic to microscopic traffic characteristics, including flow rate, packet length, the proportion of TCP/UDP traffic, the proportion of fragmented packets, and the proportion of SYN packets.</t>
      </section>
      <section anchor="step2-traffic-monitoring">
        <name>Step2: Traffic Monitoring</name>
        <t>Through the extension of the BGP-LS protocol, the inter-domain link bandwidth and load changes are obtained by the device in real-time.</t>
        <t>When the link bandwidth exceeds the set congestion threshold, the situation where the link bandwidth exceeds the threshold is reported quickly.</t>
        <t>The interface statistics are collected at a second-level time interval.</t>
        <section anchor="bgp-ls-utilized-bandwidth-tlv">
          <name>BGP-LS Utilized Bandwidth TLV</name>
          <t>The device uses the Utilized Bandwidth to aggregate the inter-domain BGP EPE link bandwidth and bandwidth utilization rate.
The BGP-LS Utilized Bandwidth TLV reuses the Maximum Link Bandwidth TLV (Type 1089) <xref target="RFC5305"/>
This TLV is used to describe the bandwidth and bandwidth utilization of inter-domain BGP Egress Peer Engineering (EPE) links.
The format of the BGP-LS Utilized Bandwidth TLV is as follows.</t>
          <figure anchor="fig-2">
            <name>BGP-LS Utilized Bandwidth KEY TLV</name>
            <artwork><![CDATA[
   0                   1                   2                   3
   0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |               Type            |             Length            |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                     Utilized Bandwidth                        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
    
]]></artwork>
          </figure>
        </section>
      </section>
      <section anchor="step3-intelligent-policy-generation">
        <name>Step3: Intelligent Policy Generation</name>
        <t>When its utilization exceeds the predefined congestion threshold, the device launches the intelligent module. The device intelligently identifies the traffic that needs to be adjusted and generates policies based on traffic analysis.</t>
        <t>One policy is that, considering the traffic characteristics such as flow rate and type, the device can make a more accurate calculation. Since the routing prefixes sent to the one network domain (rib-out) are the same, it is only necessary to change the attributes after finding the TOP N routing prefixes to be adjusted, and lower the priority of the faulty plane.</t>
      </section>
      <section anchor="step4-policy-propagation">
        <name>Step4: Policy Propagation</name>
        <t>The device announces the automatic adjustment mechanism of routing priorities through the BGP RPD protocol and then automatically allocates the calculated traffic to the lightly-loaded plane.
The end-to-end process is completed within seconds, effectively alleviating the congestion on the original plane. 
It can precisely control neighbors and paths without affecting the existing routing policies in the network.</t>
      </section>
      <section anchor="step5-policy-reversion">
        <name>Step5: Policy Reversion</name>
        <t>After the interrupted link recovers, the optimization policies will be gradually withdrawn. 
With the revocation of the policies, the network traffic will progressively return to the load-sharing state before the failure.</t>
      </section>
    </section>
    <section anchor="usecase">
      <name>usecase</name>
      <figure anchor="fig-3">
        <name>Intelligent Decision-Making usecase</name>
        <artwork><![CDATA[
 ----------------------------------------
|  --------                   --------   |
| |   C2   |    Network2     |    C1  |  |
|  --------                   --------   |
 ------|--------------------------|------
       |                          |
       |                          x
       | 100GB x 8                x 100GB x 8
       |                          |
-------|--------------------------|------
| +-------                   --------   |
| |  CR2   |    Network1     |  CR1   | |
|  --------                   --------  |
 ----------------------------------------
]]></artwork>
      </figure>
      <section anchor="pre-deployment-conditions">
        <name>Pre-deployment Conditions</name>
        <t>Establish topology mapping in Network 1 between CR1 and CR2, and between CR1/CR2 and C1/C2 in Network 2, clarifying the connection relationships.
Set up BGP-LS peering between CR1 and CR2 to exchange topology and bandwidth information.
Enable BGP EPE functionality via EBGP peering between CR1 and C1, and between CR2 and C2, to obtain link state and bandwidth-related information and generate BGP-LS LINK routes.
Activate the BGP RPD neighbor function on CR1 and CR2 to receive optimized routing policies.</t>
      </section>
      <section anchor="implementation-of-automated-mechanism">
        <name>Implementation of Automated Mechanism</name>
        <ul spacing="normal">
          <li>
            <t>Step1：AI Traffic Modeling  </t>
            <t>
CR1 and CR2 perform real-time and automatic TOP N traffic modeling on the link using the built-in automated algorithms. Without human intervention, they are capable of accurately grasping the traffic conditions of the link.
  The system automatically monitors that the total bandwidth of the traffic channels between CR1-C1 and CR2-C2 is 100 x 8GB, and the current traffic on both paths is 600GB.</t>
          </li>
          <li>
            <t>Step2：Traffic Monitoring  </t>
            <t>
If the CR1 device detects that a total of five links between CR1-C1 have failed, leaving only three links, the system automatically determines that congestion will occur on the CR1-C1 link.</t>
          </li>
          <li>
            <t>Step3：Intelligent Policy Generation  </t>
            <t>
As the primary adjustment device, the intelligent module of CR1 automatically generates optimization policies based on the established TOPN traffic model and the real-time collected prefix information. 
  The entire process does not require human intervention, which greatly shortens the time from failure discovery to policy formulation, and enables timely response to network emergencies to ensure the stable operation of the network.</t>
          </li>
          <li>
            <t>Step4: Policy Propagation  </t>
            <t>
After CR1 generates the optimization policies, it automatically propagates them to C1. Upon receiving the policies, C1 automatically guides the remote routers to adjust their routing paths.  </t>
            <t>
The system intelligently identifies and then automatically diverts the two services with the highest priority from the CR1-C1 link to the CR2-C2 link (through C2) to alleviate the congestion on the CR1-C1 link. After the policy adjustment, the system automatically monitors that the traffic on the CR1-C1 link is reduced to 300G, and the traffic on the CR2-C2 link is increased to 800G. The automated policy propagation and traffic diversion process is efficient and accurate, effectively improving the utilization efficiency of network resources and quickly alleviating the link congestion problem.</t>
          </li>
          <li>
            <t>Step5: Policy Reversion  </t>
            <t>
When the failed links between CR1-C1 are restored, the system automatically detects the link status change and gradually withdraws the relevant optimization policies.
  Upon the automatic revocation of the policies, the network traffic automatically and gradually returns to the load-sharing state before the fault.  </t>
            <t>
This automated mechanism ensures that the network can quickly return to normal operation after the fault is eliminated, reducing the cost of human intervention and improving the self-healing ability of the network.</t>
          </li>
        </ul>
      </section>
    </section>
    <section anchor="security-considerations">
      <name>Security Considerations</name>
      <t>TBD.</t>
    </section>
    <section anchor="iana-considerations">
      <name>IANA Considerations</name>
      <t>TBD.</t>
    </section>
  </middle>
  <back>
    <references anchor="sec-normative-references">
      <name>Normative References</name>
      <reference anchor="RFC5305" target="https://www.rfc-editor.org/info/rfc5305" xml:base="https://bib.ietf.org/public/rfc/bibxml/reference.RFC.5305.xml">
        <front>
          <title>IS-IS Extensions for Traffic Engineering</title>
          <author fullname="T. Li" initials="T." surname="Li"/>
          <author fullname="H. Smit" initials="H." surname="Smit"/>
          <date month="October" year="2008"/>
          <abstract>
            <t>This document describes extensions to the Intermediate System to Intermediate System (IS-IS) protocol to support Traffic Engineering (TE). This document extends the IS-IS protocol by specifying new information that an Intermediate System (router) can place in Link State Protocol Data Units (LSP). This information describes additional details regarding the state of the network that are useful for traffic engineering computations. [STANDARDS-TRACK]</t>
          </abstract>
        </front>
        <seriesInfo name="RFC" value="5305"/>
        <seriesInfo name="DOI" value="10.17487/RFC5305"/>
      </reference>
    </references>
  </back>
  <!-- ##markdown-source:
H4sIAAAAAAAAA71a63LbxhX+rxm9w078x04FRpc4TThtp5Ise5TYjkaSm0k7
/bEElsRGABbdBUgxUfMgfZa+U1+h3zm7C4AUKUuTCzU2Sezt3M93zjJJkt2d
RjeFGovjtjGlbHQq3qtmYeyNODXVTLlGm0pcqkKr6e6OnEysmj9ycmbSSpbY
OrNy2iQ/5tIkstKlTGRcnqTdssTysmR/f3fHTJwpVKPceHenrTPpP9E73lK8
zYxdjoVrst0d105K7Rx2uF7WOOz87Pr17s7ujq7tWDS2dc3h/v5X+4cg3io5
Ft/Wyko60AlZZeKdrORMlapqxDHGd3eIm5k1bT0WTOzuzo1a4mGGratG2Uo1
yStiiA4BI7mxIApyFLpyY/H16O/gE98861/raibCE2Nn2PFHPnwsTnNdSfGh
0qkpMahKqYuxICH98HL/4K8pDbc8OkorTLCGtKQy3RiLr6luIIETpX/ACfTd
tFVDQuF9BwRdEUE8xVN0lbdSi/jskTRVM/fF0dEvJir+VcaS+uekTnH5+vTl
0f7LMSutmg6G4l+SJEJOXGNlCqlf59oJ2FbLStM4wGRtqkidojMs0RuW8IYl
SpWCEe1KMZFOZQIjWKyKQs9oI+w+nWIh7KFYOu2tI1tCaHho1awtWEgjWIFo
ciXUnFaZ6fCkVLa082QppnqirDA1kTKFFFur3J7QDaYQQa422H1wfLHk85wq
pkmuZIExTMN7o0u1J1TlWkumRCe7Rk4Khc2DIRMR9LzyrjjyAit1lhUqivAZ
GS8Lilf89Izl9m8aem8WMpNLkLeJaGHStLViatW/WqZzTxRKZkyLiUcOZUBs
TBQshKZIjJQlntYSDNUGpwpoONBurBuJ61w5JeDBLQ6rFBTpdAMXFxnMinw9
I4anUzoOq8g+RAbilp3GYOa1SnuXLmXVyoIkmv0A9yfipnrWRq9nGkVuWosp
MAUNzhe5TnPh9KzS2FGyPnSVQgMOVJFwM01ntUWzJHFHvuEg0GElq1SNSJbe
Nik6/MaGSfslv65VrlpjtFFIfU25utKNJgWRfSZkn73Zks5ra0iJtDOi+Dxa
7RZTGUhQQAxFoWgY1KWe5M5SRNYqomYzD7TVE53Eu8WjMhmr1cGoqywpIOLi
QS2SivyuYKHJkUtmOZ891GoJ2wB9C90gNHoLU3MNMxJvcYCVM2LkITNgvdVW
pRrSX0KDRUrGEMx1bgrER2Y6rCMH1iweC8dCMNWT1pvOVTtxvXdjV5qAjRva
CszEHcgBJQayQKoTcy2ZRpv4J6T7xqSmAJUVYn/KidVbLFHVe4F3zTIYK0TU
eNPRxsK6lGP9/Pzzz8gPQvwh2fAn/GvjWBjdMjacxfvfievA4juTKTZicZfw
6y/DsYrSXD+KsfOBgmpT6HQp8Dma3N1vSz3v/uu87n7XvX4B10FjD56Pfxde
F5fkSgQN8fRPpLI7P6UbjvHzd6GcXmTWHzlj8Aou8NNYPEMGSw4EY/U/fyLe
dcHmtQWq47j1SrnU6prZ+YQz+7Nn4qpR9cH4nn37XKUoFy+k5WwObFfHIBm9
2adbN4j0MRY4Y9ljW0f/T9uiSKaFWSAzN9JniQw+gWQa4lFjavG+W01T3SgS
MYxygAtIOyEs0MLjc08ZYD6xN6H9DI+cvLkQl+cnCWKHeH4ZIsh5BJAQwgmy
KY1OEImyFz5FlHVhlm7lSKYb4c7MKLFRWipmFIXy0g0xx/F54tNzZKIMsiQI
gygZvwpZIwrKNN+jJOEgBw4ZSGicxVwLoCGdwBIKJilBvLBhLRuqL3y4ZKom
KpdzhMQ9CqxzbIUxSqo688CFsZTrwne3U4baKe1S7AAmbBI5nzRVskEmpbTZ
gDKdOlAP5xElII+uCwJAiNbkS5RtrUVJoAgLgKDCzJgP1EiKkKIpkdVTa1xq
ap82AKH7r4FGGDDheUiHj9sjvFW0bIpeI8hl4FumN6oRhAoaSJT0DlHUZH0+
p1+fXnz24dVF3HbTlCmyKWUa0Ou3C3jh/syr79/HKaOBAx2ONyQBL8o+uavb
xssnWi7sM3l71aXEvQ4CIFsagj0C5nIDRVbZQmdNzjQVRmYkGgZBKFiFmTSY
6rHQwDVDecBOyaR+lysPI9Y2VbepUpl3QqeaIXABNFEuN0XmSQPsbr3jLHKF
kz+yWbea8I5VJEVQCYyR3hTLFUOzBOaGlkV8QSIFbBRLZENGPQRXHGl45VwW
QRHPojw/NLrQP2LdSUfW9du/xfOCeNoI2zfMhj3K2QxOQRj2nk4oqpxdnG3S
Tf+t5V29sMhOR/7wBymEjDqq3slbXbaleEuHrM56Tl0McbD/5VcvxD9CafzP
gEBpHG+M5sFFpnxE5C0fQyoM8z6vkIRz4kLBdc8qoE7l49VzCOEFS8GNulyB
wLpm3Vt4JQhM0anoAn3Ecvsb0tzBhmeHG54dhR0OMHokPhcvxRfij+JL8dVT
nm3HZE/5C7hx9cWqG7xWx99yDFsZ/w1p8a8N6tny+jVpWccthx1u2W4135x9
z5bjkUsXeo/GKxg7ALc3HcbuQp9GmTG09WGoQpWUqSlH0e3hL0SOQqJsyYOf
3i/YuF/RB+FhxbyOeGLVlMNpKk+LQUYPpQ8FPjhqqBawhssHWtw1AdYrPvaj
bysVKw0qzbD5Ks4YHr2WYjvs0eVXnwZhtSsioGZAKW8wCqYRqiV1gGh2rDF9
1aipZKdlffUGId8qqpQJi3iUBjDXlf4h7DxH0CJc9oITAaceSU0uVJ6a2jOQ
pe8DSbvk/gOnQ19ANr5upeQ4bQht6CqLbF9/ewGMeY+aVanvhTS7wGJvG1xy
LmNcm0ru8dSFrNSo6555Y/x8HA3wArBBzjoLHNhELHzd9oK37xRsLH1XWgYM
ci9eDQtrBi6DjpJvdSHSpl313zUDsmHx7hP6LIexJoQzCA8FNokBhfzbmARv
sYVD6iBATq34LLYqfKYGhFLTKYHMufLHg33ZRFUM3Mx4WALmkFyAEv2RCBHn
vu00aGEYapkVUD6InBBYJmaBinPHhxPQl/7QcIy6JcumlkaUYnSi0FNZ6/Z4
Nb4c36sPafSYDaoDBLatiWvGAVQd0EzvJtR7KmOc6U5c6KIgQ5tZmfn2I9Gc
WbmoiNvvdOM1atWcNDUAinGLvZU+WVQc7wuFcJ720rYKWL3qVApVJg6uTgJw
3DqdqKkJrhVaZL0pE3xIEWJiSk4e+drdQYqJXzYkkMHQHc2lhHR6GDNTaK4d
9nnx9IA/3D1t3/D1bjudd5HcjUl4Q877yKTbwaSD/f03J+IWUGJ9Uj/0yINX
qX2QkTuk5aeI/fRyXewHkaDTywP+8ASp3z3JRlby/lGX94c5/BX5O134vZM3
voXA9tj3LC6sSjJFhTqHy1OEGy7MHY2fcWNXO8LxcBwzWyJX1bXvlnY93AO4
QLNQCJPEMYURCGUv3E10A5+RpHgQHw+H6zE3LeBR0+UgolWhprbK50CX65qS
8hXKqrbu6r0AoDcQQB4LXBLSWSR/Fa/rvoGBrc8q7mLHomSKzEIjsqCMRd3X
MxraeuTBOsuBXbAHUnxp6SOcjxsrpCTMJxf5fU9liFkix2/P33/DEZjbtseU
E2JpFdNXDOkdB5QW1iSDIKsQ32J4xcHrUT3W5OeUksg2ujB63PXbu84Yzf3U
97/+99//HJ9v7IGRqQ/JiM2evuEVL1t8FvcQY70DFHMcS9J3xLgka3XRJLoa
3Ab0raWR+C6ktLwtw+2LpcsbcMSJYOnLZFn7e4xpB8OQAJBjXH0P7XV+EhML
0TPyTFKCd0sAoHINOJS+oeGhZGjVNUjTvUWGzQagEp5QuKG1JaedCBNyJEfh
kILhm5O+1QLq7bBJBaFNDPb3CR5rvqAQOhK94g6huM2NF2Lp3NNF6gvYy7e9
AisyMEINIDIrLmXXic7l3KdIAoaFknOvTbpcRHUQFoX2yCbp0Ym2RGURDh3g
Hs7b/vI0mEc4M2glMnkEJj9a4hC/x7GY0SUB4wGg9Oz3Daa1OyZIgG18hfK+
8NgMZ/oqhHBWDLp4BBdY84BOw73X9N0dD8NXwproTZLs3aoOb2aG7oBNEy6r
1Ebf8He1gEOSqi5UcJYab95G6WxuQAbYIzLtGLpxIRHqJqIklDHx4pB8zPHy
7urTqeEFN8KNhchYNBTF6apRPfqi8dOHywfWLqNP0lOvma1gk2ulVX3WYUu/
rCQiTw9G4kPNCYsia4wX/San94yi5R6z12VpGl/bKcs8h9t0DGrbh2Zy3lFk
YhBlttbGWwqYDD5qm6DGhUGZYf0d4yIC5xwpBIbYV2ys6DXHiqA4RCJ+9DwW
VKeHL5iRUKyoLaXK0E9FXxUE6+n97oG4sCGq9lFvnWTun9IvBbird4Qo2AfN
e8t6trTrfqLAC7/EQt+e6PNNoLnu7c3vHO8JdLwfG1R8ioY0xQ/OfSHrrNZ7
uuQLiWBRK32XsDpd+Z0EXMq0Ng3qD03ie1UjszXQB86Aa5UDF9pSupHtdQ1w
H843x3tKqaAFqlHZR8J6Goyxg0eti50IBkH3yrzoN+BJ0v3VJscN2Zi9crU/
8NSycK38X6HI14fusQViWzQDD1759ULfqvARb2DQ3e85EKCjQvvClH/hVQyi
ouz8iA9kQysgoEpyU4YdoMfajvvM96O/v1pcMb6V357IiS4GzZz1H3tcKRgz
jZ+Gjpns6orrk1ej8EOp4/fHWybs/B+PNq5sMSkAAA==

-->

</rfc>
