<?xml version="1.0" encoding="US-ASCII"?>
<!-- This is built from a template for a generic Internet Draft. Suggestions for
     improvement welcome - write to Brian Carpenter, brian.e.carpenter @ gmail.com 
     This can be converted using the Web service at http://xml.resource.org/ -->
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<!-- You want a table of contents -->
<!-- Use symbolic labels for references -->
<!-- This sorts the references -->
<!-- Change to "yes" if someone has disclosed IPR for the draft -->
<!-- This defines the specific filename and version number of your draft (and inserts the appropriate IETF boilerplate -->
<?rfc sortrefs="yes"?>
<?rfc toc="yes"?>
<?rfc symrefs="yes"?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>
<?rfc topblock="yes"?>
<?rfc comments="no"?>
<rfc category="info" docName="draft-chen-tsvwg-high-speed-data-express-00"
     ipr="trust200902">
  <front>
    <title abbrev="Transport and Services Working Group">High speed data
    express in IP: Concept, Reference Architecture and Technologies</title>

    <author fullname="Danyang Chen" initials="D." surname="Chen">
      <organization>China Mobile</organization>

      <address>
        <postal>
          <street/>

          <city>Beijing</city>

          <code>100053</code>

          <country>China</country>
        </postal>

        <email>chendanyang@chinamobile.com</email>
      </address>
    </author>

    <author fullname="Zhiqiang Li" initials="Z." surname="Li">
      <organization>China Mobile</organization>

      <address>
        <postal>
          <street/>

          <city>Beijing</city>

          <code>100053</code>

          <country>China</country>
        </postal>

        <email>lizhiqiang@chinamobile.com</email>
      </address>
    </author>

    <!---->

    <date day="8" month="July" year="2024"/>

    <area>Networking</area>

    <workgroup>Internet Research Task Force</workgroup>

    <keyword>high data express, IP</keyword>

    <abstract>
      <t>With the rapid development of AI technology, intelligent computing
      and super-computing services, the demand for wide-area transmission of
      massive data is increasing. This paper makes full use of the advantages
      of IP network, such as statistical reuse and elastic supply, and
      proposes a reference architecture of high elasticity and high throughput
      data express based on IP network. In order to support differentiated
      data express services and create task-type data express services, the
      "one low and three high" technical system is proposed.</t>
    </abstract>

    <note title="Requirements Language">
      <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
      "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
      document are to be interpreted as described in <xref
      target="RFC2119">RFC 2119</xref>.</t>
    </note>
  </front>

  <middle>
    <section anchor="intro" title="Introduction">
      <t>With the rapid development of AI technology, intelligent computing,
      and super-computing services, the demand for wide-area transmission of
      massive data is increasing. How to transfer a large amount of data
      quickly, efficiently and conveniently has become an important problem
      that needs to be solved in the development of communication networks.
      With the network bandwidth from 10G to 25G, 100G, 200G and even 400G,
      bandwidth is not the bottleneck of data transmission, the key to data
      transmission is how to make full use of large bandwidth to achieve
      efficient throughput.</t>

      <t>In most cases, the effective throughput is less than the throughput,
      which in turn is less than the bandwidth. Bandwidth refers to the number
      of bits that can be transmitted per second on the link, which depends on
      the link clock rate and channel coding, also known as line speed
      (network device transmission capacity). Throughput refers to the actual
      amount of data transmitted on the bandwidth (terminal device
      transmission capacity). Goodput refers to the throughput that truly
      meets service requirements, excluding invalid packets such as
      retransmission and discarding (service data transmission
      capability)..</t>

      <t>In-depth analysis shows that the data transmission throughput
      performance is mainly affected by five factors: transmission distance,
      network packet loss rate, effective data length , sending window and the
      impact factor. As the transmission distance increases, RTT (round trip
      delay) increases. In reliable transmission mode, the reply time of ACK
      acknowledgement messages becomes longer, and the sending sliding window
      cannot be adjusted in time. Network packet loss rate will affect the
      rate adjustment of congestion control algorithm, and increase the tail
      delay of data transmission. Effective data length refers to the
      effective data length of the packet header removed, which can be
      regarded as MSS. Increasing MSS will help improve throughput. The
      sending window affects the duration of continuous data sending. If the
      window is too small, data will be sent before the ACK reply is
      completed, and sending will be paused. The impact factor F is a
      comprehensive parameter, such as the CPU frequency, PCIE rate, disk I/O
      rate, and bandwidth that affect the packet processing rate, 0&lt;F&lt;1,
      if these have no impact, the value of F is 1. In summary, we can draw
      the following conclusion: in addition to improving the network
      environment (improving the physical bandwidth, etc.), the end-to-end
      data sending and receiving mechanism and transmission protocol are also
      a key part of the effective throughput.</t>

      <t>In order to improve the data transmission efficiency and reduce the
      data transmission cost as much as possible, we propose to build a highly
      elastic and high-throughput data transmission network based on IP
      network, and use elastic bandwidth, load balancing, security encryption
      and other technologies to realize new data express services.</t>
    </section>

    <section title="Reference Architecture">
      <t>The data express solution enables massive data transmission from the
      data source to the destination. Data sources are devices or systems that
      generate or store large amounts of data, such as enterprise servers,
      databases, and storage devices. Data sources can be multiple nodes
      distributed across different geographies and network environments, or
      they can be a single node concentrated in one location. Data destination
      refers to the device or system that receives or processes a large amount
      of data, such as cloud computing platform, intelligent computing center,
      and supercomputing center. The data destination can be a public service
      provided by a third party or a cloud data center built by a data express
      operator.</t>

      <t>Based on the massive transmission requirements from the data source
      to the destination end, the overall architecture of the data express
      solution is designed according to the hierarchical idea, which mainly
      includes three parts: cloud private network infrastructure, network
      controller and data express operation platform, as shown in</t>

      <figure>
        <artwork>                         +--------------------+                      
                         |                    |                      
                         |    Data Express    |                      
          Task  -------&gt; | Operation Platform |                      
        Delivery         |                    |                      
                         +---------+----------+                      
                                   |                                 
                        +----------+-----------+                     
                        |                      |                     
        +---------------+  Network Controller  +---------+           
        |               |                      |         |           
        |               +-----------+----------+         |           
        |                           |                    |           
        |                           |                    |           
+-------+---------------------------+--------------------+----------+
|                                                                   |
| +-----------+                                       +-----------+ |
| |Data Source|                                       |  Super    | |
| |  Memory   |                                +------+ Computing | |
| | card/disk |                Wide Area       |      |  Center   | |
| +-----+-----+     +------+Interconnection +--+--+   +-----------+ |
|       |           |Cloud +----------------+Cloud|                 |
|       |           |  PE  |                | PE  |                 |
|       |           +---+--+                +--+--+   +-----------+ |
| +-----v------+        |                      |      |  General  | |
| |Data Express|        |                      +------+ Computing | |
| |  Station   +--------+                             |   Center  | |
| +------------+                                      +-----------+ |
|                                                                   |
|                      Infrastructure                               |
+-------------------------------------------------------------------+</artwork>
      </figure>

      <t><list>
          <t>Infrastructure<list>
              <t>Data Express can use the existing network infrastructure to
              provide data transmission services for customers, and can access
              different data source and data consumption end through SPN, PON,
              optical fiber direct connection and other means according to the
              situation of live network resources, so as to achieve dynamic
              scheduling and optimization of network resources and ensure the
              quality and efficiency of data transmission. SPN is the
              recommended access mode for data express because of its large
              access bandwidth, elastic bandwidth configuration, and wide
              coverage.</t>
            </list></t>

          <t>Network Controller</t>

          <t>Network controllers include SPN and other network controllers in
          different fields, which complete the network configuration according
          to the requirements of the service platform, and realize the whole
          process automation and intelligent opening of the data express
          service. The main functions include:<list>
              <t>Network configuration: Automatically or manually configure
              parameters such as network topology, routes, and security
              policies based on network requirements and scenarios to achieve
              flexibility and customization.</t>

              <t>Information collection: Collects and analyzes network
              performance, quality, and security indicators in real time to
              optimize and improve data express services.</t>

              <t>Network monitoring: Monitors the network in real time to
              detect and handle anomalies such as network faults, attacks, and
              leaks, ensuring data transmission security and stability.</t>
            </list></t>

          <t>Operation Platform<list>
              <t>Operation platform refers to the online platform that
              provides users with one-stop data express service management and
              monitoring functions, and provides users with business ordering,
              opening, monitoring, billing and other functions. The operation
              platform can be accessed through the web or mobile terminal to
              realize the visualization and intelligence of the whole process
              of data express business. The main functions include:<list>
                  <t>Service order: Users can order data express services
                  online on the operation platform, and select parameters such
                  as data source, destination, transmission mode, and
                  transmission time to generate service orders.</t>

                  <t>Enabling the service: Users can enable the online data
                  express service on the operation platform. Based on the
                  order information, users can automatically or manually
                  configure data sources, destinations, and network resources
                  to start data transfer tasks.</t>

                  <t>Service monitoring: Users can monitor data express
                  services online on the operation platform and view data
                  transfer progress, status, rate, and quality in real time to
                  detect and handle anomalies in a timely manner.</t>

                  <t>Service charging: Users can charge online data express
                  services on the operating platform, and generate and pay
                  bills based on preset charging rules based on data transfer
                  volume, duration, and bandwidth.</t>
                </list></t>
            </list></t>
        </list></t>
    </section>

    <section title="&quot;One low, three high&quot; technology system">
      <t>TBD</t>
    </section>

    <section title="Summary">
      <t>TBD</t>
    </section>

    <section anchor="Security" title="Security Considerations">
      <t>TBD.</t>
    </section>

    <section anchor="IANA" title="IANA Considerations">
      <t>This document has no requests to IANA.</t>
    </section>
  </middle>

  <back>
    <references title="Normative References">
      <?rfc include="reference.RFC.2119"?>
    </references>
  </back>
</rfc>
