<?xml version="1.0" encoding="US-ASCII"?>
<!-- This is built from a template for a generic Internet Draft. Suggestions for
     improvement welcome - write to Brian Carpenter, brian.e.carpenter @ gmail.com 
     This can be converted using the Web service at http://xml.resource.org/ -->
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<!-- You want a table of contents -->
<!-- Use symbolic labels for references -->
<!-- This sorts the references -->
<!-- Change to "yes" if someone has disclosed IPR for the draft -->
<!-- This defines the specific filename and version number of your draft (and inserts the appropriate IETF boilerplate -->
<?rfc sortrefs="yes"?>
<?rfc toc="yes"?>
<?rfc symrefs="yes"?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>
<?rfc topblock="yes"?>
<?rfc comments="no"?>
<rfc category="info" docName="draft-chuyi-nmrg-ai-agent-network-00"
     ipr="trust200902">
  <front>
    <title
    abbrev="Large Model based Agents for Network Operation and Maintenance">Large
    Model based Agents for Network Operation and Maintenance</title>

    <author fullname="Chuyi Guo" initials="C." surname="Guo">
      <organization>China Mobile</organization>

      <address>
        <postal>
          <street/>

          <city>Beijing</city>

          <code>100053</code>

          <country>China</country>
        </postal>

        <email>guochuyi@chinamobile.com</email>
      </address>
    </author>

    <!---->

    <date month="March" year="2025"/>

    <area>Networking</area>

    <workgroup>Network Management Research Group</workgroup>

    <keyword>Network Operation and Maintenance; Large Model; Agent</keyword>

    <abstract>
      <t>Current advancements in AI technologies, particularly large models,
      have demonstrated immense potential in content generation, reasoning,
      analysis and so on, providing robust technical support for network
      automation and self-intelligence. However, in practical network
      operations, challenges such as system isolation and fragmented data lead
      to extensive manual, repetitive, and inefficient tasks, the improvement
      of intelligence level is very necessary. This document identifies
      typical scenarios requiring enhanced intelligence, and explains how AI
      Agents and large model technologies can empower networks to address
      operational pain points, reduce manual efforts, and explore impacts on
      network data, system architectures, and interfaces correspondingly. It
      further explores and summarizes standardization efforts in
      implementation.</t>
    </abstract>

    <note title="Requirements Language">
      <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
      "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
      document are to be interpreted as described in <xref
      target="RFC2119">RFC 2119</xref>.</t>
    </note>
  </front>

  <middle>
    <section anchor="Intro" title="Introduction">
      <section title="Large Models">
        <t>Large models refer to AI systems based on deep learning techniques,
        containing massive parameters (typically billions to trillions). It is
        trained on large-scale datasets, and is capable of capturing complex
        patterns and associations, demonstrating outstanding abilities in
        natural language processing, image generation, decision-making, and
        reasoning.</t>

        <t>Recent breakthroughs in models like GPT-4 and DeepSeek have
        continuously pushed technical boundaries and enhancing the performance
        of models.Users can use the capabilities of large models by accessing
        or deploying inference models, and combining with Fine tuning, Prompt
        Learning, etc.</t>

        <t>The big model has been empowered in multiple vertical domains,
        like:</t>

        <t><list style="symbols">
            <t>Research: AlphaFold for protein structure prediction, Galactica
            for scientific paper assistance. Industry: Generative design
            (e.g., automotive/chip architecture optimization), automated code
            development (GitHub Copilot).</t>

            <t>Finance: Risk prediction, automated report generation.</t>
          </list></t>

        <t>In the future, large models will also move towards embodied AI ,
        embedding model capabilities into physical terminals such as robots
        and autonomous driving, continuously building an open-source developer
        ecosystem, opening up some model capability interfaces, and promoting
        industry collaborative innovation.</t>
      </section>

      <section title="AI Agent">
        <t>Intelligent agent, as an important concept in the field of
        artificial intelligence, refers to a system that can autonomously
        perceive the environment, make decisions, and execute actions. It has
        basic characteristics such as autonomy, interactivity, reactivity, and
        adaptability, and can independently complete tasks in complex and
        changing environments. Intelligent agents have the ability to learn
        and make decisions. Through learning algorithms and data analysis,
        they can extract useful information from massive amounts of data and
        form their own knowledge base. In the decision-making process,
        intelligent agents can comprehensively consider various factors and
        use methods such as logical reasoning and probability statistics to
        make the optimal decision. This ability gives intelligent agents a
        significant advantage in solving complex problems.</t>

        <t>There are four design patterns for intelligent agent workflow:</t>

        <t><list style="symbols">
            <t>Reflection: Let the agent review and revise the output
            generated by themselves;</t>

            <t>Tool Use: LLM generates code, calls APIs, and performs
            practical operations;</t>

            <t>Planning: Let the agent decompose complex tasks and execute
            them according to the plan;</t>

            <t>Multi-agent Collaboration: Multiple agents play different roles
            and collaborate to complete tasks.</t>
          </list></t>

        <t>At present, intelligent agents have been used in the following
        scenarios:</t>

        <t><list style="symbols">
            <t>Personal assistant: <list style="symbols">
                <t>Cross platform task agent: Automatically organize emails,
                schedule meetings, and manage schedules (such as Microsoft
                Copilot).</t>

                <t>Life Butler: Adjust smart homes according to user habits
                and recommend personalized health plans.</t>
              </list></t>

            <t>Industry Intelligence: <list style="symbols">
                <t>Financial advisory: Real time analysis of market data,
                generation of investment portfolio recommendations, and
                automatic execution of trades.</t>

                <t>Medical diagnosis: Provide dynamic treatment
                recommendations based on the patient's medical history and
                real-time monitoring data. Industrial operation and
                maintenance: Predicting equipment failures and scheduling
                maintenance resources to optimize production line
                efficiency.</t>
              </list></t>

            <t>Virtual world interaction:</t>

            <t><list style="symbols">
                <t>Game NPC: Intelligent characters with emotions and memories
                (such as AI driven open world NPCs).</t>

                <t>Metaverse Guide: Help users explore virtual spaces and
                provide personalized content recommendations.</t>
              </list></t>

            <t>Scientific research:</t>

            <t><list style="symbols">
                <t>Laboratory assistant: Automatically design experiments,
                analyze data, and propose hypotheses (such as chemical
                synthesis agents).</t>

                <t>Climate simulation: Coordinating multidimensional data
                models to predict extreme weather and generate response
                plans.</t>
              </list></t>
          </list></t>
      </section>
    </section>

    <section title="Acronyms &amp; Abbreviations">
      <t><list style="hanging">
          <t hangText="Large model:">Machine learning models with large-scale
          parameters and computing power are typically constructed from deep
          neural networks, containing billions or even hundreds of billions of
          parameters, capable of understanding text, images, speech, and other
          content, and performing tasks such as text generation, image
          generation, inference question answering, and scientific
          prediction.</t>

          <t hangText="AI Agent:">An AI agent is an intelligent entity with
          autonomous perception, decision-making, and execution capabilities,
          driven by goals in dynamic environments.</t>
        </list></t>
    </section>

    <section anchor="UC" title="Use case">
      <section title="Scenario 1: Network Migration Operations">
        <t>The current network undergoes a large number of service migration
        or device switchover every day/month, which have a high degree of
        similarity in steps and processes, involving querying and filling a
        large amount of data and configuration. There are two typical types of
        migrations: service provisioning (for external service data
        configuration) and migration change (for internal tasks such as route
        publishing and network optimization). Large models naturally have the
        ability to process and recognize massive amounts of data, and
        intelligent agents can guide the process of each step like experienced
        experts.</t>

        <t>Automation via large models and agents can reduce errors and free
        human resources. Key tasks include:</t>

        <t><list style="symbols">
            <t>Migration Plan Generation: Designing workflows and deployment
            strategies.</t>

            <t>Plan Auditing: Checking configurations, compliance, and
            correcting errors (e.g., typos, hallucinations).</t>

            <t>Automated Execution: Replacing manual configurations with
            AI-generated scripts, call corresponding systems to finish
            tasks.</t>
          </list></t>

        <t>Taking the service provisioning scenario as an example, typically,
        when doing migration, it was necessary to manually log in the device
        configuration parameters. Now, through the interaction of the large
        model, the large model generates a script to distribute the device,
        also configure and audit it. The agent can call other systems, such as
        digit twin platform for script testing, view the impact of the changed
        parameters, and return to the assigned system to reduce manual errors.
        Finally, based on the analysis of the results, it can achieve
        automatic distribution when there shows no problem.</t>
      </section>

      <section title="Scenario 2: Network Fault Handling">
        <t>(Content to be expanded)</t>
      </section>
    </section>

    <section anchor="AF" title="Architecture and Functionality">
      <t>Intelligent agents based on large models can automate network
      operations by coordinating system scheduling and leveraging diverse
      capabilities of large models. This process involves multiple
      interactions with systems such as large models and network management
      systems. Each agent has specialized functions, such as agents for intent
      understanding or agents dedicated to fault localization and demarcation
      in specific network scenarios. Current operational systems already
      provide basic data support, foundational atomic capabilities, and
      well-defined orchestration workflows for task execution. However, most
      processes are manually connected, involve repetitive mechanical work,
      and lack an intelligent coordination "brain". See Figure 1.</t>

      <figure anchor="Architecture"
              title="Architecture of Large Model based Agents">
        <artwork>                        Agents                                       Network
+------------------------------------------------------+ +---------------------------+
|                                                      | |                           |
|                    +------------+                    | |Network Systems &amp; Platforms|      
|                    | Perception |                    | |                           |
|                    +------+-----+                    +-&gt;        AI Models          |
|                           |                          | |                           |
|                  +--------v--------+                 | |    Atomic Capabilities    |
|  +----------+    |      Brain      |    +----------+ | |                           |
|  | Planning &lt;+-+-+                 +-+-+&gt;  Action  | | |          Tools            |
|  +----------+    | LLM | LVM | LSM |    +----------+ &lt;-+                           |
|                  +------+--^-------+                 | |           Data            |
|                         |  |                         | |                           |
|                    +----v--+----+                    | |                           |     
|                    |   Memory   |                    | |                           |              
|                    +------------+                    | |                           |
+------------------------------------------------------+ +---------------------------+     </artwork>
      </figure>

      <t>Functions of Agents:</t>

      <t><list style="symbols">
          <t>Intent Recognition: Understand and interpret user input
          intentions. Determine whether subsequent tasks require identifying
          suitable agents or multi-turn dialogues to complete intent
          recognition and parsing.</t>

          <t>Intent Classification and Analysis: Decompose tasks based on
          recognized user intent.Categorize tasks according to different
          functional requirements.</t>

          <t>Perception: Proactively receive alarms, threshold-exceeding
          notifications, or environmental change information, issuing warnings
          when necessary.Accept task requests from other systems, potentially
          involving multimodal data processing.</t>

          <t>Memory: <list style="symbols">
              <t>Long-term memory: Stores user habits, domain-specific
              processing experiences (e.g., failure/success cases, encountered
              faults) in knowledge bases.</t>

              <t>Short-term memory: Caches temporary processing data (e.g.,
              context).</t>
            </list></t>

          <t>Agents perform reflection and error correction by interacting
          with long-term memory and contextual information.</t>

          <t>Planning: Analyze and decompose intent based on task objectives
          and learned knowledge. Orchestrate subtasks (e.g., breaking complex
          problems into simpler ones). Identify required system components
          (other agents, large models, APIs, etc.).</t>

          <t>Decision-Making: Finalize execution plans and match workflows to
          current tasks. Generate instantiated, executable solutions by
          aligning system components, data, and model strategies.</t>

          <t>Execution: Convert orchestrated results into
          network-understandable commands. Execute tasks by mobilizing
          resources and dynamically adjusting based on feedback.</t>

          <t>Multi-Agent Collaboration:<list style="symbols">
              <t>Team Collaboration: Enable coordinated teamwork among
              multiple agents.</t>

              <t>Competitive Collaboration: Manage competitive relationships
              to avoid efficiency loss.</t>
            </list></t>
        </list></t>
    </section>

    <section anchor="Data" title="Data">
      <t>The data that an agent can learn or perceive includes expert
      knowledge in operation and maintenance processes, logs, configuration
      rules, policy knowledge, case manuals, alarms, network topologies, fault
      reports, and more.</t>
    </section>

    <section anchor="Atomic" title="Standardized Atomic Capabilities">
      <t>Atomic capability refers to a series of orchestrated workflows
      designed to accomplish a subtask. It encapsulates various APIs, exposes
      a unified interface and capabilities externally, and serves as the
      minimal functional unit for achieving specific subtasks. Atomic
      capabilities can be defined with standardized inputs and outputs to
      facilitate cross-system communication and calls.</t>
    </section>
  </middle>

  <back>
    <references title="Informative References">
      <reference anchor="LLMbasedAgents">
        <front>
          <title>Exploring Large Language Model based Intelligent Agents:
          Definitions, Methods, and Prospects.</title>

          <author fullname="Yuheng Cheng" initials="Y. Cheng" surname="Cheng">
            <organization/>
          </author>

          <author fullname="Ceyao Zhang" initials="C. Zhang" surname="Zhang">
            <organization/>
          </author>

          <author fullname="Zhengwen Zhang" initials="Z. Zhang"
                  surname="Zhang">
            <organization/>
          </author>

          <author fullname="Xiangrui Meng" initials="X. Meng" surname="Meng">
            <organization/>
          </author>

          <author fullname="Sirui Hong" initials="S. Hong" surname="Hong">
            <organization/>
          </author>

          <date month="January" year="2024"/>
        </front>
      </reference>
    </references>

    <references title="Normative References">
      <?rfc include="reference.RFC.2119"?>
    </references>
  </back>
</rfc>
