<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE rfc SYSTEM "rfc2629-xhtml.ent">
<rfc xmlns:xi="http://www.w3.org/2001/XInclude" 
	category="exp" 
	docName="draft-irtf-icnrg-flic-03" 
	ipr="trust200902" 
	obsoletes="" 
	updates="" 
	submissionType="IRTF" 
	xml:lang="en" 
	tocInclude="true" 
	tocDepth="4" 
	symRefs="true" 
	sortRefs="true" 
	version="3">

    <!-- ***** FRONT MATTER ***** -->
  <front>
   	<title abbrev="FLIC">File-Like ICN Collections (FLIC)</title>
   	<seriesInfo name="Internet-Draft" value="draft-irtf-icnrg-flic-03"/>

    <author fullname="Christian Tschudin" initials="C." surname="Tschudin">
		<organization>University of Basel</organization>
    	<address>
       		<email>christian.tschudin@unibas.ch</email>
    	</address>
    </author>
    	
    <author fullname="Christopher A. Wood" initials="C.A." surname="Wood">
    	<organization>Cloudflare</organization>
    	<address>
     		<email>caw@heapingbits.net</email>
    	</address>
    </author>
    	
    <author fullname="Marc Mosko" initials="M.E." surname="Mosko">
    	<organization>PARC, Inc.</organization>
    	<address>
      		<email>marc.mosko@parc.com</email>
    	</address>
    </author>
    
    <author fullname="David Oran" initials="D." surname="Oran" role="editor">
    	<organization>Network Systems Research &amp; Design</organization>
    	<address>
      		<email>daveoran@orandom.net</email>
    	</address>
    </author>
    
    <date/>
    <area>IRTF</area>
    <workgroup>ICNRG</workgroup>
    <keyword>Information Centric Networking</keyword>
    
    <abstract>
    	<t>This document describes a simple "index table" data structure and its associated ICN data objects for organizing a set of primitive ICN data objects into a large, File-Like ICN Collection (FLIC). At the core of this collection is a <em>manifest</em> which acts as the collection's root node. The manifest contains an index table with pointers, each pointer being a hash value pointing to either a final data block or another index table node.</t>
	    </abstract>
  </front>

  <middle>
    <section><name>Introduction</name>
      <t>ICN architectures such as Content-Centric Networking (CCNx)<xref target="RFC8569" format="default"/> and Named Data Networking <xref target="NDN" format="default"/> are well suited for static content distribution.  Each piece of (possibly immutable) static content is assigned a name by its  producer.  Consumers fetch this content using said name. Optionally, consumers may specify the full name of content, which includes its name and a unique (with overwhelming probability) cryptographic digest of said content.  
      <!--      (See <xref target="RFC7927" format="default"/> for a formal definition of "full name".) -->
      </t>

	<aside><t>Note: The reader is assumed to be familiar with general ICN concepts from CCNx or NDN. For general ICN terms, this document uses the terminology defined in <xref target="RFC7927" format="default"/>. Where more specificity is needed, we utilize <xref target="RFC8569" format="default">CCNx</xref> terminology where a Content Object is the data structure that holds application payload. 
<!--	Therefore, it comprises an optional Name, a PayloadType, a Payload, and an optional Signature. -->
	Terms defined specifically for FLIC are enumerated below in <xref target="Terminology"/>.</t></aside> 
	
      <t>To enable requests with full names, consumers need a priori knowledge of content digests.  Manifests, a form of catalog, are data structures commonly employed to store and transport this information.  Typically, ICN manifests are signed content objects (data) which carry a collection of hash digests.  Therefore, as content objects, manifests themselves may be fetched by full name. Thus, manifests may contain either hash digests of, or pointers to, either other manifests or content objects.  A collection of manifests and content objects represents a large piece of application data, e.g., one that cannot otherwise fit in a single content object.</t>
      
      <t>Structurally, this relationship between manifests and content objects is reminiscent of the UNIX inode concept with index tables and memory pointers.  In this document, we specify a simple, yet extensible, manifest data structure called FLIC - <em>File-Like ICN Collection</em>.  FLIC is suitable for ICN protocol suites such as CCNx and NDN.  We describe the FLIC design, grammar, and various use cases, e.g., ordered fetch, seeking, de-duplication, extension, and variable-sized encoding. We also include FLIC encoding examples for CCNx and NDN.</t>
      
      <t>The purpose of a manifest is to concisely name, and hence point to, the constiuent pieces of a larger object. A FLIC manifest does this by using a <em>root</em> manifest to name and cryptographically sign the data structure and then use concise lists of hash-based names to indicate the constituent pieces.  This maintains strong security from a single signature. A Manifest entry gives one enough information to create an <em>Interest</em> for that entry, so it must specify the name, the hash digest, and if needed, the locators.</t>

      
      <t>FLIC is a distributed data structure illustrated by the following picture.</t>
      <figure>
        <name>A FLIC manifest and its directed acyclic graph</name>
        <artwork name="" type="" align="center" alt=""><![CDATA[
            	                root manifest
.------------------------------------.
| optional name:                     |
|   /icn/name/of/this/flic           |
|                                    |
| HashGroup (HG):                    |
|   optional metadata:               |
|     overall digest, locator, etc.  |    .------.
|   hash-valued data pointer -----------> | data |
|     ...                            |    `------'  sub manifest
|   hash-valued manifest pointer ------.     .------------------.
|                                    |  `--> |                ----->
| optional additional HashGroups ..  |       |                ----->
|                                    |       `------------------'
| optional signature                 |
`------------------------------------'
            	]]></artwork>
      </figure>

      <t>A key design decision is how one names the root manifest, the application data, and subsidiary manifests. For this, FLIC uses the concept of a Name Constructor.  The root manifest (in fact, any FLIC manifest) may include a Name Constructor that instructs a manifest reader how to properly create Interests for the associated application data and                subsidiary manifests. The Name Constructors allow interest construction using a well-known, application-independent set of rules.  Some name constructor forms are tailored towards specific ICN protocols, such as CCNx or NDN; some are more general and could work with many protocols.  We describe the allowed Name Constructor methods in  <xref target="nameconstructors" format="default"/>. There are also particulars of how to encode the name schema in a given ICN protocol, which we describe in <xref target="encodings" format="default"/>.</t>
      


      
      <t>FLIC has encodings for <xref target="CCNxEncoding" format="default">CCNx</xref> as per <xref target="RFC8609" format="default">RFC 8609</xref> and for <xref target="NDNEncoding" format="default">NDN</xref>.</t>
      
      <t>An example implementation in Python may be found at <xref target="FLICImplementation" format="default"/>.</t>
      
      <section><name>FLIC as an ICN experimental tool</name>
        <t>FLIC enables experimentation with how to structure and retrieve large data objects and collections in ICN. By having a common data structure applications can rely on, with a common library of code that can be used to create and parse manifest data structures, applications using ICN protocols can both avoid unnecessary reinvention and also have enhanced interoperability. Since the design attempts to balance simplicity, universality, and extensibility, there are a number of important experimental goals to achieve that may wind up in conflict with one another. We provide a partial list of these experimental issues in <xref target="experiments"/>. It is also important for users of FLIC to understand that some flexibility and extensions might be removed if use cases do not materialize to justify their inclusion in an eventual standard.</t>
      </section>
      
      <section><name>Requirements Language</name>
        <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD  NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in <xref target="RFC2119"/>.</t>
      </section>
    </section>

    <section anchor="overview"><name>Design Overview</name>
      <t>The FLIC design adopts the proven UNIX inode concept of direct and indirect pointers, but without the specific structural forms of direct versus indirect. The pointers in FLIC use hash-based naming of Content Objects analogous to the function block numbers play in UNIX inodes. More generally, a FLIC structure is in most cases a tree, although any acyclic di-graph is a legal form.</t>
      
      <t>In FLIC terms, a direct pointer links to application-level data, which is a Content Object with application data in the Payload.  An indirect pointer links to a Content Object with a FLIC Manifest in the Payload.</t>
      
      <t> Because FLIC uses hash-based pointers as names, FLIC graphs are inherently acyclic. Both CCNx and NDN support hash-based naming, though the details differ (see <xref target="CCNxEncoding" format="default"/> and <xref target="NDNEncoding" format="default"/>).</t>
      
      <aside><t>Note: A substantial advantage of using hash-based naming is that it permits block-level de-duplication of application data because two blocks with the same payload will have the same hash name. </t></aside>
      
      <t>The FLIC structure that it is expected most applications would use consists of a root manifest with a strong cryptographic signature and then cryptographically strong  (e.g. SHA256 <xref target="SHS"/>) hash names as pointers to other manifests.  The advantage of this structure is that the single signature in the root manifest covers the entire data structure no matter how many additional manifests are in the data structure.  Another advantage of this structure is it removes the need to use chunk (CCNx) or segment (NDN) name components for the subordinate manifests.</t>
      
      <t>FLIC supports manifest encryption separate from application payload encryption (See <xref target="encryption"/>).  It has a flexible encryption envelope to support various encryption algorithms and key discovery mechanisms. The byte layout allows for in-place encryption and decryption.</t>
      
      <t>A limitation of this approach is that one cannot construct a hash-based name for a child until one knows the payload of that child. In practical terms, this means that one must have the complete application payload available at the time of manifest creation.</t>
      
      <t>FLIC's design allows straightforward applications that just need to traverse a linear set of related objects to do so simply, but FLIC has two extensibility mechanisms that allow for more sophisticated uses: manifest metadata, and pointer annotations. These are described in <xref target="metadata" format="default"/> and <xref target="annotations"/> respectively.</t>
      
      <t anchor='libraries'>FLIC goes to considerable lengths to allow creation and parsing by application-independent library code. Therefore, any options used by applications in the data structure or encryption capabilities MUST NOT require applications to have application-specific Manifest traversal algorithms. This ensures that such application agnostic libraries can always successfully parse and traverse any FLIC Manifest by ignoring the optional capabilities.</t>
    </section>

    <section anchor="structure"><name>FLIC Structure</name>
      <section anchor="Terminology"><name>Terminology</name>
        <dl newline="false" spacing="normal">
          <dt>Data Object:</dt>
          <dd>a CCNx nameless Content Object that usually only has Payload. It might also have an ExpiryTime to limit the lifetime of the data.</dd>
          
          <dt>Direct Pointer:</dt>
          <dd>borrowed from inode terminology, it is a CCNx link using a content object hash restriction and a locator name to point to a Data Object.</dd>
          
          <dt>Indirect Pointer:</dt>
          <dd>borrowed from inode terminology, it is a CCNx link using a content object hash restriction and a locator name to point to a manifest content object.</dd>
          
          <dt>Manifest:</dt>
          <dd>a CCNx ContentObject with PayloadType 'Manifest' and a Payload of the encoded manifest. A leaf manifest only has direct pointers. An internal manifest has a mixture of direct and indirect pointers.</dd>
          
          <dt>Leaf Manifest:</dt>
          <dd>all pointers are direct pointers.</dd>
          
          <dt>Internal Manifest:</dt>
          <dd>some or all pointers are indirect. The order and number of each is up to the manifest builder. By convention, all the direct manifests come first, then the indirect.</dd>
          
          <dt>Manifest Waste:</dt>
          <dd>a metric used to measure the amount of waste in a manifest tree. Waste is the number of unused pointers. For example, a leaf manifest might be able to hold 40 direct pointers, but only 30 of them are used, so the waste of this node is 10. Manifest tree waste is the sum of waste over all manifests in a tree.</dd>
          
          <dt>Root Manifest:</dt>
          <dd>A signed, named, manifest that points to nameless manifest nodes. This structure means that the internal tree structure of internal and leaf manifests have no names and thus may be located anywhere in a namespace, while the root manifest has a name to                            fetch it by.</dd>
          
          <dt>Top Manifest:</dt>
          <dd>One useful manifest structure is to use a Root manifest that points to a single Internal manifest called the Top Manifest. The Top manifest the begins the structure used to organize manifests. It is also possible to elide the two and use only a root manifest that also serves in the role of the top manifest.</dd>
          
          <dt>Name Constructor:</dt>
          <dd>The specification of how to construct an Interest for a Manifest entry.</dd>
          
          <dt>Locator:</dt>
          <dd>A routing hint in an Interest used by forwarding to get the Interest to where it can be matched based on its Name Constructor-derived name.</dd>
        </dl>
      </section>
      
      <section anchor="locators"><name>Locators</name>

        <t>Locators are routing hints used by forwarders to get an Interest to a node in the network that can resolve the Interest's name.  In some naming conventions, the name might only be a hash-based name so the Locator is the only available routing information. Locators exist in both CCNx  and NDN, though the specific protocol mechanisms differ. A FLIC manifest represents locators in the same way for both ICN protocols, though they are encoded differently in the underlying protocol. See <xref target="encodings" format="default"/> for encoding differences.</t>
        
        <t>A manifest Node may define one or more Locator prefixes that can be used in the construction of Interests from the pointers in the manifest.  The Locators are inherited when walking a manifest tree, so they do not need to be defined everywhere.  It is RECOMMENDED that only the Root manifest contain Locators so that a single operation can update the locators.  One use case when storing application payloads at different replicas is to replace the Root manifest with a new one that contains locators for the current replicas.</t>
      </section>
      
      <section anchor="nameconstructors"><name>Name Constructors</name>
        <t>A Manifest may define zero or more name constructors in NameConstructorDefinitions (NCD) located in the Manifest Node.  An NCD associates a Name Constructor Id (NCID) to a Name Constructor. The NCID is used in other parts of the Manifest to refer to that specific definition.</t>
        
        <t>NCID 0 is the default name constructor.  If it is not defined in an NCD, it is assumed to be a HashNamingConstructor.  A Manifest may re-define the default as needed.</t>
        
        <t>A Manifest MUST use locally unique NCIDs in the NCD.</t>
        
        <t>NCDs and their associated NCIDs are inherited as one traverses a manifest.  That is, a manifest consumer must remember the NCDs as it traverses manifests.  If it encounters a HashGroup that uses an unknown NCID, the RECOMMENDED action is to report a malformed manifest to the user.</t>
        
        <t>A Manifest may update an NCID.  If a child manifest re-defines an NCID, the manifest consumer MUST use the new definition from that point forward under that Manifest branch.</t>
        
        <t>It is RECOMMENDED that only the root or similar top-level manifest define NCDs and they not be re-defined in subsequent manifests.</t>
        <t>We expect that an application constructing a Manifest will take one of three approaches to name constructors.  The advantage of using, or re-defining, the default name constructor is that any hash groups that use it do not need to specify an NCID and thus might save some space.</t>
        
        <ul>
          <li>A manifest might define (or use) a default name constructor and mix subsequent Manifest and Data objects under that same namespace.  The manifest only needs to use one Hash Group and can freely mix Manifest and Data pointers.</li>
          <li>
          	<t>A manifest might define (or use) a default name constructor for subsequent Manifests and define a second NCD for the application data. This places all subsequent manifests under the default constructor and places all application data under the second NCD.  The Manifest must use at least two Hash Groups.</t>
          	<t>There are a few options on how to organize the Hash Groups:</t>
        	<ol type="(%d)" spacing="compact"> 
          		<li>Manifest Hash Group followed by Data Hash group,</li>
          		<li>Data Hash Group followed by Manifest Hash Group,</li>
          		<li>Intermix multiple manifest and data hash groups for interleaved reading, or</li>
          		<li>use a data-on-leaf only approach: the interior manifests would use the manifest hash group and the leaves would use the data hash group. Other organizations are possible.</li>
        	</ol>
          </li>
          <li>Define multiple NCDs for subsequent manifests and data, or not use the default NCD, or use some other organization.</li>
        </ul>
        
        <t>In this specification, we define the following four types of Name Constructors.  Additional name constructor types may be specified in a subsequent revision of the specification.  Here, we informally define the name constructors.  <xref target="grammar"/> specifies the encoding of each name constructor.</t>
        
        <dl newline="false" spacing="normal">
          <dt>Type 0 (Interest-Derived Naming):</dt>
          <dd>Use whatever name was used in the Interest to retrieve this Manifest, less a hash component, and append the desired hash value.</dd>
          
          <dt>Type 1 (Data-Derived Naming):</dt>
          <dd>Use the Manifest Name, less a hash component, as the Interest name, and append the desired hash value.</dd>
          
          <dt>Type 2 (Prefix List):</dt>
          <dd>The NCD specifies a list of 1 or more name prefixes.  The consumer may use any (or all) of those prefixes with the desired hash appended.</dd>
          
          <dt>Type 3 (Segmented Naming):</dt>
          <dd>As in Type 2, but the consumer MUST maintain a 0-based counter for each NCID associated with the in-order index of each hash and use that counter as a Segment number in the name.</dd>
        </dl>
      </section>
      
      <section anchor="metadata"><name>Manifest Metadata</name>
        <t>The FLIC Manifest may be extended by defining TLVs that apply to the Manifest as a whole, or alternatively, individually to every data object pointed to by the Manifest. This basic specification does not specify any, but metadata TLVs may be defined through additional RFCs or via Vendor TLVs.  FLIC uses a Vendor TLV structure identical to <xref target="RFC8609" format="default"/> for vendor-specific annotations that require no standardization process.</t>
        
        <t>For example, some applications may find it useful to allow specialized consumers such as <em>repositories</em> (for example <xref target="repository"/>) or enhanced forwarder caches to pre-place, or adaptively pre-fetch data in order to improve robustness and/or retrieval latency. Metadata can supply hints to such entities about what subset of the compound object to fetch and in what order.</t>
        
        <aside><t>Note: FLICs ability to use separate namespaces for the Manifest and the underlying Data allows different                    encryption keys to be used, hence giving a network element like a cache or repository access to the Manifest data does not as a side effect reveal the contents of the application data itself.</t></aside>
      </section>
      
      <section anchor="annotations"><name>Pointer Annotations</name>
        <t>FLIC allows each manifest pointer to be annotated with extra data. Annotations allow applications to exploit metadata about each Data Object pointed to without having to first fetch the corresponding Content Object. This specification defines one such annotation. The <em>SizeAnnotation</em> specifies the number of application layer octets covered by the pointer. </t>
        
        <t>An annotation may, for example, give hints about a desirable traversal order for fetching the data, or an importance/precedence indication to aid applications that do not require every content object pointed to in the manifest to be fetched. This can be very useful for real-time or streaming media applications that can perform error concealment when rendering the media.</t>
        
        <t>Additional annotations may be defined through additional RFCs or via Vendor TLVs. FLIC uses a Vendor TLV structure identical to <xref target="RFC8609"/> for vendor-specific annotations that require no standardization process.</t>
      </section>
      
      <section anchor="grammar"><name>Manifest Grammar (ABNF)</name>
        <t>The manifest grammar is mostly, but not entirely independent of the ICN protocol used to encode and transport it. The TLV encoding therefore follows the corresponding ICN protocol, so for CCNx FLIC uses 2 octet length, 2 octet type and for NDN uses the 1/3/5 octet types and lengths (see <xref target="NDNTLV"/> for details). There are also some differences in how one structures and resolves links. <xref target="RFC8569" format="default"/> defines HashValue and Link for CCNx encodings. The NDN ImplicitSha256DigestComponent defines HashValue and NDN Delegation (from Link Object) defines Link for NDN. <xref target="encodings"/> below                    specifies these differences.</t>
        
        <t>The basic structure of a FLIC manifest comprises a security context, a node, and an authentication tag.  The security context and authentication tag are not needed  if the node is unencrypted.  A node is made up of a set of metadata, the <tt>NodeData</tt>, that applies to the entire node, and one or more <tt>HashGroups</tt> that contain pointers.</t>
        
        <t>The <tt>NodeData</tt> element defines the namespaces used by the manifest. There may be multiple namespaces, depending on how one names subsequent  manifests or data objects.  Each <tt>HashGroup</tt> may reference a single namespace to control how one forms Interests from the <tt>HashGroup</tt>.  If one is using separate namespaces for                    manifests and application data, one needs at least two hash groups.  For a manifest structure of "MMMDDD," (where M means manifest (indirect pointer) and D means data (direct pointer)) for example, one would have a first <tt>HashGroup</tt> for the child manifests with its namespace and a second <tt>HashGroup</tt> for the data pointers with the other namespace. If one used a structure like "MMMDDDMMM," then one would need three hash groups.</t>
        
        <figure anchor="flicgrammar"><name>FLIC Grammar</name>
			<sourcecode type="abnf">
TYPE = 2OCTET / {1,3,5}OCTET ; As per CCNx or NDN TLV
LENGTH = 2OCTET / {1,3,5}OCTET ; As per CCNx or NDN TLV

Manifest = TYPE LENGTH [SecurityCtx] (EncryptedNode / Node) [AuthTag]

SecurityCtx = TYPE LENGTH AlgorithmCtx
AlgorithmCtx = AEADCtx / RsaKemCtx
AuthTag = TYPE LENGTH *OCTET ; e.g. AEAD authentication tag
EncryptedNode = TYPE LENGTH *OCTET ; Encrypted Node

Node = TYPE LENGTH [NodeData] 1*HashGroup
NodeData = TYPE LENGTH [SubtreeSize] [SubtreeDigest] [Locators] 0*Vendor 0*NcDef
SubtreeSize = TYPE LENGTH INTEGER
SubtreeDigest = TYPE LENGTH HashValue

NcDef = TYPE LENGTH NcId NcSchema
NcId = TYPE LENGTH INTEGER
NcSchema = InterestDerivedSchema / DataDerivedSchema / PrefixSchema / SegmentedSchema
InterestDerivedSchema = TYPE LENGTH [ProtocolFlags]
DataDerivedSchema = TYPE LENGTH [ProtocolFlags]
PrefixSchema = TYPE LENGTH Locators [ProtocolFlags]
SegmentedSchema = TYPE LENGTH Locators [ProtocolFlags]

Locators = TYPE LENGTH 1*Link
HashValue = TYPE LENGTH *OCTET ; As per ICN Protocol
Link = TYPE LENGTH *OCTET ; As per ICN protocol

ProtocolFlags = TYPE LENGTH *OCTET; ICN-specific flags, e.g. must be fresh

HashGroup = TYPE LENGTH [GroupData] (Ptrs / AnnotatedPtrs)
Ptrs = TYPE LENGTH *HashValue
AnnotatedPtrs = TYPE LENGTH *PointerBlock
PointerBlock = TYPE LENGTH *Annotation Ptr
Ptr = TYPE LENGTH HashValue

Annotation = SizeAnnotation / Vendor
SizeAnnotation = TYPE LENGTH Integer
Vendor = TYPE LENGTH PEN *OCTET

GroupData = TYPE LENGTH [NcId] [LeafSize] [LeafDigest] [SubtreeSize] [SubtreeDigest]
LeafSize = TYPE LENGTH INTEGER
LeafDigest = TYPE LENGTH HashValue

AEADCtx = TYPE LENGTH AEADData
AEADData = KeyNum AEADNonce Mode
KeyNum = TYPE LENGTH INTEGER
AEADNonce = TYPE LENGTH 1*OCTET
AEADMode = TYPE LENGTH (AEAD_AES_128_GCM / AEAD_AES_256_GCM / AEAD_AES_128_CCM / AEAD_AES_128_CCM)

RsaKemCtx = 2 LENGTH RsaKemData
RsaKemData = KeyId AEADNonce AEADMode WrappedKey LocatorPrefix
KeyId = TYPE LENGTH HashValue; ID of Key Encryption Key
WrappedKey = TYPE LENGTH 1*OCTET
LocatorPrefix = TYPE LENGTH Link
			</sourcecode>
        </figure>

        <dl newline="false" spacing="normal">
          <dt>SecurityCtx:</dt>
          <dd>information about how to decrypt an <tt>EncryptedNode</tt>. The structure will depend on the specific encryption algorithm.</dd>
          
          <dt>AlgorithmId:</dt>
          <dd>The ID of the encryption method (e.g. preshared key, a broadcast encryption scheme, etc.)</dd>
          
          <dt>AlgorithmData:</dt>
          <dd>The context for the encryption algorithm.</dd>
          
          <dt>EncryptedNode:</dt>
          <dd>An opaque octet string with an optional authentication tag (i.e. for AEAD authentication tag)</dd>
          
          <dt>Node:</dt>
          <dd>A plain-text manifest node. The structure allows for in-place encryption/decryption.</dd>
          
          <dt>NodeData:</dt>
          <dd>the metadata about the Manifest node</dd>
          
          <dt>SubtreeSize:</dt>
          <dd>The size of all application data at and below the Node or Group</dd>
          
          <dt>SubtreeDigest:</dt>
          <dd>The cryptographic digest of all application data at and below the Node or Group</dd>
          
          <dt>Locators:</dt>
          <dd>An array of routing hints to find the manifest components</dd>
          
          <dt>HashGroup:</dt>
          <dd>A set of child pointers and associated metadata</dd>
          
          <dt>Ptrs:</dt>
          <dd>A list of one or more Hash Values</dd>
          
          <dt>GroupData:</dt>
          <dd>Metadata that applies to a HashGroup</dd>
          
          <dt>LeafSize:</dt>
          <dd>Size of all application data immediately under the Group (i.e. via direct pointers)</dd>
          
          <dt>LeafDigest:</dt>
          <dd>Digest of all application data immediately under the Group</dd>
          
          <dt>Ptr:</dt>
          <dd>The ContentObjectHash of a child, which may be a data ContentObject (i.e. with Payload) or another Manifest Node.</dd>
        </dl>
      </section>
      
      <section anchor="trees"><name>Manifest Trees</name>
        <section anchor="traversal"><name>Traversal</name>
          <t>FLIC manifests use a pre-order traversal.  This means they are read top to bottom, left to right. The algorithms in <xref target="pseudocode"/> show the pre-order forward traversal code and the reverse-order traversal code, which we use below to construct such a tree. This document does not mandate how to build trees.  <xref target="building-trees"/> provides a detailed example of building inode-like trees.</t>
          
          <t>If using Annotated Pointers, an annotation could influence the traversal order.</t>
          
          <figure anchor="pseudocode"><name>Traversal Pseudocode</name>
            <sourcecode type="c">
preorder(node)
    if (node = null)
        return
    visit(node)
    for (i = 0, i &lt; node.child.length, i++)
        preorder(node.child[i])
                                
reverse_preorder(node)
    if (node = null)
        return
    for (i = node.child.length - 1, i &gt;= 0, i-- )
        reverse_preorder(node.child[i])
    visit(node)
        	</sourcecode>
          </figure>
          
          <t>In terms of the FLIC grammar, one expands a node into its hash groups, visiting each hash group in order.  In each hash group, one follows each pointer in order. <xref target="expansion"/> shows how hash groups inside a manifest expand like virtual children in the tree.  The in-order traversal is M0, HG1, M1, HG3, D0, D1, D2, HG2, D3, D4.</t>
          
          <figure anchor="expansion"><name>Node Expansion</name>
            <artwork align="left" name="" type="" alt=""><![CDATA[
M0 ____
|      \
HG1    HG2
| \    |  \
M1 D2  D3  D4
|
HG3
|  \
D0 D1
                        ]]></artwork>
          </figure>
          
          <t>Using the example manifest tree shown in <xref target="leafonly"/>, the in-order traversal would be: Root, M0, M1, D0, D1, D2, M2, D3, D4, D5, M3, D6, D7, D8. </t>
        </section>
      </section>
      
      <section anchor="encryption"><name>Manifest Encryption Modes</name>
        <t>This document specifies two encryption modes.  The first is a preshared key mode, where the parties are assumed to have the decryption keys already. It uses AES-GCM or AES-CCM. This is useful, for example, when using a key agreement protocol such as <xref target='I-D.wood-icnrg-ccnxkeyexchange'>CCNxKE</xref>. The second is an RSA key encapsulation mode (<xref target='RFC5990'>RsaKem</xref>), which may be used for group keying.</t>
        
        <t>Additional modes may be defined in subsequent specifications.  We expect that an RSA KemDem mode and Elliptic Curve mode should be specified.</t>
        
        <t>All encryption modes use standard encryption algorithms and specifications. Where appropriate, we adopt the TLS 1.2 standards for how to use the encryption algorithms. This section specifies how to encode algorithm parameters or ICN-specific data.</t>
        
        <t>For group key based encryption, we use RsaKem. This specification only details the pertinent aspects of the encryption.  It describes how a consumer locates the appropriate keys in the ICN namespace. It does not specify aspects of a key manager which may or may not be used as part of key distribution and management, nor does it specify the protocol between a key manager and a publisher.  In its simpliest                    form, the publisher could be the key manager, in which case there is no                    extra protocol needed between the publisher and key manager.</t>
        
        <t>While the preshared key algorithm is limited in use, the AES encryption mode described applies to the group key mechanisms too. The group key mechanism facilitates the distribution of the shared key without an on-line key agreement protocol like (the expired draft) <xref target='I-D.wood-icnrg-ccnxkeyexchange'>CCNxKE</xref>.</t>
                
        <section anchor="aead"><name>AEAD Mode</name>
          <t>This mechanism uses <xref target="AESGCM">AES-GEM</xref> or <xref target="RFC3310">AES-CCM</xref> for manifest encryption.  A publisher creating a <tt>SecurityCtx</tt> SHOULD use the mechanisms in <xref target="RFC6655"/> for AES-CCM Nonce generation and <xref target="RFC5288"/> for AES-GCM Nonce generation.</t>
          
          <t>As these references specify, it is essential that the publisher creating a Manifest never use a Nonce more than once for the same key.  For keys exchanged via a session protocol, such as CCNx, the publisher MUST use unique nonces on each Manifest for that session. If the key is derived via a group key mechanism, the publisher MUST ensure that the same Nonce is not used more than once for the same Content Encryption Key.</t>
          
          <t>The AEAD Mode uses <xref target="RFC5116"/> defined symbols AEAD_AES_128_CCM, AEAD_AES_128_GCM, AEAD_AES_256_CCM and AEAH_AES_256_GCM to specify the key length and algorithm.</t>
          
          <t> The <tt>KeyNum</tt> identifies a key on the receiver. The key MUST be exactly of the length specific by the Mode. Many receivers may have the same key with the same <tt>KeyNum</tt>.</t>
          
          <t>When a Consumer reads a manifest that specifies a <tt>KeyNum</tt>, the consumer SHOULD verify that the Manifest's publisher is an expected one for the KeyNum's usage.  This trust mechanism employed to ascertain whether the publisher is expected is beyond the scope of this document, but we provide an outline of one such possible trust mechanism.  When a consumer learns a shared key and <tt>KeyNum</tt>, it associates that <tt>KeyNum</tt> with the publisher ID used in a public key signature.  When the consumer receives a signed manifest (e.g. the root manifest of a manifest tree), the consumer matches the KeyNum's publisher with the Manifest's publisher.</t>
          
          <t> Each encrypted manifest node has a full security context (<tt>KeyNum, Nonce, Mode</tt>). The AEAD decryption is independent for each manifest so Manifest objects can be fetched and decrypted in any order. This design also ensures that if a manifest tree points to the same subtree repeatedly, such as for deduplication, the decryptions are all idempotent.</t>
          
          <t>To encrypt a Manifest, the publisher:</t>
          <ol>
           	<li>Removes any <tt>SecurityCtx</tt> or <tt>AuthTag</tt> from the Manifest.</li>
           	<li>Creates a <tt>SecurityCtx</tt> and adds it to the Manifest.</li>
           	<li>Treats the Manifest TLV through the end of the Node TLV Length as unencrypted authenticated Header.  That includes anything from the start of the Manifest up to but not including the start of the Node's body.</li>
           	<li>Treats the body of the Node to the end of the Manifest as encrypted data.</li>
           	<li>Appends the AEAD <tt>AuthTag</tt> to the end of the Manifest, increasing the Manifest's length</li>            <li>Changes the TLV type of the Node to <tt>EncryptedNode</tt>.</li>
          </ol>
          	
          <t>To decrypt a Manifest, the consumer:</t>
          <ol>
           	<li>Verifies that the <tt>KeyNum</tt> exists and the publisher is trusted for that <tt>KeyNum</tt>.</li>
           	<li>Saves the <tt>AuthTag</tt> and removes it from the Manifest, decreasing the Manifest length.</li>
           	<li>Changes the <tt>EncryptedNode</tt> type to <tt>Node</tt>.</li>
           	<li>Treats everything from the Manifest TLV through the end of the Node Length as unencrypted authenticated Header.  That is, all bytes from the start of the Manifest up to but not including the start of the Node's body.</li>
           	<li>Treats the body of the Node to the end of the Manifest as encrypted data.</li>
           	<li>Verifies and decrypts the data using the key and saved <tt>AuthTag</tt>.</li>
           	<li>If the decryption fails, the consumer SHOULD notify the user and stop further processing of the manifest.</li>
          </ol>
        </section>
        
        <section anchor="rsaoaep"><name>RSA-OAEP Key Transport Mode</name>
          <t>The RSA-OAEP mode uses RSA-OAEP (see <xref target="RFC8017" format="default">RFC8017 Sec 7.1</xref> and <xref target="RSAKEM" format="default"/>) to encrypt a symmetric key that is used to encrypt the Manifest.  We call this RSA key the Key Encryption Key (KEK) and                    each group member has this private key.  A separate key distribuiton system is responsible for distributing the KEK. For our purposes, it is reasonable to assume that the KEK private key is available at a Locator and that group members can decrypt this private key.</t>
          
          <t>The symmetric key MUST be one that is compatible with the AEAD Mode, i.e. a 128-bit or 256-bit random number. Further, the symmetric key MUST fit in the OAEP envelope (which will be true for normal-sized keys).</t>
          
          <t>Any group key protocol and system needed are outside the scope of this document.  We assume there is a Key Manager (KM) and a Publisher (P) and a set of group members.  Through some means, the Publisher therefore has at its disposal:</t>
          <ul>
            <li>A Content Encryption Key (CEK), i.e. the symmetric key.</li>
            <li>The RSA-OAEP wrapped CEK.</li>
            <li>The KeyId of the KEK used to wrap the CEK.</li>
            <li>The Locator of the KEK, which is shared under some group key protocol.</li>
          </ul>
          
          <t>This Manifest specification requires that if a group member fetches the KEK key at Locator it can decrypt the <tt>WrappedKey</tt> and retrieve the CEK.</t>
          
          <t>In one example, a publisher could request a key for a group and the Key Manager could securely communicate (<tt>CEK, Wapped_CEK, KeyId, Locator</tt>) back to the publisher. The Key Manager is responsible for publishing the Locator. In another example, the publisher could be a group member and have a group private key in which case the publisher can create their own key encryption key, publish it under the Locator and proceed. The publisher generates <tt>CEK, Wrapped_CEK, KeyId</tt>, and a Locator on its own.</t>
          
          <t>To create the wrapped key using a Key Encryption Key:</t>
          <ol>
          	<li>Obtain the CEK in binary format (e.g. 32 bytes for 256 bits)</li>
            <li>RSA encrypt the CEK using the KEK public key with OAEP padding, following <xref target="RFC8017">RFC8017 Sec 7.1</xref>. The encryption is not signed because the root Manifest must have been signed by the publisher already.</li>
          </ol>
          
          <t>To decrypt the wrapped key using a Key Encryption Key:</t>
          <ol>
          	<li>RSA decrypt the WrappedKey using the KEK private key with OAEP padding, following <xref target="RFC8017" format="default">RFC8017 Sec 7.1</xref>.</li>
            <li>Verify the unwrapped key is a valid length for the AEADMode.</li>
          </ol>
          
          <t>To encrypt a Manifest, the publisher:</t>
          <ol>
          	<li>Acquires the set of (CEK, Wrapped_CEK, KeyId, Locator).</li>
            <li>Creates a <tt>SecurityCtx</tt> and adds it to the Manifest.  The <tt>SecurityCtx</tt> includes an <tt>AEADNonce</tt> and <tt>AEADMode</tt>, as per AEAD mode.</li>
            <li>Encrypts the Manifest as per AEAD Mode using the RSA-OAEP SecurityCtx and CEK.</li>
          </ol>
          
          <t>To decrypt a Manifest, the consumer:</t>
          <ol>
          	<li>Acquires the KEK from the Key Locator.  If the consumer already has a cached copy of the KeyId in memory, it may use that cached key.</li>
            <li>SHOULD verify that it trusts the Manifest publisher to use the provided key Locator.</li>
            <li>Decrypts the WrappedKey to get the CEK.  If the consumer has already decrypted the same exact WrappedKey TLV block,                            it may use that cached CEK.</li>
            <li>Using the <tt>CEK</tt>, <tt>AEADNonce</tt>, and <tt>AEADMode</tt>, decrypt the Manifest as per AEAD Mode, ignoring the KeyNum steps.</li>
          </ol>
        </section>
        
        <!--
                <section anchor="group" title="ECC Group Keys">
                    <t>NOTE: THIS IS INFORMALLY SPECIFIED AT THIS STAGE</t>
                    <t>
                        We use an EC group key that is encrypted for each user under their public key (also assumed EC) using
                        a standard NIST wrapping C(1e, 2s, ECC CDH)-ECDSA plus AES-KWP.  When a publisher wants to publish,
                        it creates an EC wrapping keypair and publishes the private key wrapped under the group key
                        (use C(1e, 1s, ECC CDH)-ECDSA plus AES-KWP).  Finally, the publisher starts creating master
                        secrets by creating ECDHE-ECDSA offers under the wapping key and using a KDF (e.g. HMAC-SHA256)
                        for the data key.
                    </t>
                    <t>
                        Note that the offer, in this case, is just an ephemeral EC public key plus the curve specification.
                        In compressed form that EC public key is only 48 bytes (SECP384R1), so the whole offer is maybe 60 bytes
                        including TLV encoding.  Compare that to an RSA 4096-bit encryption of a 32-byte AES key at 512 bytes.
                        That's over an 8x improvement.
                    </t>
                    <t>
                        The complete security context would be larger, as you still need the keyid of the wrapping key
                        (32 bytes) plus a link to the wrapping key content object.  Once someone has the wrapping key, they
                        can verify the key id and avoid fetching anything, then calculate the shared secret from the ECDHE
                        offer, then derive the data key using a KDF.   For IoT use cases, it would be useful to be able to
                        link to the whole offer.
                    </t>
                    <t>
                        The KDF is tied to input from the manifest, so the publisher can create many data keys so each
                        FLIC manifest tree has a different key all from the same master key derived from the wrapping key.
                        The offers could be embedded in the root manifest or published externally.  The preferred method
                        is to embed the offer in the root manifest.
                        When the group key changes, a publisher only needs to re-encrypt the wrapping keys.
                        It can save a lot of work depending on how many items it used the wrapping key under that group key.
                        For example, if the publisher uses only one wrapping key per group key, then they only need to re-encrypt one
                        thing for the new group to have access to the old data.
                    </t>
                    <t>
                        This works with RSA keys too, though the specific algorithms change some.  We would use RSA-OAEP instead of
                        the ECDHE method.
                    </t>
                    <t>
                        TODO: There are a variety of issues about how you name the keys and publish them.  We would also need to
                        decide if we want key fetching to be on-line or available from repos, and if so how to do current key fetching.
                    </t>
                </section>
 -->
      </section>
            
      <section anchor="encodings"> <name>Protocol Encodings</name>
      
        <section anchor="CCNxEncoding"><name>CCNx Encoding</name>
          <t>In CCNx, application data content objects use a PayloadType of T_PAYLOADTYPE_DATA. In order to clearly distinguish FLIC Manifests from application data, a different payload type is required. Therefore this specification defines a new payload type of T_PAYLOADTYPE_FLIC.</t>
          
          <figure anchor="CCNxGrammer"><name>CCNx Embedding Grammar</name>
            <sourcecode type="abnf">
ManifestContentObject = TYPE LENGTH [Name] [ExpiryTime] PayloadType Payload
Name = TYPE LENGTH *OCTET ; As per RFC8569
ExpiryTime = TYPE LENGTH *OCTET ; As per RFC8569
PayloadType = TYPE LENGTH T_PAYLOADTYPE_FLIC ; Value TBD
Payload : TYPE LENGTH *OCTET ; the serialized Manifest object
            </sourcecode>
          </figure>
          
          <section anchor="CCNxHashNaming"><name>CCNx Hash Naming</name>
            <t>The Hash Naming namespace uses CCNx nameless content objects.</t>
            <t>It proceeds as follows:</t>
            <ul>
              <li>The Root Manifest content object bound to a name assigned by the publisher and signed by the publisher. It also may have a set of Locators used to fetch the remainder of the manifest. The root manifest has a single HashPointer that points to the Top Manifest.  It may also have cache control directives, such as ExpiryTime.</li>
              
              <li>The Root Manifest has an NsDef that specifies HashSchema.  Its GroupData uses that NsId. All internal and leaf manifests use the same GroupData NsId.  A Manifest Tree MAY omit the NsDef and NsId elements and rely on the default being HashSchema.</li>
              
              <li>The Top Manifest is a nameless CCNx content object.  It may have cache control directies.</li>
              
              <li>Internal and Leaf manifests are nameless CCNx content objects, possibly with cache control directives.</li>
              
              <li>The Data content objects are nameless CCNx content objects, possibly with cache control directives.</li>
              
              <li>To form an Interest for a direct or indirect pointer, use a Name from one of the Locators and put the pointer HashValue into the ContentObjectHashRestriction.</li>
            </ul>
          </section>
          
          <section anchor="CCNxSinglePrefix"><name>CCNx Single Prefix</name>
            <t>The Single Prefix schema uses the same name in all Content Objects and distinguishes them via their ContentObjectHash.  Note that in CCNx, using a SinglePrefix name means that Locators are not used.</t>
            
            <t>It proceeds as follows:</t>
            <ul>
              <li>The Root Manifest content object has a name used to fetch the manifest.  It is signed by the publisher. It has a set of Locators used to fetch the remainder of the manifest.  It has a single HashPointer that points to the Top Manifest.  It may also have cache control directives, such as ExpiryTime.</li>
              
              <li>The Root Manifest has an NsDef that specifies SinglePrefix and the SinglePrefixSchema element specifies the SinglePrefixName.</li>
              
              <li>The Top Manifest has the name SinglePrefixName.  It may have cache control directies.  Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>An Internal or Leaf manifest has the name SinglePrefixName, possibly with cache control directives. Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>The Data content objects have the name SinglePrefixName, possibly with cache control directives.</li>
              
              <li>To form an Interest for a direct or indirect pointer, use SinglePrefixName as the Name and put the pointer HashValue into the ContentObjectHashRestriction.</li>
            </ul>
          </section>
          
          <section anchor="CCNxSegmentedPrefix">
          <name>CCNx Segmented Prefix</name>
            <t>The Segmented Prefix schema uses a different name in all Content Objects and distinguishes them via their ContentObjectHash.  Note that in CCNx, using a SegmentedPrefixSchema means that Locators are not used.</t>
            	<aside><t><strong>Optional</strong>: Use AnnotatedPointers to indicate the segment number of each hash pointer to avoid needing to infer the segment numbers.</t></aside>

            <t>It proceeds as follows:</t>
            <ul spacing="normal">
              <li>The Root Manifest content object has a name used to fetch the manifest.  It is signed by the publisher. It has a set of Locators used to fetch the remainder of the manifest.  It has a single HashPointer that points to the Top Manifest.  It may also have cache control directives, such as ExpiryTime.</li>
              
              <li>The Root Manifest has an NsDef that specifies SegmentedPrefix and the SegmentedPrefixSchema element specifies the SegmentedPrefixName.</li>
              
              <li>The publisher tracks the chunk number of each content object within the NsId. Objects are be numbered in their traversal order.  Within each manifest, the name can be constructed from the SegmentedPrefixName plus a Chunk name component.</li>
              
              <li>The Top Manifest has the name SegmentedPrefixName plus chunk  number.  It may have cache control directies.  It's GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>An Internal or Leaf manifest has the name SegmentedPrefixName  plus chunk number, possibly with cache control directives. Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>The Data content objects have the name SegmentedPrefixName plus chunk number, possibly with cache control directives.</li>
              
              <li>To form an Interest for a direct or indirect pointer, use SegmentedPrefixName plus chunk number as the Name and put the  pointer HashValue into the ContentObjectHashRestriction.  A consumer must track the chunk number in traversal order for each SegmentedPrefixSchema NsId.</li>
            </ul>
          </section>
          
          <section anchor="CCNxHybrid"><name>CCNx Hybrid Schema</name>
            <t>A manifest may use multiple schemas.  For example, the application payload in data content objects might use SegmentedPrefix while the manifest content objects might use HashNaming.</t>
            
            <t>The Root Manifest should specify an NsDef with a first NsId (say 1) as the HashNaming schema and a second                             NsDef with a second NsId (say 2) as the SegmentedPrefix schema along with the SegmentedPrefixName.</t>
            
            <t>Each manifest (Top, Internal, Leaf) uses two or more HashGroups, where each HashGroup has only Direct (with the second NsId) or Indirect (with the first NsId).  The number of hash groups will depend on how the publisher wishes to interleave direct and indirect pointers.</t>
            
            <t>Manifests and data objects derive their names according to the application's naming schema.</t>
          </section>
        </section>
        
        <section anchor="NDNEncoding"><name>NDN Encoding</name>
          <t>In NDN, all Manifest Data objects use a ContentType of FLIC (1024), while all application data content objects use a PayloadType of Blob.</t>
          
          <section anchor="NDNHashNaming"><name>NDN Hash Naming</name>
            <t>In NDN Hash Naming, a Data Object has a 0-length name.  This means that an Interest will only have an ImplicitDigest name component in it.  This method relies on using NDN Forwarding Hints.                         </t>
            
            <t>It proceeds as follows:</t>
            <ul>
              <li>The Root Manifest Data has a name used to fetch the manifest. It is signed by the publisher. It has a set of Locators used to fetch the remainder of the manifest.  It has a single HashPointer that points to the Top Manifest.  It may also have cache control directives.</li>
              
              <li>The Root Manifest has an NsDef that specifies HashSchema.  Its GroupData uses that NsId. All internal and leaf manifests use the same GroupData NsId.  A Manifest Tree MAY omit the NsDef and NsId elements and rely on the default being HashSchema.</li>
              
              <li>The Top Manifest has a 0-length Name.  It may have cache control directies.</li>
              
              <li>Internal and Leaf manifests has a 0-length Name, possibly with cache control directives.</li>
              
              <li>The application Data use a 0-length name, possibly with cache control directives.</li>
              
              <li>To form an Interest for a direct or indirect pointer, the name is only the Implicit Digest name component derived from a pointer's HashValue.  The ForwardingHints come from the Locators.  In NDN, one may use one or more locators within a single Interest.</li>
            </ul>
          </section>

          <section anchor="NDNSinglePrefix"><name>NDN Single Prefix</name>
            <t>In Single Prefix, the Data name is a common prefix used between all objects in that namespace, without a Segment or other counter.  They are distinguished via the Implicit Digest name component.  The FLIC Locators go in the ForwardingHints.</t>
            
            <t>It proceeds as follows:</t>
            <ul spacing="normal">
              <li>The Root Manifest Data object has a name used to fetch the manifest.  It is signed by the publisher.  It has a set of Locators used to fetch the remainder of the  manifest.  It has a single HashPointer that points to the Top Manifest.  It may also have cache control directives.</li>
              
              <li>The Root Manifest has an NsDef that specifies SinglePrefix and the SinglePrefixSchema element specifies the SinglePrefixName.</li>
              
              <li>The Top Manifest has the name SinglePrefixName.  It may have cache control directies.  Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>An Internal or Leaf manifest has the name SinglePrefixName,  possibly with cache control directives. Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>The Data content objects have the name SinglePrefixName, possibly with cache control directives.</li>
              
              <li>To form an Interest for a direct or indirect pointer, use SinglePrefixName as the Name and append the pointer's HashValue into an ImplicitDigest name component. Set the ForwardingHints from the FLIC locators.</li>
            </ul>
          </section>
          
          <section anchor="NDNSegmentedPrefix"><name>NDN Segmented Prefix</name>
            <t>In Segmented Prefix, the Data name is a common prefix plus a segment number, so each manifest or application data object has a unique full name before the implicit digest. This means the consumer must maintain a counter for each SegmentedPrefix namespace.</t>
            	<aside><t><strong>Optional</strong>: Use AnnotatedPointers to indicate the segment number of each hash pointer to avoid needing to infer the segment numbers.</t></aside>
            
            <t>It proceeds as follows:</t>
            <ul spacing="normal">
              <li>The Root Manifest Data object has a name used to fetch the  manifest.  It is signed by the publisher.  It has a set of Locators used to fetch the remainder of the manifest.  It has a single HashPointer that  points to the Top Manifest.  It may also have cache control directives.</li>
              
              <li>The Root Manifest has an NsDef that specifies SegmentedPrefix and the SegmentedPrefixSchema element specifies the SegmentedPrefixName.</li>
              
              <li>The publisher tracks the segment number of each Data object within a SegmentedPrefix NsId. Data is numbered in  traversal order.  Within each manifest, the name is constructed from the SegmentedPrefixName plus a Segment name component.</li>
              
              <li>The Top Manifest has the name SegmentedPrefixName plus segment number.  It may have cache control directies.  Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>An Internal or Leaf manifest has the name SegmentedPrefixName plus segment number, possibly with cache control directives. Its GroupData elements must have an NsId that references the NsDef.</li>
              
              <li>The Data content objects have the name SegmentedPrefixName plus chunk number, possibly with cache control directives.</li>
              
              <li>To form an Interest for a direct or indirect pointer, use  SegmentedPrefixName plus segment number as the Name and put the pointer HashValue into the ImplicitDigest name component.  A  consumer must track the segment number in traversal order for each SegmentedPrefixSchema NsId.</li>
            </ul>
          </section>
          
          <section anchor="NDNHybrid"><name>NDN Hybrid Schema</name>
            <t>A manifest may use multiple schemas.  For example, the application payload in data content objects might use SegmentedPrefix while the manifest content objects might use HashNaming.</t>
            
            <t>The Root Manifest should specify an NsDef with a first NsId (say 1) as the HashNaming schema and a second                             NsDef with a second NsId (say 2) as the SegmentedPrefix schema along with the SegmentedPrefixName.</t>
            
            <t>Each manifest (Top, Internal, Leaf) uses two or more HashGroups, where eash HashGroup has only Direct (with the second NsId) or Indirect (with the first NsId).  The number of hash groups will depend on how the publisher wishes to interleave direct and indirect pointers.</t>
            
            <t>Manifests and data objects derive their names according to the application's naming schema.</t>
            
          </section>
        </section>
      </section>
      
      <section><name>Example Structures</name>
        <section><name>Leaf-only data</name>
          <figure anchor="leafonly"><name>Leaf-only manifest tree</name>
            <artwork align="center" name="" type="" alt=""><![CDATA[
           Root
            |
     ______ M0 _____
    /       |       \
   M1       M2       M3
 / | \    / | \    / | \
D0 D1 D2 D3 D4 D5 D6 D7 D8
                        ]]></artwork>
          </figure>
        </section>
        
        <section><name>Linear</name>
          <t>Of special interest are "skewed trees" where a pointer to a manifest may only appear as last pointer of (sub-) manifests. Such a tree becomes a sequential list of manifests with a maximum of datapointers per manifest packet. Beside the tree shape we also show this data structure in form of packet content where D stands for a data pointer and M is the hash of a manifest packet.</t>
          <artwork align="left" name="" type="" alt=""><![CDATA[
Root -> M0 ----> M1 ----> ...
|->DDDD  |->DDDD
                        ]]></artwork>
        </section>
      </section>
    </section>
    
    <section anchor="experiments" numbered="true" toc="default">
      <name>Experimenting with FLIC</name>
      <t>FLIC is expected to enable a number of salient experiments in the use of ICN protools by applications. These experiments will help not only to inform the desirable structure of ICN applications but reflect back to the features included in FLIC to evaluate their usefulness to those applications. While many interesting design aspects of FLIC remain to be discovered through experience, a number of important questions to be answered through experimentation include:</t>
      <ul>
        <li>use for just files or other collections like directories</li>
        <li>use for particular applications, like streaming media manifests</li>
        <li>utility of pointer annotations to optimize retrieval</li>
        <li>utility of the encryption options for use by repositories and forwarders</li>
        <li>need for application metadata in manifests</li>
      </ul>
    </section>
    
    <section anchor="Usage"><name>Usage Examples</name>
      <section anchor="Locating">
      <name>Locating FLIC leaf and manifest nodes</name>
        <t>The names of manifest and data objects are often missing or not unique, unless using specific naming conventions. In this example, we show how using manifest locators is used to generate Interests.  Take for example the figure below where the root manifest is named             by hash h0.  It has nameless children with hashes with hashes h1 ... hN.             </t>
        
        <figure anchor="Locating1"><name>Data Organization</name>
          <artwork align="left" name="" type="" alt=""><![CDATA[
Objects:
manifest(name=/a/b/c, ptr=h1, ptr=hN)  - has hash h0
nameless(data1)                        - has hash h1
...
nameless(dataN)                        - has hash hN

Query for the manifest:
interest(name=/a/b/c, implicitDigest=h0)
                ]]></artwork>
        </figure>
        
        <t>After obtaining the manifest, the client fetches the contents. In this first instance, the manifest does not provide any Locators data structure, so the client must continue using the name it used for the manifest.</t>
        
        <figure anchor="Locating2"><name>Data Interests</name>
          <artwork align="left" name="" type="" alt=""><![CDATA[
interest(name=/a/b/c, implicitDigest=h1)
...
interest(name=/a/b/c, implicitDigest=hN)
                ]]></artwork>
        </figure>
        
        <t>Using the locator metadata entry, this behavior can be changed:</t>
        <figure anchor="Locating3"><name>Using Locators</name>
          <artwork align="left" name="" type="" alt=""><![CDATA[
Objects:
manifest(name=/a/b/c,
hashgroup(loc=/x/y/z, ptr=h1)
hashgroup(ptr=h2)             - has hash h0
nameless(data1)               - has hash h1
nameless(data2)               - has hash h2

Queries:
interest(name=/a/b/c, implicitDigest=h0)
interest(name=/x/y/z, implicitDigest=h1)
interest(name=/a/b/c, implicitDigest=h2)
                ]]></artwork>
        </figure>
      </section>
      
      <section anchor="Seeking"><name>Seeking</name>
        <t>Fast seeking (without having to sequentially fetch all content) works                     by skipping over entries for which we know their size.  The following                     expression shows how to compute the byte offset of the data pointed                     at by pointer <tt>P_i</tt>, call it <tt>offset_i</tt>.  In this formula, let <tt>P_i.size</tt> represent the Size value of the <em>i</em>-th pointer.</t>
        
        <sourcecode type="pseudocode">
offset_i = \sum_{k=1}^{i-1} &gt;  P_k.size.
	</sourcecode>
        
        <t>With this offset, seeking is done as follows:</t>
        <figure anchor="Seeking1"><name>Seeking Algorithm</name>
          <sourcecode type='pseudocode'>
Input: seek_pos P, a FLIC manifest with a hash group having N entries
Output: pointer index i and byte offset o, or out-of-range error
Algorithm:
    offset = 0
    for i in 1..N do
        if (P &gt; offset + P_i.size)
            return (i, P - offset)
    offset += P_i.size
    return out-of-range
          </sourcecode>
        </figure>
        
        <t>Seeking in a BlockHashGroup is different since offsets can be quickly                 computed.  This is because the size of each pointer P_i except the                 last is equal to the SizePerPtr value.  For a BlockHashGroup with N                 pointers, OverallByteCount D, and SizePerPointer L, the size of P_N                 is equal to the following:</t>
        <artwork align="left" name="" type="" alt="">
        	<![CDATA[D - ((N - 1) * L)]]>
        </artwork>

        <t>In a BlockHashGroup with k pointers, the size of P_k is equal to:</t>
        <artwork align="left" name="" type="" alt="">
        	<![CDATA[D - L * (k - 1)]]>
        </artwork>

        <t>Using these, the seeking algorithm can be thus simplified to the following:</t>
        <figure anchor="Seeking2"><name>Seeking Algorithm</name>
          <sourcecode type='pseudocode'>
Input: seek_pos P, a FLIC manifest with a hash group having
OverallByteCount S and SizePerPointer L.
Output: pointer index i and byte offset o, or out-of-range error
Algo:
    if (P > S)
        return out-of-range
    i = floor(P / L)
    if (i > N)
        return out-of-range # bad FLIC encoding
    o = P mod L
    return (i, o)
           </sourcecode>
        </figure>
        
        <aside><t><strong>Note</strong>: In both cases, if the pointer at position i is a manifest pointer, this algorithm has to be called once more, seeking to seek_pos o inside that manifest.</t></aside>
        
      </section>
      <section anchor="dedup"><name>Block-level de-duplication</name>
        <t>Consider a huge file, e.g. an ISO image of a DVD or program in binary be patched. In this case, all existing encoded ICN chunks can remain in the repository while only the chunks for the patch itself is added to a                 new manifest data structure, as is shown in the diagram below.  For                 example, the <xref target="venti">venti archival file system of Plan9</xref> uses this technique.</t>
        
        <figure anchor="dedup1"><name>De-duplication</name>
          <artwork align="center" name="" type="" alt=""><![CDATA[
old_mfst -  - > h1 --> oldData1  <-- h1 < -  -  new_mfst
         \  - > h2 --> oldData2  <-- h2 < -  - /
          \            replace3  <-- h5 < -  -/
           \- > h3 --> oldData3              /
            \ > h4 --> oldData4  <-- h4 < - /
                    ]]></artwork>
        </figure>
      </section>
      
      <section anchor="GrowingCollections"><name>Growing ICN collections</name>
        <t>A log file, for example, grows over time.  Instead of having to re-                     FLIC the grown file it suffices to construct a new manifest with a                     manifest pointer to the old root manifest plus the sequence of data                     hash pointers for the new data (or additional sub-manifests if                     necessary).</t>
        <aside><t><strong>Note</strong> that this tree will not be skewed (anymore).</t></aside>
        
        <figure anchor="Growing"><name>Growing A Collection</name>
          <artwork align="center" name="" type="" alt=""><![CDATA[
old data < -  -  -  mfst_old <-- h_old -  - mfst_new
                                            /
new data1 <-- h_1 -  -  -  -  -  -  -  -  -/
new data2                                 /
...                                      /
new dataN <-- h_N -  -  -  -  -  -  -  -/
                    ]]></artwork>
        </figure>
      </section>
      
      <section anchor="Republishing">
      <name>Re-publishing a FLIC under a new name</name>
        <t>There are several use cases for republishing a collection under a new namespace, or having one collection exist under several namespaces:</t>
        <ul spacing="normal">
          <li>It can happen that a publisher's namespace is part of a service provider's prefix.  When switching provider, the publisher may want to republish the old data under a new name.</li>
          
          <li>A publishes wishes to distribute its content to several repositories and would like a result to be delivered from the repository for consumers who have good connectivity to that repository.  For example, the publisher /alpha wishes to place content at /beta and /gamma, but routing only to /alpha would not send a request to either /beta or /gamma.  The operators of of /beta and /gamma could create a named and signed version of the root manifest with appropriate keys (or delegate that to /alpha) so the results are always delivered by the corresponding repository without having to change the bulk of the manifest tree.</li>
        </ul>
        <t>This can easily be achieved with a single nameless root manifest for the large FLIC plus arbitrarily many per-name manifests (which are signed by whomever wants to publish this data):</t>
        
        <figure anchor="Relocating"><name>Relocating A Collection</name>
          <artwork align="left" name="" type="" alt=""><![CDATA[

data < - nameless_mfst() <-- h  < - mfst(/com/example/east/the/flic)
                                < - mfst(/com/example/west/old/the/flic)
                                < - mfst(/internet/archive/flic234)
                    ]]></artwork>
        </figure>
        <aside><t>Note that the hash computation (of h) only requires reading the nameless root manifest, not the entire FLIC.</t></aside>
        
        <t>This example points out the problem of HashGroups having their own locator metadata elements: A retriever would be urged to follow these hints which are "hardcoded" deep inside the FLIC but might have become  outdated.  We therefore recommend to name FLIC manifests only at the highest level (where these names have no locator function).  Child nodes in a FLIC manifest should not be named as these names serve no purpose except retrieving a sub-tree's manifest by name, if would be required.                  </t>
      </section>
    </section>

    <section anchor="IANA"><name>IANA Considerations</name>
      <t>IANA is requested to perform the actions in the following sub-sections.</t>
      <aside><t>IANA should also note that FLIC uses the definitions of AEAD_AES_128_GCM, AEAD_AES_128_CCM, AEAD_AES_256_GCM, AEAD_AES_256_CCM from <xref target="RFC5116"/>.</t></aside>
      
      <section><name>FLIC Payload Type</name>
      	<t>Register FLIC as a Payload Type in the <em>CCNx Payload Types</em> Registry referring to the description in <xref target='CCNxEncoding'/> as follows:</t>
      	<table><name>FLIC CCNx Payload Type</name>
      	  <thead>
      		<tr>
      		  <th align='center'>Type</th>
      		  <th align='center'>Name</th>
      		  <th align='center'>Reference</th>
      		</tr>
      	   </thead>
      	  <tbody>
      	  	<tr>
      	  	  <td align='center'>TBA</td>
      	  	  <td align='center'>T_PAYLOADTYPE_FLIC</td>
      	  	  <td align='center'><xref target='CCNxEncoding'/> and Section 3.6.2.2.1 of <xref target='RFC8609'/> </td>
      	  	</tr>
      	  </tbody>
      	</table>
      </section>
      
      <section><name>FLIC Manifest Metadata and Annotation TLVs</name>
      	<t>Create the following registry to be titled <em>FLIC Manifest Metadata and Annotation TLVs</em> Manifest Metadata is described in <xref target='metadata'/>; Pointer Annotations are described in <xref target='annotations'/>. The registration procedure is <strong>Specification Required</strong>. The Type value is 2 octets.  The range is 0x0000-0xFFFF. Allocate a value for the single <em>SizeAnnotation</em> TLV.</t>
      	<table><name>FLIC Manifest Metadata and Annotation TLVs</name>			
      	  <thead>
      		<tr>
      		  <th align='center'>Type</th>
      		  <th align='center'>Name</th>
      		  <th align='center'>Reference</th>
      		</tr>
      	   </thead>
      	  <tbody>
      	  	<tr>
      	  	  <td align='center'>TBA</td>
      	  	  <td align='center'>T_SIZE_ANNOTATION</td>
      	  	  <td align='center'>Size (<xref target='annotations'/>)</td>
      	  	</tr>
      	  </tbody>
      	</table>
      </section>
    </section>
    
    <section anchor="Security"><name>Security Considerations</name>
      <t>TODO Need a discussion on:</t>
      <ul spacing="normal">
        <li>signing and hash chaining security. (<strong>Note: Did I cover this adequately below?</strong>)</li>
        <li>republishing under a new namespace. (<strong>Note: need help here - is this to reinforce that you can re-publish application data by creating a new root Manifest and signing that, requiring only one signature to change?</strong>)</li>
        <li>encryption mechanisms. (<strong>Note: did I cover this adequately below?</strong>)</li>
        <li>encryption key distribution mechanisms.(<strong>Note: not sure what needs to be said here</strong>)</li>
        <li>discussion of privacy, leaking of linkability information - <strong>could really use some help here</strong>.</li> 
      </ul>
      <t><strong>Anything else?????</strong>.</t>
      
      <section><name>Integrity and Origin Authentication of FLIC Manifests</name>
      	<t>A FLIC Manifest is used to describe how to form Interests to access large CCNx or NDN application data. The Manifest is itself either an individual content object, or a tree of content objects linked together via the corresponding content hashes. The NDN and CCnx protocol architectures directly provide both individual object integrity (using cryptographically strong hashes) and data origin authentication (using signatures). The protocol specifications, <xref target='NDN'/> and <xref target='RFC8609'>CCNx</xref> respectively, provide the protocol machinery and keying to support strong integrity and authentication. Therefore, FLIC utilizes the existing protocol specifications for these functions, rather than providing its own. There are a few subtle differences in the handling of signatures and keys in NDN and CCNx worth recapitulating here:</t>
      	<ul>
      		<li>NDN in general adds a signature to every individual data packet rather than aggregating signatures via some object-level scheme. When employing FLIC Manifests to multi-packet NDN objects, it is expected that the individual packet signatures would be elided and the signture on the Manifest used instead.</li>
      		<li>In contrast, CCNx is biased to have primitive objects or pieces thereof be "nameless" in the sense they are identified only by their hashes rather than each having a name directly bound to the content through an individual signature. Therefore, CCNx depends heavily on FLIC (or an alternative method) to provide the name and the signed binding of the name to the content described in the Manifest</li>
      	</ul>
      	<t>A FLIC Manifest therefore gets integrity of its individual pieces through the existing secure hashing procedures of the underlying protocols. Origin authentication of the entire Manifest is achieved through hash chaining and applying a signature <strong>only</strong> to the root Manifest of a manifest tree. It is important to note that the Name of the Manifest, which is what the signature is bound to, need not bear any particular relationship to the names of the application objects pointed to in the Manifest via Name Constructors. This has a number of important benefits described in <xref target='nameconstructors'/>.</t>
      </section>
      
      <section><name>Confidentiality of Manifest Data</name>
      	<t>ICN protocol architectures like CCNx and NDN, while providing integrity and origin authentication as described above, leaves confidentiality issues entirely in the domain of the ICN application. Therefore, since FLIC is an application-level construct in both NDN and CCNx, it is incumbent on this specification for FLIC to provide the desired confidentiality properties using encryption. One could leave the specification of Manifest encryption entirely in the hands of the individual application utilizing FLIC, but this would be undesirable for a number of reasons:</t>
      	<ul>
      		<li>The sensitivity of the information in a Manifest may be different from the sensitivity of the application data it describes. In some cases, it may not be necessary to encrypt manifests, or to encrypt them with a different keying scheme from that used for the application data</li>
      		<li>One of the major capabilities enabled by FLIC is to allow repositories or forwarding caches to operate on Manifests (see in particular <xref target='metadata'/>). In order to allow such intermediaries to interpret manifests without revealing the underlying application data, separate encryption and keying is necessary</li>
      		<li>A strong design goal of FLIC is <em>universality</em> such that it can be used transparently by many different ICN applications. This argues that FLIC should have a set of common encryption and keying capabilities that can be delegated to library code and not have to be re-worked by each individual application (see <xref target='libraries'/>)</li>
      	</ul>
      	
      	<t>Therefore, this specification directly specifies two encryption encapsulations and associated links to key management, as described in <xref target='encryption'/>. As more experience is gained with various use cases, additional encryption capabilities may be needed and hence we expect the encryption aspects of this specification to evolve over time.</t>
      </section>
      
      <section><name>Privacy of names and linkability of access patterns</name>
      	<t>What to say here, if anything?</t>
      </section>
      	  
    </section>
  </middle>
  
  <back>
    <references><name>References</name>
      <references><name>Normative References</name>
        <xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.2119.xml"/>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.3310.xml"/>
<!--    <xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.3394.xml"/> -->
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.5116.xml"/>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.5288.xml"/>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.5990.xml"/>	
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.6655.xml"/>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.8017.xml"/>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.8569.xml"/>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.8609.xml"/>
      </references>

      <references><name>Informative References</name>
		<xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml/reference.RFC.7927.xml"/>
        <xi:include href="https://xml2rfc.tools.ietf.org/public/rfc/bibxml3/reference.I-D.draft-wood-icnrg-ccnxkeyexchange-02.xml"/>
        
        <reference anchor="AESGCM" target="https://doi.org/10.6028/NIST.SP.800-38D">
          <front>
            <title>Recommendation for Block Cipher Modes of Operation: Galois/Counter Mode (GCM) and GMAC</title>
            <author surname="Dworkin" initials="M."/>
            <date year="2007"/>
          </front>
          <seriesInfo name="National Institute of Standards and Technology" value="SP 800-38D"/>
        </reference>

        <reference anchor="SHS" target="https://csrc.nist.gov/publications/fips/fips180-4/
              fips180-4_final.pdf">
          <front>
            <title>Secure Hash Standard, United States of American,
              National Institute of Science and Technology, Federal
              Information Processing Standard (FIPS) 180-4</title>
            <author fullname="National Institute of Science and Technology"/>
        	<date year="2012"/>
          </front>
          <seriesInfo name="National Institute of Standards and Technology" value="SP 180-4"/>
        </reference>
        
        <reference anchor="RSAKEM" target="https://doi.org/10.6028/NIST.SP.800-56Br2">
          <front>
            <title>Recommendation for Pair-Wise Key-Establishment Using Integer Factorization Cryptography</title>
            <author surname="Barker" initials="E."/>
            <author surname="Chen" initials="L."/>
            <author surname="Roginsky" initials="A."/>
            <author surname="Vassilev" initials="A."/>
            <author surname="Davis" initials="R."/>
            <author surname="Simon" initials="S."/>
            <date year="2019"/>
          </front>
          <seriesInfo name="National Institute of Standards and Technology" value="SP 800-56B Rev. 2"/>
        </reference>

        <reference anchor="NDN" target="https://named-data.net/project/execsummary/">
          <front>
            <title>Named Data Networking</title>
            <author surname="NDN team"/>
            <date>various</date>
          </front>
        </reference>

        <reference anchor="NDNTLV" target="http://named-data.net/doc/ndn-tlv/">
          <front>
            <title>NDN Packet Format Specification.</title>
            <author surname="NDN Project Team"/>
            <date year="2016"/>
          </front>
        </reference>

        <reference anchor="repository" target="https://redmine.named-data.net/projects/repo-ng/wiki/Repo_Protocol_Specification">
          <front>
            <title>Repo Protocol Specification</title>
            <author surname="NDN Project Team"/>
            <date>Various</date>
          </front>
        </reference>
        
        <reference anchor="venti" target="http://doc.cat-v.org/plan_9/4th_edition/papers/venti/">
        	<front>
        		<title>Venti: a new approach to archival storage</title>
        		<author surname="Sean Quinlan and Sean Dorward"/>
        		<date year="2002"/>
        	</front>
        	<seriesInfo name="Bell Labs Document Archive" value="/sts/doc"/>
        </reference>

        <reference anchor="FLICImplementation" target="https://github.com/mmosko/ccnpy">
          <front>
            <title>FLIC Implementation in Python</title>
            <author surname="Mosko" initials="M."/>
            <date>various</date>
          </front>
        </reference>
      
      </references>
    </references>
  
    <section anchor="building-trees"><name>Building Trees</name>
      <t>This appendix describes one method to build trees.  It constructs a pre-order tree in a single pass of the application data, going from the tail to the beginning. This allows us to work up the right side of the tree in a single pass, then work down each left branch until we exhaust the data.  Using the reverse-order traversal, we create the right-most-child manifest, then its parent, then the indirect pointers of that parent, then the parent's direct pointers,then the parent of the parent (repeating). This process uses recursion, as it is the clearest way to show the code. A more optimized approach could do it in a true single pass.</t>
      
      <t>Because we're building from the bottom up, we use the term 'level' to be the distance from the right-most child up. Level 0 is the bottom-most level of the tree, such as where node 7 is:</t>
      <artwork align="center" name="" type="" alt=""><![CDATA[
      1
  2      3
4  5    6  7
preorder: 1 2 4 5 3 6 7
reverse:  7 6 3 5 4 2 1
                ]]></artwork>
                
      <t>The Python-like pseudocode build_tree(data, n, k, m) algorithm creates a tree of n data objects. The data[] array is an array of Content Objects that hold application payload; the application data has already been packetized into n Content Object packets.An interior manifest node has k direct pointers and m indirect pointers.</t>
      <sourcecode type='python'>
build_tree(data[0..n-1], n, k, m)
    # data is an array of Content Objects (Data in NDN) with application payload.
    # n is the number of data items
    # k is the number of direct pointers per internal node
    # m is the number of indirect pointers per internal node
            
    segment = namedtuple('Segment', 'head tail')(0, n)
    level = 0
    
    # This bootstraps the process by creating the right most child manifest
    # A leaf manifest has no indirect pointers, so k+m are direct pointers
    root = leaf_manifest(data, segment, k + m)
    
    # Keep building subtrees until we're out of direct pointers
    while not segment.empty():
        level += 1
        root = bottom_up_preorder(data, segment, level, k, m, root)
    
    return root
      </sourcecode>
      
      <sourcecode type='python'>
bottom_up_preorder(data, segment, level, k, m, right_most_child=None)
    manifest = None
    if level == 0:
        assert right_most_child is None
        # build a leaf manifest with only direct pointers
        manifest = leaf_manifest(data, segment, k + m)
    else:
        # If the number of remaining direct pointers will fit in a leaf node, make one of those.
        # Otherwise, we need to be an interior node
        if right_most_child is None and segment.length() &lt;= k + m:
            manifest = leaf_manifest(data, segment, k+m)
        else:
            manifest = interior_manifest(data, segment, level, k, m, right_most_child)
    return manifest
      </sourcecode>
      
      <sourcecode type='python'>
leaf_manifest(data, segment, count)
    # At most count items, but never go before the head
    start = max(segment.head(), segment.tail() - count)
    manifest = Manifest(data[start:segment.tail])
    segment.tail -= segment.tail() - start
    return manifest
      </sourcecode>
      
      <sourcecode type='python'>
interior_manifest(data, segment, level, k, m, right_most_child)
    children = []
    if right_most_child is not None:
        children.append(right_most_child)
    
    interior_indirect(data, segment, level, k, m, children)
    interior_direct(data, segment, level, k, m, children)
    
    manifest = Manifest(children)
    return manifest, tail
      </sourcecode>
      
      <sourcecode type='python'>
interior_indirect(data, segment, level, k, m, children)
    # Reserve space at the head of the segment for this node's direct pointers before
    # descending to children.  We want the top of the tree packed.
    reserve_count = min(k, segment.tail - segment.head)
    segment.head += reserve_count
    
    while len(children) &lt; m and not segment.head == segment.tail:
        child = bottom_up_preorder(data, segment, level - 1, k, m)
        # prepend
        children.insert(0, child)
    
    # Pull back our reservation and put those pointers in our direct children
    segment.head -= reserve_count
      </sourcecode>
      
      <sourcecode type='python'>
interior_direct(data, segment, level, k, m, children)
    while len(children) &lt; k+m and not segment.head == segment.tail:
        pointer = data[segment.tail() - 1]
        children.insert(0, pointer)
        segment.tail -= 1
      </sourcecode>
    </section>
  </back>
</rfc>
