<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
<!-- 
  Internet-Draft: draft-kamimura-scitt-refusal-events-00
  Title: Verifiable AI Refusal Events using SCITT Transparency Logs
  Author: TOKACHI KAMIMURA
  Organization: VeritasChain Standards Organization (VSO)
  Target WG: SCITT (Supply Chain Integrity, Transparency, and Trust)
  
  Revision Notes (-01):
  - Replaced "Negative Proof" with "Verifiable Refusal Record"
  - Clarified completeness invariant scope (event-semantics, verifier-side)
  - Added explicit statement: TS does not enforce completeness
  - Removed salting as normative requirement
  - Added Conformance subsection
  - Added Motivation subsection
  - Added Future Work section (non-normative)
  - Clarified ERROR semantics and failure modes
  - Strengthened policy-agnostic stance
-->
<rfc xmlns:xi="http://www.w3.org/2001/XInclude"
     docName="draft-kamimura-scitt-refusal-events-00"
     category="info"
     ipr="trust200902"
     submissionType="IETF"
     version="3">

  <front>
    <title abbrev="SCITT-Refusal-Events">
      Verifiable AI Refusal Events using SCITT Transparency Logs
    </title>

    <seriesInfo name="Internet-Draft" 
                value="draft-kamimura-scitt-refusal-events-00"/>

    <author fullname="TOKACHI KAMIMURA" initials="T." surname="Kamimura">
      <organization>VeritasChain Standards Organization</organization>
      <address>
        <postal>
          <country>Japan</country>
        </postal>
        <email>standards@veritaschain.org</email>
        <uri>https://veritaschain.org</uri>
      </address>
    </author>

    <date year="2026" month="January" day="10"/>

    <area>Security</area>
    <workgroup>Supply Chain Integrity, Transparency, and Trust</workgroup>

    <keyword>SCITT</keyword>
    <keyword>AI safety</keyword>
    <keyword>refusal events</keyword>
    <keyword>transparency service</keyword>
    <keyword>audit trail</keyword>
    <keyword>verifiable refusal record</keyword>

    <abstract>
      <t>This document describes a SCITT-based mechanism for creating 
      verifiable records of AI content refusal events. It defines how 
      refusal decisions can be encoded as SCITT Signed Statements, 
      registered with Transparency Services, and verified by third 
      parties using Receipts.</t>
      
      <t>This specification provides auditability of refusal decisions 
      that are logged, not cryptographic proof that no unlogged 
      generation occurred. It does not define content moderation 
      policies, classification criteria, or what AI systems should 
      refuse; it addresses only the audit trail mechanism.</t>
    </abstract>

    <note removeInRFC="true">
      <name>About This Document</name>
      <t>The latest version of this document, along with implementation
      resources and examples, can be found at <xref target="CAP-SRP"/>.</t>
      <t>Discussion of this document takes place on the SCITT Working
      Group mailing list (scitt@ietf.org).</t>
    </note>
  </front>

  <middle>
    <!-- ===== Section 1: Introduction ===== -->
    <section anchor="introduction">
      <name>Introduction</name>
      
      <t>This document is NOT a content moderation policy. It does not 
      prescribe what AI systems should or should not refuse to generate, 
      nor does it define criteria for classifying requests as harmful. 
      The mechanism described herein is agnostic to the reasons for 
      refusal decisions; it provides only an interoperable format for 
      recording that such decisions occurred. Policy decisions regarding 
      acceptable content remain the domain of AI providers, regulators, 
      and applicable law.</t>

      <section anchor="motivation">
        <name>Motivation</name>
        <t>AI systems capable of generating content increasingly implement 
        safety mechanisms to refuse requests deemed harmful, illegal, or 
        policy-violating. However, these refusal decisions typically leave 
        no verifiable audit trail. When a system refuses to generate content, 
        the event vanishes—there is no receipt, no log entry accessible to 
        external parties, and no mechanism for third-party verification.</t>

        <t>This creates several problems:</t>
        <ul>
          <li>Regulators cannot independently verify that AI providers 
          enforce stated policies</li>
          <li>Providers cannot prove to external auditors that specific 
          requests were refused</li>
          <li>Third parties investigating incidents have no way to 
          establish refusal without trusting provider claims</li>
          <li>The completeness of audit logs cannot be verified externally</li>
        </ul>

        <t>The SCITT architecture <xref target="I-D.ietf-scitt-architecture"/> 
        provides primitives—Signed Statements, Transparency Services, and 
        Receipts—that can address this gap. This document describes how 
        these primitives can be applied to AI refusal events.</t>
      </section>

      <section anchor="requirements-language">
        <name>Requirements Language</name>
        <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", 
        "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED",
        "MAY", and "OPTIONAL" in this document are to be interpreted as
        described in BCP 14 <xref target="RFC2119"/> <xref target="RFC8174"/>
        when, and only when, they appear in all capitals, as shown here.</t>
      </section>

      <section anchor="scope">
        <name>Scope</name>
        <t>This document describes:</t>
        <ul>
          <li>Terminology for refusal events mapped to SCITT primitives</li>
          <li>A data model for ATTEMPT and DENY events as Signed Statement 
          payloads</li>
          <li>A completeness invariant for audit trail integrity checking</li>
          <li>An integration approach with SCITT Transparency Services</li>
        </ul>
        <t>This document does NOT define:</t>
        <ul>
          <li>Content moderation policies (what should be refused)</li>
          <li>Classification algorithms or risk scoring methods</li>
          <li>Thresholds or criteria for refusal decisions</li>
          <li>General SCITT architecture (see 
          <xref target="I-D.ietf-scitt-architecture"/>)</li>
          <li>Legal or regulatory requirements for specific jurisdictions</li>
        </ul>
      </section>

      <section anchor="limitations">
        <name>Limitations</name>
        <t>This specification provides auditability of refusal decisions 
        that are logged, not cryptographic proof that no unlogged generation 
        occurred. An AI system that bypasses logging entirely cannot be 
        detected by this mechanism alone. Detection of such bypass requires 
        external enforcement mechanisms (e.g., trusted execution environments, 
        attestation) which are outside the scope of this document.</t>
        
        <t>This profile does not require Transparency Services to enforce 
        completeness invariants; such checks are performed by verifiers 
        using application-level logic.</t>
      </section>
    </section>

    <!-- ===== Section 2: Terminology ===== -->
    <section anchor="terminology">
      <name>Terminology</name>
      
      <t>This document uses terminology from 
      <xref target="I-D.ietf-scitt-architecture"/>. The following terms 
      are specific to this profile:</t>
      
      <dl>
        <dt>Generation Request</dt>
        <dd>A request submitted to an AI system to produce content. 
        May include text prompts, reference images, or other inputs.</dd>
        
        <dt>Refusal Event</dt>
        <dd>A decision by an AI system to decline a generation request 
        based on safety, policy, or other criteria. Results in no 
        content being produced.</dd>
        
        <dt>ATTEMPT</dt>
        <dd>A Signed Statement recording that a generation request was 
        received. Does not indicate the outcome.</dd>
        
        <dt>DENY</dt>
        <dd>A Signed Statement recording that a generation request was 
        refused. References the corresponding ATTEMPT via AttemptID.</dd>
        
        <dt>GENERATE</dt>
        <dd>A Signed Statement recording that content was successfully 
        generated in response to a request. References the corresponding 
        ATTEMPT via AttemptID.</dd>
        
        <dt>ERROR</dt>
        <dd>A Signed Statement indicating that no content was generated 
        due to system failure (e.g., timeout, resource exhaustion, model 
        error) rather than a policy decision. ERROR does not constitute 
        a refusal and does not indicate policy enforcement. References 
        the corresponding ATTEMPT via AttemptID.</dd>
        
        <dt>Outcome</dt>
        <dd>A Signed Statement recording the result of a generation 
        request: DENY (refusal), GENERATE (successful generation), or 
        ERROR (system failure).</dd>
        
        <dt>Verifiable Refusal Record</dt>
        <dd>An auditable record consisting of an ATTEMPT Signed Statement, 
        a DENY Signed Statement, and Receipts proving their inclusion in 
        a Transparency Service. This provides evidence that a refusal 
        decision was logged, but does not cryptographically prove that 
        no unlogged generation occurred.</dd>
        
        <dt>Completeness Invariant</dt>
        <dd>The property that every logged ATTEMPT has exactly one 
        corresponding Outcome. This invariant is checked by verifiers 
        at the application level; it is not enforced by Transparency 
        Services.</dd>

        <dt>Prompt Hash</dt>
        <dd>A cryptographic hash of the generation request content. 
        Enables verification that a specific request was processed 
        without storing the potentially harmful prompt text.</dd>
      </dl>
      
      <t>This document focuses on refusal events because successful 
      generation is already observable through content existence and 
      downstream provenance mechanisms (e.g., C2PA manifests, watermarks). 
      Refusal events, by contrast, are negative events that leave no 
      external artifact unless explicitly logged. The GENERATE and ERROR 
      outcomes are defined for completeness invariant verification but 
      are not the primary focus of this specification. This document 
      does not attempt to standardize generation provenance; it focuses 
      solely on refusal events as a complementary profile.</t>

      <section anchor="scitt-mapping">
        <name>Mapping to SCITT Primitives</name>
        <t>This profile maps refusal event concepts directly to SCITT 
        primitives, minimizing new terminology:</t>
        
        <table>
          <thead>
            <tr>
              <th>This Document</th>
              <th>SCITT Primitive</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td>ATTEMPT</td>
              <td>Signed Statement</td>
            </tr>
            <tr>
              <td>DENY / GENERATE / ERROR</td>
              <td>Signed Statement</td>
            </tr>
            <tr>
              <td>AI System</td>
              <td>Issuer</td>
            </tr>
            <tr>
              <td>Inclusion Proof</td>
              <td>Receipt</td>
            </tr>
          </tbody>
        </table>
        
        <t>Refusal events are registered with a standard SCITT 
        Transparency Service; this document does not define a separate 
        log type.</t>
      </section>
    </section>

    <!-- ===== Section 3: Use Cases ===== -->
    <section anchor="use-cases">
      <name>Use Cases</name>
      
      <section anchor="use-case-audit">
        <name>Regulatory Audit</name>
        <t>A regulatory authority investigating AI system compliance 
        needs to verify that a provider's stated content policies are 
        actually enforced. Without verifiable refusal events, the 
        regulator must trust provider self-reports. With this mechanism, 
        regulators can request Receipts for refusal events within a time 
        range, verify ATTEMPT/Outcome completeness for logged events, 
        and confirm refusal decisions are anchored in an append-only log.</t>
      </section>

      <section anchor="use-case-investigation">
        <name>Incident Investigation</name>
        <t>When investigating whether an AI system refused a specific 
        request, investigators need to establish provenance. A Verifiable 
        Refusal Record (ATTEMPT + DENY + Receipts) demonstrates that a 
        specific request was received, classified as policy-violating, 
        refused, and the refusal was logged.</t>
      </section>

      <section anchor="use-case-accountability">
        <name>Provider Accountability</name>
        <t>AI service providers may need to demonstrate to stakeholders 
        that safety mechanisms function as claimed. Verifiable refusal 
        events enable statistical reporting on logged refusal rates, 
        third-party verification of safety claims, and auditable proof 
        that specific requests were refused.</t>
      </section>

      <section anchor="use-case-legal">
        <name>Legal Proceedings</name>
        <t>In legal proceedings concerning AI-generated content, parties 
        may need evidence that a system declined a request. Verifiable 
        Refusal Records provide such evidence, subject to the limitation 
        that they demonstrate logged refusals, not the absence of 
        unlogged generation.</t>
      </section>
    </section>

    <!-- ===== Section 4: Requirements ===== -->
    <section anchor="requirements">
      <name>Requirements</name>
      
      <t>This section defines requirements for implementations. To 
      maximize interoperability while allowing implementation flexibility, 
      only the core completeness invariant uses MUST; other requirements 
      use SHOULD or MAY.</t>
      
      <section anchor="req-completeness">
        <name>Completeness Invariant</name>
        <t>The completeness invariant is the central requirement of 
        this profile:</t>
        <ul>
          <li>Every logged ATTEMPT Signed Statement MUST have exactly 
          one corresponding Outcome Signed Statement (DENY, GENERATE, 
          or ERROR).</li>
          <li>Outcome Signed Statements MUST reference their 
          corresponding ATTEMPT via the AttemptID field.</li>
          <li>Prompt content MUST NOT be stored in cleartext in 
          Signed Statements; implementations MUST use cryptographic 
          hashes (PromptHash) instead.</li>
        </ul>
        <t>Verifiers SHOULD flag any logged ATTEMPT without a 
        corresponding Outcome as potential evidence of incomplete 
        logging or system failure.</t>
        
        <t>This completeness invariant is defined at the event semantics 
        level and applies only to logged events. It cannot detect ATTEMPT 
        events that were never logged. Cryptographic detection of 
        invariant violations depends on the properties of the underlying 
        Transparency Service and verifier logic; it is discussed further 
        in <xref target="security"/>.</t>
        
        <t>This profile does not require Transparency Services to 
        enforce completeness invariants; such checks are performed by 
        verifiers using application-level logic.</t>
      </section>

      <section anchor="req-integrity">
        <name>Integrity</name>
        <t>To protect against tampering, implementations SHOULD:</t>
        <ul>
          <li>Include a cryptographic hash of event content in each 
          Signed Statement</li>
          <li>Digitally sign all Signed Statements</li>
          <li>Chain events via PrevHash fields to detect deletion or 
          reordering</li>
          <li>Register Signed Statements with a SCITT Transparency 
          Service and obtain Receipts</li>
        </ul>
        <t>PrevHash chaining is RECOMMENDED but not required because 
        append-only guarantees are primarily provided by the 
        Transparency Service. PrevHash provides an additional, 
        issuer-local integrity signal that can detect tampering even 
        before Transparency Service registration.</t>
        <t>SHA-256 for hashing and Ed25519 for signatures are 
        RECOMMENDED. Other algorithms registered with COSE MAY be 
        used.</t>
      </section>

      <section anchor="req-privacy">
        <name>Privacy</name>
        <t>Refusal events may be triggered by harmful or sensitive 
        content. To avoid the audit log becoming a repository of 
        harmful content, implementations SHOULD:</t>
        <ul>
          <li>Replace prompt text with PromptHash</li>
          <li>Replace reference images with cryptographic hashes</li>
          <li>Ensure refusal reasons do not quote or describe prompt 
          content in detail</li>
          <li>Pseudonymize actor identifiers where appropriate</li>
        </ul>
        <t>The hash function SHOULD be collision-resistant to prevent 
        an adversary from claiming a benign prompt hashes to the 
        same value as a harmful one.</t>
        
        <t>Hashing without salting may be vulnerable to dictionary 
        attacks if an adversary has a list of candidate prompts. 
        Mitigations include access controls on event queries, 
        time-limited retention policies, and monitoring for bulk 
        query patterns. Salting may provide additional protection 
        but introduces complexity; if used, implementations must 
        ensure verification remains possible without requiring 
        disclosure of the salt to third-party verifiers.</t>
      </section>

      <section anchor="req-verifiability">
        <name>Third-Party Verifiability</name>
        <t>To enable external verification without access to internal 
        systems, implementations SHOULD:</t>
        <ul>
          <li>Ensure verification requires only the Signed Statement 
          and Receipt</li>
          <li>Publish Issuer public signing keys or certificates</li>
          <li>Make Transparency Service logs queryable by authorized 
          auditors</li>
          <li>Support offline verification given the necessary 
          cryptographic material</li>
        </ul>
      </section>

      <section anchor="req-timeliness">
        <name>Timeliness</name>
        <t>To maintain audit trail integrity, implementations SHOULD:</t>
        <ul>
          <li>Create ATTEMPT Signed Statements promptly upon request 
          receipt (within seconds)</li>
          <li>Create Outcome Signed Statements promptly upon decision 
          (within seconds for automated decisions)</li>
          <li>Register Signed Statements with the Transparency Service 
          within a reasonable window (e.g., 60 seconds)</li>
        </ul>
        <t>Some operational scenarios may require delayed outcomes:</t>
        <ul>
          <li>Human review processes may take minutes, hours, or days</li>
          <li>System crashes may delay outcome logging until recovery</li>
          <li>Network failures may delay Transparency Service registration</li>
        </ul>
        <t>Implementations SHOULD document expected latency bounds in 
        their Registration Policy. Extended delays SHOULD trigger 
        monitoring alerts.</t>
      </section>

      <section anchor="req-conformance">
        <name>Conformance</name>
        <t>An implementation conforms to this specification if it 
        satisfies the following requirements:</t>
        <ul>
          <li>MUST: Every logged ATTEMPT has exactly one Outcome</li>
          <li>MUST: Outcomes reference ATTEMPTs via AttemptID</li>
          <li>MUST: Prompt content is hashed, not stored in cleartext</li>
          <li>MUST: Signed Statements are encoded as COSE_Sign1</li>
        </ul>
        <t>All other requirements (SHOULD, RECOMMENDED, MAY) are 
        guidance for interoperability and security best practices 
        but are not required for conformance.</t>
        <t>Implementations MAY extend the data model with additional 
        fields provided the core conformance requirements are satisfied.</t>
      </section>
    </section>

    <!-- ===== Section 5: Data Model ===== -->
    <section anchor="data-model">
      <name>Data Model</name>
      
      <t>This section defines the data model for ATTEMPT and DENY 
      Signed Statements. These are encoded as JSON payloads. This 
      data model is non-normative; implementations MAY extend or 
      modify these structures provided the conformance requirements 
      in <xref target="req-conformance"/> are satisfied.</t>

      <section anchor="data-attempt">
        <name>ATTEMPT Signed Statement Payload</name>
        <t>An ATTEMPT records that a generation request was received:</t>
        <sourcecode type="json"><![CDATA[
{
  "eventType": "ATTEMPT",
  "eventId": "019467a1-0001-7000-0000-000000000001",
  "timestamp": "2026-01-10T14:23:45.100Z",
  "issuer": "urn:example:ai-service:img-gen-prod",
  "promptHash": "sha256:7f83b1657ff1fc53b92dc18148a1d65d...",
  "inputType": "text+image",
  "referenceInputHashes": [
    "sha256:9f86d081884c7d659a2feaa0c55ad015..."
  ],
  "sessionId": "019467a1-0001-7000-0000-000000000000",
  "actorHash": "sha256:e3b0c44298fc1c149afbf4c8996fb924...",
  "modelId": "img-gen-v4.2.1",
  "policyId": "content-safety-v2",
  "prevHash": "sha256:0000000000000000000000000000000...",
  "eventHash": "sha256:a1b2c3d4e5f6a1b2c3d4e5f6a1b2c3d4..."
}
]]></sourcecode>
        
        <t>Field definitions:</t>
        <dl>
          <dt>eventType</dt>
          <dd>"ATTEMPT"</dd>
          
          <dt>eventId</dt>
          <dd>Unique identifier (UUID v7 <xref target="RFC9562"/> 
          RECOMMENDED for temporal ordering)</dd>
          
          <dt>timestamp</dt>
          <dd>ISO 8601 timestamp of request receipt</dd>
          
          <dt>issuer</dt>
          <dd>URN identifying the AI system</dd>
          
          <dt>promptHash</dt>
          <dd>Hash of the textual prompt (if any)</dd>
          
          <dt>inputType</dt>
          <dd>Type of input: "text", "image", "text+image", "audio", 
          "video"</dd>
          
          <dt>referenceInputHashes</dt>
          <dd>Array of hashes for non-text inputs</dd>
          
          <dt>sessionId</dt>
          <dd>Session identifier for correlation</dd>
          
          <dt>actorHash</dt>
          <dd>Pseudonymized hash of the requesting user/system</dd>
          
          <dt>modelId</dt>
          <dd>Identifier and version of the AI model</dd>
          
          <dt>policyId</dt>
          <dd>Identifier of the content policy applied</dd>
          
          <dt>prevHash</dt>
          <dd>Hash of the previous event (for chain integrity)</dd>
          
          <dt>eventHash</dt>
          <dd>Hash of this event's canonical form</dd>
        </dl>
      </section>

      <section anchor="data-deny">
        <name>DENY Signed Statement Payload</name>
        <t>A DENY records that a request was refused:</t>
        <sourcecode type="json"><![CDATA[
{
  "eventType": "DENY",
  "eventId": "019467a1-0001-7000-0000-000000000002",
  "timestamp": "2026-01-10T14:23:45.150Z",
  "issuer": "urn:example:ai-service:img-gen-prod",
  "attemptId": "019467a1-0001-7000-0000-000000000001",
  "riskCategory": "POLICY_VIOLATION",
  "riskScore": 0.94,
  "refusalReason": "Content policy: prohibited category",
  "modelDecision": "DENY",
  "humanOverride": false,
  "prevHash": "sha256:a1b2c3d4e5f6a1b2c3d4e5f6a1b2c3d4...",
  "eventHash": "sha256:e5f6g7h8i9j0e5f6g7h8i9j0e5f6g7h8..."
}
]]></sourcecode>
        
        <t>Field definitions:</t>
        <dl>
          <dt>eventType</dt>
          <dd>"DENY"</dd>
          
          <dt>eventId</dt>
          <dd>Unique identifier</dd>
          
          <dt>timestamp</dt>
          <dd>ISO 8601 timestamp of refusal decision</dd>
          
          <dt>attemptId</dt>
          <dd>Reference to the corresponding ATTEMPT (required for 
          completeness invariant)</dd>
          
          <dt>riskCategory</dt>
          <dd>Category of policy violation detected. Values are 
          implementation-defined and not standardized by this 
          specification (examples: "POLICY_VIOLATION", "LEGAL_RISK", 
          "SAFETY_CONCERN").</dd>
          
          <dt>riskScore</dt>
          <dd>Confidence score (0.0 to 1.0) if available. Scoring 
          methodology is implementation-defined.</dd>
          
          <dt>refusalReason</dt>
          <dd>Human-readable reason (SHOULD NOT contain prompt content). 
          The taxonomy of reasons is implementation-defined.</dd>
          
          <dt>modelDecision</dt>
          <dd>The action taken: "DENY", "WARN", "ESCALATE"</dd>
          
          <dt>humanOverride</dt>
          <dd>Boolean indicating if a human reviewer was involved</dd>
          
          <dt>prevHash</dt>
          <dd>Hash of the previous event</dd>
          
          <dt>eventHash</dt>
          <dd>Hash of this event's canonical form</dd>
        </dl>
        
        <t>This specification does not standardize content moderation 
        categories, risk taxonomies, or refusal reason formats. These 
        are policy decisions that remain the domain of AI providers 
        and applicable regulations.</t>
      </section>

      <section anchor="data-error">
        <name>ERROR Signed Statement Payload</name>
        <t>An ERROR records that a request failed due to system issues:</t>
        <sourcecode type="json"><![CDATA[
{
  "eventType": "ERROR",
  "eventId": "019467a1-0001-7000-0000-000000000003",
  "timestamp": "2026-01-10T14:23:45.200Z",
  "issuer": "urn:example:ai-service:img-gen-prod",
  "attemptId": "019467a1-0001-7000-0000-000000000001",
  "errorCode": "TIMEOUT",
  "errorMessage": "Model inference timeout after 30s",
  "prevHash": "sha256:e5f6g7h8i9j0e5f6g7h8i9j0e5f6g7h8...",
  "eventHash": "sha256:h8i9j0k1l2m3h8i9j0k1l2m3h8i9j0k1..."
}
]]></sourcecode>
        
        <t>ERROR events indicate system failures, not policy decisions. 
        A high ERROR rate may indicate operational issues or potential 
        abuse (e.g., adversarial inputs designed to crash the system). 
        Implementations SHOULD monitor ERROR rates and investigate 
        anomalies.</t>
      </section>
    </section>

    <!-- ===== Section 6: SCITT Integration ===== -->
    <section anchor="scitt-integration">
      <name>SCITT Integration</name>

      <section anchor="scitt-signed-statements">
        <name>Encoding as Signed Statements</name>
        <t>ATTEMPT, DENY, GENERATE, and ERROR events are encoded as 
        SCITT Signed Statements:</t>
        <ul>
          <li>The event JSON is the Signed Statement payload</li>
          <li>The Issuer is the AI system's signing identity</li>
          <li>The Content Type MAY use an implementation-defined media 
          type (example: "application/vnd.scitt.refusal-event+json")</li>
          <li>The Signed Statement is wrapped in COSE_Sign1 per 
          <xref target="RFC9052"/></li>
        </ul>
        <t>The JSON payload is canonicalized per <xref target="RFC8785"/> 
        and signed as the COSE_Sign1 payload bytes. This ensures 
        deterministic serialization for signature verification.</t>
      </section>

      <section anchor="scitt-registration">
        <name>Registration</name>
        <t>After creating a Signed Statement, the Issuer SHOULD register 
        it with a SCITT Transparency Service:</t>
        <ol>
          <li>Submit the Signed Statement via SCRAPI 
          <xref target="I-D.ietf-scitt-scrapi"/></li>
          <li>Receive a Receipt proving inclusion</li>
          <li>Store the Receipt for future verification requests</li>
        </ol>
        <t>The Transparency Service's Registration Policy MAY verify 
        that required fields are present and timestamps are within 
        acceptable bounds.</t>
        
        <t>Registration may fail due to network issues, service 
        unavailability, or policy rejection. Implementations SHOULD 
        implement retry logic with exponential backoff. Persistent 
        registration failures SHOULD be logged locally and trigger 
        operational alerts.</t>
      </section>

      <section anchor="scitt-registration-policy">
        <name>Registration Policy Guidance (Non-Normative)</name>
        <t>A Transparency Service operating as a refusal event log 
        MAY implement a Registration Policy that validates:</t>
        <ul>
          <li>Signature validity (COSE_Sign1 verification)</li>
          <li>Required fields present (eventType, eventId, timestamp, 
          issuer)</li>
          <li>Timestamp sanity (not in the future, not unreasonably 
          old)</li>
          <li>Issuer authorization (if the TS restricts which issuers 
          may register)</li>
        </ul>
        <t>This profile does not require Transparency Services to 
        enforce completeness invariants. A TS accepting refusal events 
        is not expected to verify that every ATTEMPT has an Outcome; 
        such verification is performed by auditors and verifiers at 
        the application level.</t>
      </section>

      <section anchor="scitt-verification">
        <name>Verification with Receipts</name>
        <t>A complete Verifiable Refusal Record consists of:</t>
        <ol>
          <li>The ATTEMPT Signed Statement and its Receipt</li>
          <li>The corresponding DENY Signed Statement and its Receipt</li>
          <li>Verification that attemptId in DENY matches eventId in 
          ATTEMPT</li>
        </ol>
        <t>Verifiers can confirm that a refusal was logged by 
        validating both Receipts and checking the ATTEMPT/DENY linkage. 
        This demonstrates that the refusal decision was recorded in 
        the Transparency Service, but does not prove that no unlogged 
        generation occurred.</t>
      </section>

      <section anchor="scitt-anchoring">
        <name>External Anchoring (Non-Normative)</name>
        <t>For additional assurance, implementations MAY periodically 
        anchor Merkle tree roots to external systems such as public 
        blockchains, multiple independent Transparency Services, or 
        regulatory authority registries. External anchoring provides 
        defense against a compromised Transparency Service.</t>
      </section>
    </section>

    <!-- ===== Section 7: IANA Considerations ===== -->
    <section anchor="iana">
      <name>IANA Considerations</name>
      
      <t>This document has no IANA actions at this time.</t>
      
      <t>Future revisions may request registration of media types 
      (e.g., "application/vnd.scitt.refusal-event+json") or establish 
      registries for standardized event type values.</t>
    </section>

    <!-- ===== Section 8: Security Considerations ===== -->
    <section anchor="security">
      <name>Security Considerations</name>

      <section anchor="security-threat-model">
        <name>Threat Model</name>
        <t>This specification assumes the following threat model:</t>
        <ul>
          <li>The AI system (Issuer) is partially trusted: it is 
          expected to log events but may have bugs or be compromised</li>
          <li>The Transparency Service is partially trusted: it 
          provides append-only guarantees but may be compromised or 
          present split views</li>
          <li>Verifiers are trusted to perform completeness checks 
          correctly</li>
          <li>External parties (regulators, auditors) have access to 
          Receipts and can query the Transparency Service</li>
        </ul>
        <t>This specification does NOT protect against:</t>
        <ul>
          <li>An AI system that bypasses logging entirely (no ATTEMPT 
          logged)</li>
          <li>Collusion between the Issuer and Transparency Service</li>
          <li>Compromise of all verifiers</li>
        </ul>
      </section>

      <section anchor="security-omission">
        <name>Omission Attacks</name>
        <t>An adversary controlling the AI system might attempt to omit 
        refusal events to hide policy violations or, conversely, omit 
        GENERATE events to falsely claim content was refused. The 
        completeness invariant provides detection for logged events: 
        auditors can identify ATTEMPT Signed Statements without 
        corresponding Outcomes. Hash chains detect deletion of 
        intermediate events.</t>
        
        <t>However, if an ATTEMPT is never logged, this specification 
        cannot detect the omission. Complete prevention of omission 
        attacks is beyond the scope of this specification and would 
        require external enforcement mechanisms such as trusted 
        execution environments, RATS attestation, or real-time 
        external monitoring.</t>
      </section>

      <section anchor="security-equivocation">
        <name>Log Equivocation</name>
        <t>A malicious Transparency Service might present different 
        views of the log to different parties (equivocation). For 
        example, it might show auditors a log containing DENY events 
        while providing a different view to other verifiers. Mitigations 
        include:</t>
        <ul>
          <li>Gossiping of Signed Tree Heads between verifiers to 
          detect inconsistencies</li>
          <li>Registration with multiple independent Transparency 
          Services</li>
          <li>External anchoring to public ledgers that provide global 
          consistency</li>
          <li>Auditor comparison of Receipts for the same time periods</li>
        </ul>
        <t>Detection of equivocation requires coordination between 
        verifiers; a single verifier in isolation cannot detect it.</t>
      </section>

      <section anchor="security-split-view">
        <name>Split-View Between Event Types</name>
        <t>A malicious Issuer might maintain separate logs for refusals 
        and generations, showing only the refusal log to auditors. The 
        completeness invariant mitigates this by requiring every logged 
        ATTEMPT to have an Outcome; if the GENERATE outcomes are hidden, 
        auditors will observe orphaned ATTEMPTs.</t>
      </section>

      <section anchor="security-tampering">
        <name>Log Tampering</name>
        <t>Direct modification of log entries is prevented by 
        cryptographic signatures on Signed Statements, hash chain 
        linking, Merkle tree inclusion proofs in Receipts, and the 
        append-only structure enforced by the Transparency Service.</t>
      </section>

      <section anchor="security-replay">
        <name>Replay Attacks</name>
        <t>An attacker might attempt to replay old refusal events to 
        inflate refusal statistics or create false alibis. UUID v7 
        provides temporal ordering, timestamps are verified against 
        Transparency Service registration time, and hash chain sequence 
        numbers detect out-of-order insertion.</t>
      </section>

      <section anchor="security-key-compromise">
        <name>Key Compromise</name>
        <t>If an Issuer's signing key is compromised, an attacker could 
        create fraudulent Signed Statements. Previously signed Signed 
        Statements remain valid. Implementations SHOULD support key 
        rotation and revocation. Transparency Service timestamps 
        provide evidence of when Signed Statements were registered, 
        which can help bound the impact of a compromise.</t>
      </section>

      <section anchor="security-dictionary">
        <name>Prompt Dictionary Attacks</name>
        <t>Although prompts are stored as hashes, an adversary with a 
        dictionary of known prompts could attempt to identify which 
        prompt was used by computing hashes and comparing. Mitigations 
        include access controls on event queries, time-limited retention 
        policies, monitoring for bulk query patterns, and rate limiting.</t>
        
        <t>Salted hashing may provide additional protection but 
        introduces operational complexity. If salting is used, the 
        salt must be managed such that verification remains possible 
        without disclosing the salt to third parties. This specification 
        does not mandate salting.</t>
      </section>

      <section anchor="security-dos">
        <name>Denial of Service</name>
        <t>An attacker could flood the system with generation requests 
        to create a large volume of ATTEMPT Signed Statements, 
        potentially overwhelming the Transparency Service or obscuring 
        legitimate events. Standard rate limiting and access controls 
        at the AI system level can mitigate this. The Transparency 
        Service MAY implement its own admission controls.</t>
      </section>
    </section>

    <!-- ===== Section 9: Privacy Considerations ===== -->
    <section anchor="privacy">
      <name>Privacy Considerations</name>

      <section anchor="privacy-harmful">
        <name>Harmful Content Storage</name>
        <t>This profile requires that harmful content not be stored. 
        Prompt text is replaced with PromptHash, reference images are 
        replaced with hashes, and refusal reasons SHOULD NOT quote or 
        describe prompt content in detail. This prevents the audit log 
        from becoming a repository of harmful content.</t>
      </section>

      <section anchor="privacy-actor">
        <name>Actor Identification</name>
        <t>Actor identification creates tension between accountability 
        and privacy. Implementations SHOULD use pseudonymous identifiers 
        (ActorHash) by default, maintain a separate access-controlled 
        mapping from pseudonyms to identities, define clear policies 
        for de-pseudonymization, and support erasure of the mapping 
        while preserving audit integrity (crypto-shredding).</t>
      </section>

      <section anchor="privacy-correlation">
        <name>Correlation Risks</name>
        <t>Event metadata may enable correlation attacks. Timestamps 
        could reveal user activity patterns, SessionIDs link multiple 
        requests, and ModelIDs reveal which AI systems a user interacts 
        with. Implementations SHOULD apply appropriate access controls 
        and MAY implement differential privacy techniques for aggregate 
        statistics.</t>
      </section>

      <section anchor="privacy-gdpr">
        <name>Data Subject Rights</name>
        <t>Where personal data protection regulations apply (e.g., 
        GDPR), implementations SHOULD support data subject access 
        requests, erasure requests via crypto-shredding (destroying 
        encryption keys for personal data while preserving cryptographic 
        integrity proofs), and purpose limitation.</t>
      </section>
    </section>

    <!-- ===== Section 10: Future Work (Non-Normative) ===== -->
    <section anchor="future-work">
      <name>Future Work (Non-Normative)</name>
      
      <t>This section describes potential extensions and research 
      directions that are outside the scope of this specification 
      but may be addressed in future work.</t>
      
      <section anchor="future-attestation">
        <name>RATS/Attestation Integration</name>
        <t>Integration with Remote ATtestation procedureS (RATS) 
        <xref target="RFC9334"/> could provide stronger guarantees 
        that the AI system is operating as expected and logging all 
        events. Hardware-backed attestation could reduce the trust 
        assumptions on the Issuer.</t>
      </section>

      <section anchor="future-batching">
        <name>Batching and Scalability</name>
        <t>High-volume AI systems may generate millions of events per 
        day. Future work could explore batching mechanisms, rolling 
        logs, and hierarchical Merkle structures to improve scalability 
        while maintaining verifiability.</t>
      </section>

      <section anchor="future-privacy">
        <name>Advanced Privacy Mechanisms</name>
        <t>More sophisticated privacy mechanisms could be explored, 
        including:</t>
        <ul>
          <li>Commitment schemes that allow selective disclosure</li>
          <li>Zero-knowledge proofs for aggregate statistics without 
          revealing individual events</li>
          <li>Homomorphic encryption for privacy-preserving audits</li>
        </ul>
        <t>These mechanisms would add complexity and are not required 
        for the core auditability goals of this specification.</t>
      </section>

      <section anchor="future-completeness">
        <name>External Completeness Enforcement</name>
        <t>Stronger completeness guarantees could be achieved through 
        external enforcement mechanisms such as:</t>
        <ul>
          <li>Trusted execution environments (TEEs) that guarantee 
          logging before generation</li>
          <li>Hardware security modules (HSMs) that control signing 
          keys</li>
          <li>Real-time monitoring by independent observers</li>
          <li>Blockchain-based commitment schemes</li>
        </ul>
        <t>These approaches involve significant architectural changes 
        and are outside the scope of this specification.</t>
      </section>
    </section>
  </middle>

  <back>
    <!-- ===== References ===== -->
    <references>
      <name>References</name>
      
      <references>
        <name>Normative References</name>
        
        <reference anchor="RFC2119">
          <front>
            <title>Key words for use in RFCs to Indicate Requirement Levels</title>
            <author fullname="S. Bradner"/>
            <date month="March" year="1997"/>
          </front>
          <seriesInfo name="BCP" value="14"/>
          <seriesInfo name="RFC" value="2119"/>
        </reference>
        
        <reference anchor="RFC8174">
          <front>
            <title>Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words</title>
            <author fullname="B. Leiba"/>
            <date month="May" year="2017"/>
          </front>
          <seriesInfo name="BCP" value="14"/>
          <seriesInfo name="RFC" value="8174"/>
        </reference>
        
        <reference anchor="RFC8785">
          <front>
            <title>JSON Canonicalization Scheme (JCS)</title>
            <author fullname="A. Rundgren"/>
            <date month="June" year="2020"/>
          </front>
          <seriesInfo name="RFC" value="8785"/>
        </reference>
        
        <reference anchor="RFC9052">
          <front>
            <title>CBOR Object Signing and Encryption (COSE): Structures and Process</title>
            <author fullname="J. Schaad"/>
            <date month="August" year="2022"/>
          </front>
          <seriesInfo name="RFC" value="9052"/>
        </reference>
        
        <reference anchor="I-D.ietf-scitt-architecture">
          <front>
            <title>An Architecture for Trustworthy and Transparent Digital Supply Chains</title>
            <author fullname="Henk Birkholz" initials="H." surname="Birkholz"/>
            <author fullname="Antoine Delignat-Lavaud" initials="A." surname="Delignat-Lavaud"/>
            <author fullname="Cedric Fournet" initials="C." surname="Fournet"/>
            <date year="2025"/>
          </front>
          <seriesInfo name="Internet-Draft" value="draft-ietf-scitt-architecture"/>
        </reference>
        
        <reference anchor="I-D.ietf-scitt-scrapi">
          <front>
            <title>SCITT Reference APIs</title>
            <author fullname="Orie Steele" initials="O." surname="Steele"/>
            <date year="2025"/>
          </front>
          <seriesInfo name="Internet-Draft" value="draft-ietf-scitt-scrapi"/>
        </reference>
      </references>
      
      <references>
        <name>Informative References</name>
        
        <reference anchor="RFC6962">
          <front>
            <title>Certificate Transparency</title>
            <author fullname="B. Laurie"/>
            <date month="June" year="2013"/>
          </front>
          <seriesInfo name="RFC" value="6962"/>
        </reference>
        
        <reference anchor="RFC9334">
          <front>
            <title>Remote ATtestation procedureS (RATS) Architecture</title>
            <author fullname="H. Birkholz"/>
            <date month="January" year="2023"/>
          </front>
          <seriesInfo name="RFC" value="9334"/>
        </reference>
        
        <reference anchor="RFC9562">
          <front>
            <title>Universally Unique IDentifiers (UUIDs)</title>
            <author fullname="K. Davis"/>
            <date month="May" year="2024"/>
          </front>
          <seriesInfo name="RFC" value="9562"/>
        </reference>
        
        <reference anchor="CAP-SRP">
          <front>
            <title>Content AI Profile - Safe Refusal Provenance Extension</title>
            <author>
              <organization>VeritasChain Standards Organization</organization>
            </author>
            <date year="2026"/>
          </front>
          <refcontent>https://github.com/veritaschain/cap-safe-refusal-provenance</refcontent>
        </reference>
      </references>
    </references>

    <!-- ===== Appendix A: Complete Example Flow ===== -->
    <section anchor="appendix-flow">
      <name>Example: Complete Refusal Event Flow</name>
      
      <t>This appendix illustrates a complete flow from request 
      receipt to Verifiable Refusal Record verification.</t>
      
      <section anchor="appendix-flow-steps">
        <name>Event Sequence</name>
        <ol>
          <li>User submits generation request to AI system</li>
          <li>AI system creates ATTEMPT Signed Statement (computes 
          PromptHash = SHA256(prompt), generates UUID v7 EventId, 
          signs as COSE_Sign1)</li>
          <li>AI system registers ATTEMPT with Transparency Service</li>
          <li>Transparency Service returns Receipt_ATTEMPT</li>
          <li>AI system evaluates request against content policy</li>
          <li>Policy classifier determines refusal is required</li>
          <li>AI system creates DENY Signed Statement (sets 
          AttemptId = ATTEMPT.EventId, records RiskCategory and 
          RefusalReason, signs as COSE_Sign1)</li>
          <li>AI system registers DENY with Transparency Service</li>
          <li>Transparency Service returns Receipt_DENY</li>
          <li>User receives refusal response</li>
        </ol>
      </section>
      
      <section anchor="appendix-flow-verification">
        <name>Third-Party Verification</name>
        <t>An auditor verifying the Verifiable Refusal Record:</t>
        <ol>
          <li>Obtains ATTEMPT Signed Statement and Receipt_ATTEMPT</li>
          <li>Obtains DENY Signed Statement and Receipt_DENY</li>
          <li>Verifies Issuer signature on both Signed Statements</li>
          <li>Verifies both Receipts against Transparency Service 
          public key</li>
          <li>Confirms DENY.AttemptId equals ATTEMPT.EventId</li>
          <li>Confirms DENY.Timestamp is after ATTEMPT.Timestamp</li>
          <li>Concludes: The request identified by ATTEMPT.PromptHash 
          was refused and the refusal was logged at DENY.Timestamp</li>
        </ol>
        <t>This verification confirms that a refusal was logged, but 
        does not prove that no unlogged generation occurred.</t>
      </section>
    </section>

    <!-- ===== Acknowledgements ===== -->
    <section anchor="acknowledgements" numbered="false">
      <name>Acknowledgements</name>
      <t>The authors thank the members of the SCITT Working Group 
      for developing the foundational architecture. This work builds 
      upon the transparency log concepts from Certificate Transparency 
      <xref target="RFC6962"/>.</t>
    </section>
  </back>
</rfc>
