<?xml version="1.0" encoding="UTF-8"?>
  <?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
  <!-- generated by https://github.com/cabo/kramdown-rfc version 1.7.29 (Ruby 3.4.5) -->


<!DOCTYPE rfc  [
  <!ENTITY nbsp    "&#160;">
  <!ENTITY zwsp   "&#8203;">
  <!ENTITY nbhy   "&#8209;">
  <!ENTITY wj     "&#8288;">

]>


<rfc ipr="trust200902" docName="draft-rswg-rfc7997bis-04" category="info" submissionType="editorial" obsoletes="7997" updates="7322" tocInclude="true" sortRefs="true" symRefs="true">
  <front>
    <title abbrev="Text in RFCs">Text in RFCs</title>

    <author initials="P." surname="Hoffman" fullname="Paul Hoffman">
      <organization></organization>
      <address>
        <email>paul.hoffman@icann.org</email>
      </address>
    </author>

    <date year="2025" month="September" day="13"/>

    
    
    <keyword>Internet-Draft</keyword>

    <abstract>


<?line 50?>

<t>The early policy for the RFC Series was that RFCs could only contain characters from the ASCII character set.
Later policy, from RFC 7997, allowed more characters and enforced an encoding for RFCs of UTF-8.
Since RFC 7997 was published, the IETF community has had much more experience of using non-ASCII characters in RFCs.</t>

<t>The policy for the RFC Series is that all displayable text is allowed as long as the reader of an RFC can interpret that text.
This policy does not change language policy of the RFC Series, namely that English is the required language for the series.</t>

<t>This document obsoletes RFC 7997 and updates the RFC Style Guide (RFC 7322).</t>

<t>[[ A repository for this draft can be found <eref target="https://github.com/paulehoffman/7997bis">here</eref>. ]]</t>



    </abstract>



  </front>

  <middle>


<?line 63?>

<section anchor="introduction"><name>Introduction</name>

<t>This document sets policy for the inclusion of characters in the definitive versions and publication formats of RFCs.
It also reaffirms the policy that the encoding format for the RFC Series is UTF-8, <xref target="STD63"/>.
This document obsoletes <xref target="RFC7997"/> and updates the RFC Style Guide <xref target="RFC7322"/>.
This document makes substantial changes to the policies in <xref target="RFC7997"/> based on the positive experience since its publication.</t>

<t>The RFC Publication Center (RPC) is responsible for implementing the policies in this document, as described in <xref target="RFC9720"/>.</t>

<section anchor="terminology"><name>Terminology</name>

<t>The term "non-ASCII characters" means characters outside the set that was defined in ASCII.
ASCII is described in <xref target="RFC20"/>.</t>

<t>The term "Unicode characters" means characters define in <xref target="UnicodeCurrent"/>.</t>

<t>More terminology about characters and encoding formats can be found in <xref target="RFC6365"/>.</t>

</section>
</section>
<section anchor="basic_requirements"><name>Basic Requirements for Text in RFCs</name>

<t>RFCs should be displayed correctly across a wide range of readers and browsers.
People whose systems do not have the fonts needed to display part of a particular RFC still need to be able to read the definitive versions and publication formats correctly in order to understand and implement the information described in the document.</t>

<t>As stated in the RFC Style Guide <xref target="RFC7322"/>, the language of the RFC Series is English.</t>

<t>Searches whose results might include RFCs should return accurate results and support appropriate Unicode string matching behaviors.</t>

</section>
<section anchor="policy-for-text-in-rfcs"><name>Policy for Text in RFCs</name>

<t>The policy for the RFC Series is that all displayable text is allowed as long as the reader of an RFC can interpret that text.</t>

<t>There are many Unicode characters that obviously cannot be displayed (such as control characters), and many whose ability to be displayed is debatable.
If an RFC includes such characters in normative or descriptive text, the RFC needs to also clearly describe the character.</t>

<t>The preferred method for describing such characters is using the "U+NNNN" syntax from <xref target="BCP137"/>.
<xref target="BCP137"/> describes the pros and cons of different options for identifying Unicode characters and may help authors decide how to represent the non-ASCII characters in their documents.</t>

<t>Note that this policy only applies to normative or descriptive text; text such as names does not need character description.
Further, some RFC authors might choose to use something other than the "U+NNNN" syntax to describing characters, such as if the RFC already covers a different syntax that the reader will understand from the rest of the RFC.</t>

<t>Characters in an RFC will generally appear in Normalization Form C (NFC) as defined in <xref target="UnicodeNorm"/>.
If the RFC would be more correct and more understandable with particular characters not in NFC, the RPC can use unnormalized text.
In such a case, a text note should be included to describe why unnormalized text was used.</t>

<section anchor="names"><name>Names</name>

<t>Authors of RFCs whose names include non-ASCII characters will likely have preferences for how their names are displayed based on their lived experiences.
These authors can give their names using only ASCII characters, or as Unicode characters and an ASCII interpretation of their name.
The RPC policy should be that authors' preferences for display of their names be honored.</t>

<t>Company names and geographic names generally do not need ASCII interpretations, but they can be included at the discretion of the author and the RPC.</t>

</section>
<section anchor="examples"><name>Examples</name>

<t>Where the use of non-ASCII characters is purely part of an example and not otherwise required for correct protocol operation, giving the Unicode equivalent of the non-ASCII characters is not required, but it can improve the readability of the RFC.
For example, for text that might just say "The value can be followed by a monetary symbol such as ¥ or €", it is likely more beneficial to the reader to instead say "The value can be followed by a monetary symbol such as ¥ (U+00A5) or € (U+20AC)".</t>

<t>RFCs are often displayed on systems that use only black and white, particularly when printed.
Because of this, examples should generally use characters that do not specify a color.
However, some examples might require text with color due to the nature of the examples.
If so, those examples need to also include the "U+NNNN" syntax.
For example, "A color display should be able to differentiate 🔴 (U+1F534), 🟢 (U+1F7E2), and 🔵 (U+1F535)."</t>

</section>
</section>
<section anchor="iana-considerations"><name>IANA Considerations</name>

<t>This document contains no IANA considerations.</t>

</section>
<section anchor="security-considerations"><name>Security Considerations</name>

<t>Valid Unicode that matches the expected text must be verified in order to preserve expected behavior and protocol information.</t>

</section>


  </middle>

  <back>


<references title='References' anchor="sec-combined-references">

    <references title='Normative References' anchor="sec-normative-references">



<referencegroup anchor="BCP137" target="https://www.rfc-editor.org/info/bcp137">
  <reference anchor="RFC5137" target="https://www.rfc-editor.org/info/rfc5137">
    <front>
      <title>ASCII Escaping of Unicode Characters</title>
      <author fullname="J. Klensin" initials="J." surname="Klensin"/>
      <date month="February" year="2008"/>
      <abstract>
        <t>There are a number of circumstances in which an escape mechanism is needed in conjunction with a protocol to encode characters that cannot be represented or transmitted directly. With ASCII coding, the traditional escape has been either the decimal or hexadecimal numeric value of the character, written in a variety of different ways. The move to Unicode, where characters occupy two or more octets and may be coded in several different forms, has further complicated the question of escapes. This document discusses some options now in use and discusses considerations for selecting one for use in new IETF protocols, and protocols that are now being internationalized. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements.</t>
      </abstract>
    </front>
    <seriesInfo name="BCP" value="137"/>
    <seriesInfo name="RFC" value="5137"/>
    <seriesInfo name="DOI" value="10.17487/RFC5137"/>
  </reference>
</referencegroup>
<referencegroup anchor="STD63" target="https://www.rfc-editor.org/info/std63">
  <reference anchor="RFC3629" target="https://www.rfc-editor.org/info/rfc3629">
    <front>
      <title>UTF-8, a transformation format of ISO 10646</title>
      <author fullname="F. Yergeau" initials="F." surname="Yergeau"/>
      <date month="November" year="2003"/>
      <abstract>
        <t>ISO/IEC 10646-1 defines a large character set called the Universal Character Set (UCS) which encompasses most of the world's writing systems. The originally proposed encodings of the UCS, however, were not compatible with many current applications and protocols, and this has led to the development of UTF-8, the object of this memo. UTF-8 has the characteristic of preserving the full US-ASCII range, providing compatibility with file systems, parsers and other software that rely on US-ASCII values but are transparent to other values. This memo obsoletes and replaces RFC 2279.</t>
      </abstract>
    </front>
    <seriesInfo name="STD" value="63"/>
    <seriesInfo name="RFC" value="3629"/>
    <seriesInfo name="DOI" value="10.17487/RFC3629"/>
  </reference>
</referencegroup>
<reference anchor="RFC7997">
  <front>
    <title>The Use of Non-ASCII Characters in RFCs</title>
    <author fullname="H. Flanagan" initials="H." role="editor" surname="Flanagan"/>
    <date month="December" year="2016"/>
    <abstract>
      <t>In order to support the internationalization of protocols and a more diverse Internet community, the RFC Series must evolve to allow for the use of non-ASCII characters in RFCs. While English remains the required language of the Series, the encoding of future RFCs will be in UTF-8, allowing for a broader range of characters than typically used in the English language. This document describes the RFC Editor requirements and gives guidance regarding the use of non-ASCII characters in RFCs.</t>
      <t>This document updates RFC 7322. Please view this document in PDF form to see the full text.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="7997"/>
  <seriesInfo name="DOI" value="10.17487/RFC7997"/>
</reference>
<reference anchor="RFC9720">
  <front>
    <title>RFC Formats and Versions</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="H. Flanagan" initials="H." surname="Flanagan"/>
    <date month="January" year="2025"/>
    <abstract>
      <t>In order to improve the readability of RFCs while supporting their archivability, the definitive version of the RFC Series transitioned from plain-text ASCII to XML using the RFCXML vocabulary; different publication versions are rendered from that base document. This document describes how RFCs are published.</t>
      <t>This document obsoletes RFC 7990. This document also updates the stability policy in RFC 9280.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="9720"/>
  <seriesInfo name="DOI" value="10.17487/RFC9720"/>
</reference>

<reference anchor="UnicodeCurrent" target="http://www.unicode.org/versions/latest/">
  <front>
    <title>The Unicode Standard</title>
    <author >
      <organization>The Unicode Consortium</organization>
    </author>
    <date year="2023"/>
  </front>
</reference>


    </references>

    <references title='Informative References' anchor="sec-informative-references">



<reference anchor="RFC20">
  <front>
    <title>ASCII format for network interchange</title>
    <author fullname="V.G. Cerf" initials="V.G." surname="Cerf"/>
    <date month="October" year="1969"/>
  </front>
  <seriesInfo name="STD" value="80"/>
  <seriesInfo name="RFC" value="20"/>
  <seriesInfo name="DOI" value="10.17487/RFC0020"/>
</reference>
<reference anchor="RFC6365">
  <front>
    <title>Terminology Used in Internationalization in the IETF</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="J. Klensin" initials="J." surname="Klensin"/>
    <date month="September" year="2011"/>
    <abstract>
      <t>This document provides a list of terms used in the IETF when discussing internationalization. The purpose is to help frame discussions of internationalization in the various areas of the IETF and to help introduce the main concepts to IETF participants. This memo documents an Internet Best Current Practice.</t>
    </abstract>
  </front>
  <seriesInfo name="BCP" value="166"/>
  <seriesInfo name="RFC" value="6365"/>
  <seriesInfo name="DOI" value="10.17487/RFC6365"/>
</reference>
<reference anchor="RFC7322">
  <front>
    <title>RFC Style Guide</title>
    <author fullname="H. Flanagan" initials="H." surname="Flanagan"/>
    <author fullname="S. Ginoza" initials="S." surname="Ginoza"/>
    <date month="September" year="2014"/>
    <abstract>
      <t>This document describes the fundamental and unique style conventions and editorial policies currently in use for the RFC Series. It captures the RFC Editor's basic requirements and offers guidance regarding the style and structure of an RFC. Additional guidance is captured on a website that reflects the experimental nature of that guidance and prepares it for future inclusion in the RFC Style Guide. This document obsoletes RFC 2223, "Instructions to RFC Authors".</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="7322"/>
  <seriesInfo name="DOI" value="10.17487/RFC7322"/>
</reference>

<reference anchor="UnicodeNorm" target="http://www.unicode.org/reports/tr15/">
  <front>
    <title>Unicode Standard Annex</title>
    <author >
      <organization>The Unicode Consortium</organization>
    </author>
    <date year="2023"/>
  </front>
</reference>


    </references>

</references>


<?line 136?>

<section anchor="acknowledgements"><name>Acknowledgements</name>

<t>This document is based on <xref target="RFC7997"/> that was authored by Heather Flanagan.</t>

<t>The acknowledgements from <xref target="RFC7997"/> are
to the members of the IAB i18n program,
to the RFC Format Design Team:
Nevil Brownlee, Tony Hansen, Joe
Hildebrand, Paul Hoffman, Ted Lemon, Julian Reschke, Adam Roach,
Alice Russo, Robert Sparks, and Dave Thaler.</t>

<t>This current document was greatly helped by contributions from the RFC Series Working Group (RSWG), including from
Brian Carpenter,
Carsten Bormann,
Eliot Lear,
John Levine,
and
Martin Thomson.</t>

</section>


  </back>

<!-- ##markdown-source:
H4sIAAAAAAAAA8VZzW7cuhXe6ykIZ1EbVzN27Ovkxt10PIkTB7mGETvNIrm4
oCTOiLVEqiTlydTIps/QfYAC3XXZvkAfpU9wH6HfIamfGTspihZogAQciTz/
5zsflclkkjjpKnHCrsUnx6Rib8/mNuFZZsTt1sNC54rX2FoYvnATY1fLiVnk
T589e5pJOzn4Pkms46r4mVdaYZszrUgg5CjRmdWVcMKeMNqdtE3Bw6+jw8Mk
kY3xu607PDh4dnCY3KxO2LlywijhJs9JW5JzdwJLFjqxbVZLa6VW1+sGakQh
nTaSV0nSyJOEMafzE7YWFkurjTNiYfvf63r4mfDWldrgyASvIBzPL6fslV4s
aq7oUXD3krfV+KmouaxOWIPH0zI8/o3MuVJTbZZJorSpuZO3gmw5nV8+PnpK
q6vr50+OaIFYUhDi8tnTwwNavlMy14WYt8YI5egJHImZKUX3ml1RgLkp/Pve
fvyZMCjf3DvXivyXbR2kcbMUCGLpXHOyv79araZt2El2798KQzG1+xWlxu37
M5SmE3Z4cHiELCH4I89ge7AciydHT44715DRkT8XOLLhzLYjbKaU+MQePT4e
3tEhXsk/QJdW7Ay//vfuGtFgs9135vHxfV8nkwnjmXWG5y5JSIvgplqzRlcy
XzMEgjk8hL/sShgpLFtxi0fc+VZhuW6rgmmFI7lWjqOF8pKTNESZLYyu/fnZ
1fz8fHjDrHDT5A2nZdCUhr2kh2omZbyq9EoUrNZGjEUimExQgnK85ApreCrV
0pvqTdIL9u76bPLDNLmSKhe9TG9502aVtKUoUm/X+YvrMxhe14iYW7MSO0oO
pW1eBs3iU0NukxzIbS1pUlpNthyyHXZMQxC/Hj4Zowf/WCFtU/E1zyrBnAcg
2/sNSwAuS+ajLZgRvECwYAP3ihjaEDqhujHCBZEkYgr1kBL1FxoalXZkqFoK
VuHfli97+yBu07zUIwGS6QW+UEsKVrCZbPh9Kw1s68V0/ll/2PuOvYDPtkZv
sx4NhxxQ+iIoDqrdGgF42UrU9q7fidbag7SPHz5+YDNGFWwJ+rqAkg7CSh+E
jMxoIfZDKYz4aZfawKIPltKVbTZFcvcJwEQEsP2I4ntT9vGnjz+FDqhlUVQi
SR4RGBtdtDk15LY7KFq7nVlUWNUSnFAsN8uBXhdiIVFZQBLWwY4PgS/DPLR9
ABtft6GCzqk6rKacLxbS1CFSUXHINPXpqPBx/iu15jshZXd3HpU/f55+NUV3
dxGvP3/+t1kKe5Gl+wJrfoNDGF00IR2GVaw9CNKDH946taEz41YQksRNNkRt
1H/Wd7N0dhy92G9k4OUopnNBrYFqupzvURiMsA1iL6nTKFCybipB9lIAt61y
Y4dS6sBC2NzIDPZ1RtM0I+eTR4/AHUwtla70ch2sgeqa7TwEEzusFhw1MKoU
3TpLMQ19FFt55ZWidoJKL2aaBGnyIXuiNYP2blh8U3dQEYRszmUv7UdCQDd4
h0kBa++j8UYh2s2u7AykyRkCxk65lTl7G9CEYmx9TsYUjN09ymjXz2a063OS
+He29EMHKiKAIg65htm5A3Lx3GgLy9iKomo87qG1AoAGizOjV4AstNql0CgE
tiq1RfTX1omaUu8xs+S3ISsLTSYqIQooQhVHreBFxnlE9iuZtxX3I4hZJ4Hu
dIC2w86A8L6ji/8YFwbXEBxtaAxAFGKLg8Qt/MG+oCMqRQYDMRu14nXHykYu
ZgimQ5f3777R6GFg9tB/b3RQXcaBAclXoBF5SXTBhxb911bwpZbL0gXQLAQb
ZxNTrDUK2ctbA4v6E+ScbRviMIw3jdENGLAbyBCoC9UenM1LWmQCeZOakotS
uxzQeoPh/7+HNOlHb3H8xUxas/vdGnbrDL60ltgVaDeKcqPody3xFG499TK6
Gh3fS33kvPCQAp7JijhOqMhBiIeTjDtyELOntz0mibAcSjZHW8/8UY+xwBr/
k7xL+2hSB3jc9+MsrwK17ArSb+vldsQJtxZhiGTUAiS48NmJJyi994yxkZSR
sJ13313gzw46GVT0U6CUd3fhakLgM6x7K+JwBWb4gOXUiUhgIReww0/Ixvn2
9GOjoJGxWJPCB1IWIg4aKaomkngKbk6dVOpVQAB4aLs+/RqTxDtp+kalUr7Q
TnSzf+B3nnejKyoZ5us38/LrUL5dzRDRswND9Gg1EPT+MI3Ys9bAIpPillmH
zHbOhX7OS00VRqhEMKopdRQiTafIavVgeghKh8wOEUh7G+UAMryi7qJbxq0P
9ShDnbiOGcU2XBEIj2Cyv4wgAW6EX4jufCP6sQH8+aVQwqDdfZhRv/T+/q2N
zdnuxRmoxubY7ocqnaD6Ox/8WXUzLFxwAsaHCqIHg90eeFYgs+MpMyoXSh4Z
dTaPjXcZcIdS0SoVbaVR5JHnXMXoYpMVQIlQFYrqaxissfmLUY5oSq7vS/RU
BbqKwIUuqKowWGJ9RE4bISiUXIf+Dxa/D3olb+gO4gdwAAQigKEFfR/59gjS
CEIHMBuTSGypUPvFiENaYquCwDDaR4FayjDne5EBUHxvbduXUmPB4a80P49U
bQD9UCOh2KKCaWCsSFPs4iHsYegE2351z/WOd2xIs3Su1MiKT8Fc1w1hfgwO
bFoKvTS8KUG5wsOhpCPR8a3/kN1wN2t9T607UtcXRuw12JRj++BkNN+rjuUY
KuPFJ04cBcXx3s8+ekk1imMPwyDxfEN10PMs3PaDEC+dTPcIs5J2dDmlSHXt
BFh3Osdg1KgA71JK6e7mRZdFOnrLK4/2i28Ac2i2TlMIjgy3UPAvoyNfJPzp
pu0YZoAUnQNpYB3UPz7nAUd/1wKYLDK8QxUCi1oxkOlIOTIgESBCIUW4D9t1
ncG9Di//8Reqz3/+8a87KRkGg2MreUzJkPgFbjm4k8W7WERK/JIK3Bfk9L/U
vvvuu4OD2fFeNIN+Hx7M5ns708jdqV31wgk1alrUTke9fTB8VVD3ZRXPb3yu
V6V0CNqAgBXRGkgBG0TJovJPRc5jOdGETLtI9wRzKHvatk20YivYBtN6QU6i
ajRIySu4fduPvl5myFeshIiDBNH+FCta0UVYcdDani135/0ksJoAm3CxF9td
GTxf6nDygdG5VUs7s05xRIgBUbqbRz8tPXn+5cuf/ka5eXx2fPQ9qOIvX778
Ofx++uIwUkfs+Xu353hvuuO/jswuZv7zIxhN6Ce7/ZEkfgWkVgnb843tnpVf
CZB86o5tUb/FYCn6tgydQcQ+sjQC8tx1g6embsn8BUouZJi4/fXIsyxzOzrT
XQzCPasDhtFNaRq+BWWoOTJylt8ovapEsQy3z21Hse6nzfg7Rn9/D0AYmuaV
4J4LneH+xJe8+2zBt3R0jHX0JcaIJJZSLepMhJnqP13OTpl8/AO1AAF8nXb7
iF2chW9Cz4WVS4WrD69PkgtxKyt2iruvqgSq5lpjTrziCmw0Za+1SF7JCjcB
3JgBbeP/CsBWePFG1ASfr9tKEkECKShvIGVW8Jq91Twv02SGcQb9raXSfqth
rWNX6NkbG0rqOQ306xJIa7pvhXn43DCElSK3BC7RfZd4dIifv+BI4G2g4h2T
G13Z3mtzQ7j+0ui2Ybtvr96/RCGHHvIfJ3AmOTVk+5ybxn8fShMsLYHRKcVL
qTR5UUnAwBswvTR5rUuFJcaFSBPYn/xI8INwlrq2VC//AsdEGE5VGgAA

-->

</rfc>

