<?xml version="1.0" encoding="UTF-8"?>
  <?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
  <!-- generated by https://github.com/cabo/kramdown-rfc version 1.7.29 (Ruby 3.4.7) -->


<!DOCTYPE rfc  [
  <!ENTITY nbsp    "&#160;">
  <!ENTITY zwsp   "&#8203;">
  <!ENTITY nbhy   "&#8209;">
  <!ENTITY wj     "&#8288;">

]>


<rfc ipr="trust200902" docName="draft-rswg-rfc7997bis-06" category="info" submissionType="editorial" obsoletes="7997" tocInclude="true" sortRefs="true" symRefs="true">
  <front>
    <title abbrev="Text in RFCs">Text in RFCs</title>

    <author initials="P." surname="Hoffman" fullname="Paul Hoffman">
      <organization></organization>
      <address>
        <email>paul.hoffman@icann.org</email>
      </address>
    </author>

    <date year="2025" month="December" day="14"/>

    
    
    <keyword>Internet-Draft</keyword>

    <abstract>


<?line 41?>

<t>This document sets policy for the inclusion of characters in the definitive versions and publication formats of RFCs.
The policy for the RFC Series is that all displayable text is allowed as long as there is a high expectation that readers of an RFC will be able to interpret its text as intended.
This document obsoletes RFC 7997.</t>

<t>[[ A repository for this draft can be found <eref target="https://github.com/paulehoffman/7997bis">here</eref>. ]]</t>



    </abstract>



  </front>

  <middle>


<?line 49?>

<section anchor="introduction"><name>Introduction</name>

<t>The early policy for the RFC Series was that RFCs could only contain characters from the ASCII character set.
Later policies, from <xref target="RFC7997"/>, allowed more characters, set the language of the RFC Series to be English, and set the encoding for RFCs of UTF-8.
In the time since <xref target="RFC7997"/> was published, the IETF community has had much more experience of using non-ASCII characters in RFCs.</t>

<t>This document obsoletes <xref target="RFC7997"/>.
This document makes substantial changes to the policies in <xref target="RFC7997"/> based on the positive experience since its publication.</t>

<t>The RFC Publication Center (RPC) is responsible for implementing the policies in this document, as described in <xref target="RFC9720"/>.
The RPC style guides may define which characters authors may use and how they are used.</t>

<section anchor="terminology"><name>Terminology</name>

<t>The term "non-ASCII characters" means characters outside the set that was defined in ASCII.
ASCII is described in <xref target="RFC20"/>.</t>

<t>The term "Unicode characters" means characters defined in <xref target="UnicodeLatest"/>.</t>

<t>"U+ notation" means the use of using the characters "U+" and a decimal number to represent a Unicode code point.
See <xref target="BCP137"/> for more on U+ notation.</t>

<t>More terminology about characters and encoding formats can be found in <xref target="RFC6365"/>.</t>

</section>
</section>
<section anchor="basic-requirements-for-text-in-rfcs"><name>Basic Requirements for Text in RFCs</name>

<t>RFCs should only contain text that can be displayed correctly across a wide range of readers and browsers.
People whose systems do not have the fonts needed to display part of a particular RFC still need to be able to read the definitive versions and publication formats correctly in order to understand and implement the information described in the document.</t>

<t>The ability to use non-ASCII characters in RFCs in a clear and consistent manner allows the correct display of proper names and improves the ability to describe internationalized protocols.</t>

<t>The ability to use non-ASCII characters in RFCs in a clear and consistent manner will allow the correct display of proper names and improve the ability to describe internationalized protocols.
Apart from their role in proper names, non-ASCII characters should be used only when they enhance the technical content and accuracy of the document.</t>

</section>
<section anchor="policy-for-text-in-rfcs"><name>Policy for Text in RFCs</name>

<t>English is the required language of RFCs.
However, because non-ASCII characters are often required for instances including proper names and examples, the policy for the RFC Series is that all displayable text is allowed as long as there is a high expectation that readers of an RFC will be able to interpret its text as intended.
Apart from their role in proper names, non-ASCII characters should be used only when they enhance the technical content and accuracy of the document.</t>

<t>There are many Unicode characters that obviously cannot be displayed (such as control characters), and many whose ability to be displayed is debatable.
If an RFC includes such characters in normative or descriptive text, the RFC needs to also clearly describe the character.</t>

<t>The preferred method for describing such characters is using the U+ notation from <xref target="BCP137"/> and/or using the character's official name from the Unicode Standard <xref target="UnicodeLatest"/>.
<xref target="BCP137"/> describes the pros and cons of different options for identifying Unicode characters and may help authors decide how to represent the non-ASCII characters in their documents.</t>

<t>Note that this policy only applies to normative or descriptive text; text such as names does not need character description.
Further, some RFC authors might choose to use something other than the U+ notation to describe characters, such as if the RFC already covers a different syntax that the reader will understand from the rest of the RFC.</t>

<t>Characters in an RFC will generally appear in Normalization Form C (NFC) as defined in <xref target="UnicodeLatest"/>.
If the RFC would be more correct and more understandable with particular characters not in NFC, the RPC can use unnormalized text.
In such a case, a note should be included to describe why unnormalized text was used.</t>

<section anchor="names"><name>Names</name>

<t>Authors of RFCs whose names include non-ASCII characters will likely have preferences for how their names are displayed based on their lived experiences.
Authors can give their names using only Latin script characters, or using non-Latin script and an equivalent in Latin script.
Authors' preferences for display of their names should be honored.</t>

<t>Company names and geographic names generally do not need Latin equivalents, but they can be included at the discretion of the author and the RPC.</t>

</section>
<section anchor="examples"><name>Examples</name>

<t>Where the use of non-ASCII characters is purely part of an example or not otherwise required for correct protocol operation, giving the Unicode code points and Unicode names of the non-ASCII characters is not required, but it can improve the readability of the RFC.
An RFC can use either or both forms, whichever is sensible in the circumstance.
For example, for text that might just say "The value can be followed by a monetary symbol such as ¥ or €", text that either says "The value can be followed by a monetary symbol such as ¥ (U+00A5) or € (U+20AC)" or text that says "The value can be followed by a monetary symbol such as ¥ (U+00A5) (YEN SIGN) or € (U+20AC) (EURO SIGN)" is likely more beneficial to the reader.</t>

<t>RFCs may be viewed using only black and white or grayscale, particularly when printed.
Because of this, examples should generally use characters that do not specify a color.
However, some examples might require text with color due to the nature of the examples.
If so, those examples need to also include U+ notation.
For example, "A color display should be able to differentiate 🔴 (U+1F534) (LARGE RED CIRCLE), 🟢 (U+1F7E2) (LARGE GREEN CIRCLE), and 🔵 (U+1F535) (LARGE BLUE CIRCLE)."</t>

</section>
</section>
<section anchor="rfc-publication-language"><name>RFC Publication Language</name>

<t>The RFC publication language is English.</t>

</section>
<section anchor="rfc-publication-encoding"><name>RFC Publication Encoding</name>

<t>The encoding format for the RFC Series is UTF-8 <xref target="STD63"/>.</t>

</section>
<section anchor="iana-considerations"><name>IANA Considerations</name>

<t>This document contains no IANA considerations.</t>

</section>
<section anchor="security-considerations"><name>Security Considerations</name>

<t>Authors and the RPC should cross-check that the used characters match their code point numbers or Unicode character names.</t>

</section>


  </middle>

  <back>


<references title='References' anchor="sec-combined-references">

    <references title='Normative References' anchor="sec-normative-references">



<referencegroup anchor="BCP137" target="https://www.rfc-editor.org/info/bcp137">
  <reference anchor="RFC5137" target="https://www.rfc-editor.org/info/rfc5137">
    <front>
      <title>ASCII Escaping of Unicode Characters</title>
      <author fullname="J. Klensin" initials="J." surname="Klensin"/>
      <date month="February" year="2008"/>
      <abstract>
        <t>There are a number of circumstances in which an escape mechanism is needed in conjunction with a protocol to encode characters that cannot be represented or transmitted directly. With ASCII coding, the traditional escape has been either the decimal or hexadecimal numeric value of the character, written in a variety of different ways. The move to Unicode, where characters occupy two or more octets and may be coded in several different forms, has further complicated the question of escapes. This document discusses some options now in use and discusses considerations for selecting one for use in new IETF protocols, and protocols that are now being internationalized. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements.</t>
      </abstract>
    </front>
    <seriesInfo name="BCP" value="137"/>
    <seriesInfo name="RFC" value="5137"/>
    <seriesInfo name="DOI" value="10.17487/RFC5137"/>
  </reference>
</referencegroup>
<referencegroup anchor="STD63" target="https://www.rfc-editor.org/info/std63">
  <reference anchor="RFC3629" target="https://www.rfc-editor.org/info/rfc3629">
    <front>
      <title>UTF-8, a transformation format of ISO 10646</title>
      <author fullname="F. Yergeau" initials="F." surname="Yergeau"/>
      <date month="November" year="2003"/>
      <abstract>
        <t>ISO/IEC 10646-1 defines a large character set called the Universal Character Set (UCS) which encompasses most of the world's writing systems. The originally proposed encodings of the UCS, however, were not compatible with many current applications and protocols, and this has led to the development of UTF-8, the object of this memo. UTF-8 has the characteristic of preserving the full US-ASCII range, providing compatibility with file systems, parsers and other software that rely on US-ASCII values but are transparent to other values. This memo obsoletes and replaces RFC 2279.</t>
      </abstract>
    </front>
    <seriesInfo name="STD" value="63"/>
    <seriesInfo name="RFC" value="3629"/>
    <seriesInfo name="DOI" value="10.17487/RFC3629"/>
  </reference>
</referencegroup>
<reference anchor="RFC7997">
  <front>
    <title>The Use of Non-ASCII Characters in RFCs</title>
    <author fullname="H. Flanagan" initials="H." role="editor" surname="Flanagan"/>
    <date month="December" year="2016"/>
    <abstract>
      <t>In order to support the internationalization of protocols and a more diverse Internet community, the RFC Series must evolve to allow for the use of non-ASCII characters in RFCs. While English remains the required language of the Series, the encoding of future RFCs will be in UTF-8, allowing for a broader range of characters than typically used in the English language. This document describes the RFC Editor requirements and gives guidance regarding the use of non-ASCII characters in RFCs.</t>
      <t>This document updates RFC 7322. Please view this document in PDF form to see the full text.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="7997"/>
  <seriesInfo name="DOI" value="10.17487/RFC7997"/>
</reference>
<reference anchor="RFC9720">
  <front>
    <title>RFC Formats and Versions</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="H. Flanagan" initials="H." surname="Flanagan"/>
    <date month="January" year="2025"/>
    <abstract>
      <t>In order to improve the readability of RFCs while supporting their archivability, the definitive version of the RFC Series transitioned from plain-text ASCII to XML using the RFCXML vocabulary; different publication versions are rendered from that base document. This document describes how RFCs are published.</t>
      <t>This document obsoletes RFC 7990. This document also updates the stability policy in RFC 9280.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="9720"/>
  <seriesInfo name="DOI" value="10.17487/RFC9720"/>
</reference>

<reference anchor="UnicodeLatest" target="http://www.unicode.org/versions/latest/">
  <front>
    <title>The Unicode Standard</title>
    <author >
      <organization>The Unicode Consortium</organization>
    </author>
    <date year="n.d."/>
  </front>
</reference>


    </references>

    <references title='Informative References' anchor="sec-informative-references">



<reference anchor="RFC20">
  <front>
    <title>ASCII format for network interchange</title>
    <author fullname="V.G. Cerf" initials="V.G." surname="Cerf"/>
    <date month="October" year="1969"/>
  </front>
  <seriesInfo name="STD" value="80"/>
  <seriesInfo name="RFC" value="20"/>
  <seriesInfo name="DOI" value="10.17487/RFC0020"/>
</reference>
<reference anchor="RFC6365">
  <front>
    <title>Terminology Used in Internationalization in the IETF</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="J. Klensin" initials="J." surname="Klensin"/>
    <date month="September" year="2011"/>
    <abstract>
      <t>This document provides a list of terms used in the IETF when discussing internationalization. The purpose is to help frame discussions of internationalization in the various areas of the IETF and to help introduce the main concepts to IETF participants. This memo documents an Internet Best Current Practice.</t>
    </abstract>
  </front>
  <seriesInfo name="BCP" value="166"/>
  <seriesInfo name="RFC" value="6365"/>
  <seriesInfo name="DOI" value="10.17487/RFC6365"/>
</reference>



    </references>

</references>


<?line 140?>

<section anchor="acknowledgements"><name>Acknowledgements</name>

<t>This document is based on <xref target="RFC7997"/> which was authored by Heather Flanagan.</t>

<t>The acknowledgements from <xref target="RFC7997"/> are
to the members of the IAB i18n program,
to the RFC Format Design Team:
Nevil Brownlee, Tony Hansen, Joe
Hildebrand, Paul Hoffman, Ted Lemon, Julian Reschke, Adam Roach,
Alice Russo, Robert Sparks, and Dave Thaler.</t>

<t>Writing this document was greatly helped by contributions from the RFC Series Working Group (RSWG), including:
Brian Carpenter,
Carsten Bormann,
Eliot Lear,
John Klensin,
John Levine,
Martin Dürst,
Martin Thomson,
Pete Resnick,
Rob Sayre, and
Russ Housley.</t>

</section>


  </back>

<!-- ##markdown-source:
H4sIAAAAAAAAA9VZzW4cuRG+8ykI+bAydmaktWN7rVwyGo9sOVpFkGQYwXoR
cLo504y6m7Nkt8YTYy95htwXCJBbjsktp+RN8gT7CPmqyP4byQ42P0Dig93T
TRbr56uvqujxeCwqU+X6SF7r95U0pbw8mXmhFgunb3depjYpVYGlqVPLauz8
ZjV2y+TZ8+fPFsaPD58K4StVpr9SuS2xrHK1FhDyWNiFt7mutD+StFoIs3b8
3VePDg+fHz4SN5sjeVpW2pW6Gr8g+SJR1RHOXlrh60VhvDe2vN6uIVinprLO
qFyItTkSUlY2OZJb7fHoraucXvr297bofgpVV5l12DLGJwjH+4uJfGWXy0KV
9CoYeKHqvP9WF8rkR3KN15MsvP6ZSVRZTqxbCVFaV6jK3GrS5Xh28cXjZ/R0
df3i6WN6gPfI7Pj4/NmjQ3p8U5rEpvpMwS8VvYAdMRSZbr7KK/Kocil/b9XH
n7HE2cO1M1uS+aYugjTlVho+zKpqfXRwsNlsJnVYSWof3GpHLvUHOWtwgKjA
2T1LoGvQFA9PHz99ciTEeDyWauErp5JKiOvMeAlU1IUuK+l15eXa5ibZSsiR
FTQzZZLXdIq0S5lkivbhWIIUfU710pSGzpONNhLmynW9gBgogo1BJU8CCIUT
QRbvHIMP8ko7oyHZ442qpMpzmRq/ztVWLXItK0ayp/d2o1OpvARKV/QvJDjN
32RmVpnU79c6qcLpLMtplZLWUEFxLsiNgfiFlkG0hT0wa+00joCqfJby/LZM
dTrZ8VSbDiyLoDER4t3X776WU5y1tp7g3VhHGykfJPBGRy5tDQ99TTp/s0+h
9YjtylRZvZgktjggkOoI0oOYmw8n8t03774J8StMmuZaiAeUcM6mdUKWCvar
Vi7ffsK7GxXdS6GQia3zVNoSWxJbVgpR7cV46WzB+6dXs9PT7gsBZSII9i6c
BMGjsPrDh5gq3303aiNVWESnkzui/Sw3V+WqVitNcdnREyGBq+blKjc+GzGm
ml26RAYYRJ7MYzOw/c31yfjLiTgNsKxMoaUHdnVfIzaekekznY545en8+gS2
FwUSq9rKDCsyBZXrJAt6E5agEYnCMUgFHFzacrzjE9+Q7GQ3qzqs9FTZBVSh
brAANEn8W4EYSXK5Co6omnzh9CgHJi2U1xTBuMiHXOwpHbxAoO7l5CSAhfx9
0cvUmaYskPuXF7OHlE5O+zUy2lCOkLNNsc416UtO2NWq6hs0ouxJtU+cWUC/
RmlizmA8Dr+YSV9tIXpVGyyFD7aBT7TcZAb+7zk38GZYU3vNeMjshpTYSoUw
4SWyVDx4gIrnClPa3K62wUoIKOTefTHbk4VWYKzeQbauPLRh6wLgkCsbNoY0
Y1NYzEQEaeY+O4OVvdMbgv/k2b0jPnwYFBeWtvfmcyAv0Fqzm9Qkf7TQpBc9
kdizx85SkJ6YAsAq62KBIANXYCpEmOCn2grEf60tiG8irjSlTyiHgBohgFMC
SOmpAs2+ordV53ewKvw4iB9U6OctF4QBHzauozrF5j6Qx8qbRF7qb2vjGHae
dRi0NIIJwGd3iYw5nMMXz4nFBA5OrHMoEFisEmc91Y0NBd1RypEvm3pBai+c
3Xj8mIgLbYF/gNPC437rK10Q4skTII3bAJqlJT1LrVE2yMnxVLQeruL6w08m
qXPF7IUcoFJEGyLpNSWJlPjRRbYzDS6wLg2RhoOxsWIgkK+bPI4lPjYNEDOA
Mp8dEzqiWS1MTjxJMuGETxEh/atkkqMi8aEJMQl8xmxXllCM60PAcFS79RYc
tXYWJMb9nG+0dvZWhw09RRqdQw0v2RCVm9/ACGxAZ2lz/99Qn1sItuHHmvCv
WTBlCDVF2TjpUFdIzf4xo/uNigmyCEQZEmWT6TIQqC5RbJKgVqWTDFxAFciW
bC6jJklqyNo2lboHiwfyous2hrkZy3do6ZBeIZPTQd0PNfMVGgVgewQNE/XR
0BDR2yWU6kRxWSoJ2gmXIXSrzDF3XK/fK0K9H3WF6/+o/fwfjf01W05hQU5s
5d0qF+y3i1tja0/kjNQBWw7YeN9Ts6U8Hwq7etsfhs6PhQfa7WXNQAiX4YWq
yKFoA1tHB0RwbzXsKOC7duoDUcYcXPNP8v+oBQZRM/dhKvc2MEK+7XJ2UHEj
zyCSS+0In4VG4xJgGncQOu8o43vlu1dbm6a6rcHwxgFE3VPrPyN4LdGNUYkH
GrrufXcQva+56B3RGBZyFvjyLf8RAlKzhGnc2K4rLkWcgil1hcstqXUPCkIQ
0V7rfN32ctSQYBm3cf1mhI79GDMH8DcIJFo/t5UOKOMGNCY2g1yt13mcJT4Z
6p+GhGtgGDgjtfiLsMqVuZt92s3U+ZzUjpIfE40tAljaRhVEQB2QJdDGckOL
oCRcZGkXaV3eiXi/HAxmpqid6UYllROxUMdzy07uxcZv0QS9b/yiIwMF2uk1
Ay1G4PqqN4XBr7OB3/u0tdKofqBBdjCVR3w/J/+iYgUbTvBLzuT++QnmCPXP
GtvTzqJNw1RhZIwVldFDLzrNmTc3mJn73VQPKhQ4UutkFvMYwwa1gRSGuiyj
ttRyIfI8Ngb/YpHXIB0SoHvMGWkkHYRnk23vCuNhoZtGzglMQkwjLGLBi2QW
kBZl34959nhubnS+DT1moBbN5Y4yL05Bpq11rk+L/fEQS3JAPu1Nh9RVRM3I
OysTepNWWOAZTibECw4N2B8As6Uj0n+wKrSbkkr1rcoJlvjWX9Ge/tkds3pd
VF+hLiKZhePZyzNbrKlAdLV+pe3KqTWGyPiyg2xs1zmpgyqdejBmUVehLMaZ
oY17zCOolaBKx/swbuPYAj42Ai0Efh77DSHecpHsTWr3kxsN6I7C3E4KZdO0
kI9Ja6aNjfF62P80edL0ipJaAs7EEYW0rSt3RrzgreZ9cFU062M6khrN4cFd
JgxY/daW2KYp1H1SmQYWaRJRG6ZBWLCAZTzCIAQ8+1MzSMehIITrhziOJMaB
+kO7B/rF1uiiUejl2pkv8O+va9CaB4z2qCojyrXups7YyC3AY6CXUlfKbem2
eQEPNmz71z+Qen//7R/3Rj3hUXEI9v+O5P03nx8eTp88jEfQ70eH09nDPTkw
5T92zP4v5+fy6vTl+Z0T5f78zeUvwrc98ntkHGbdBZInNhbxMipUk0kcvqmw
Q59bo0mbHmUscpXcMMQQ04pRjLTcejSYiFdH3E0/unbU8CKlj+MQwNgxwETT
vDf53+UzLdttN2OOe3ThaEmI1G1uXW/I4GrdygxQiaCOHE6VhXfJtNaN2RjM
atdeVjb7uYB5S3WGOL0V20z03DU2HD+4NxnAd2/aHBiZr6O6Zkhoy7tB7ZQ/
fP+7P1EAvzh58vgniODZ9PLlXF7OX8jZ6eXsbI7e+Yfvv/99WPJs/qhd8vJy
Dhy0iyg+kPXnRtaTduHx2Zt5s26yR5Pe7oXhWRzkuuvE/p1EO+YBUHEUnNwn
ZR4vhuIN9vCa6CMzGt/3opvg/6KJ90Wn0/Mp/xcK2srAf373MjZeDhGPheXJ
YDmLudKYd4i6dkU1pbLH9U2U+BppDN4C4Numi8etHjhhDZIylLOOhOOFnKf0
uNM6B1KehHv/BdKJFJwmN6Xd5DpdhVuxXSPx3Nb9wfU336pSfxKKVuCOV1ox
m50gXGqlmrthtXPInft96jREzIxCRxNCapxOj6X54kueTpHxxahZRzE8CWF9
ob1ZlfJaq+JInOtbk8tjZzdlrpEM1xb1/JUqQf8j+dpq8crkGO8cPD8a/N8e
llIh1wXVutd1bqhRRX+W3UDKNFWFvLQqyUZiCrTh/NpTpl5aaFvJK1DQjQ8Z
8IJ6q+sMzES89hbxD2Wz71fy3ArcR7drNMkE//HUalAJwzDUdNQ9uL617oak
vXS2Xsv9y6u3L5F37V3FkTh2pPdMuTVfwI8EHumqSR6Tr8pyJOa5AaOdodse
idc2K+XPc6qMZfx1Bv+VeiS+Ik4t5Yu//QX725/XmS08HCQuNHgD7gHKbkYC
XpBXaus0e0CQb+BWjOl6OxH/AKT/npNYHgAA

-->

</rfc>

