← RFC Reference

RFC 6532 – Internationalized Email Headers

Proposed Standard Core SMTP & Message Format Obsoletes RFC 5322 Published February 2012
ELI5: Before this RFC, if you wanted to write a name like “Müller” or a subject line in Japanese in an email header, you had to encode it into an unreadable mess like =?UTF-8?B?...?=. RFC 6532 says “just use UTF-8 directly.” The From line can say 田太郎 <taro@example.jp> in plain text, and the Subject can be in any language without encoding tricks. The email looks the same on the wire as it does on screen.

Why This RFC Exists

The original email message format (RFC 5322) restricts header fields to US-ASCII characters. To include non-ASCII text in headers like Subject, From display names, or address comments, senders had to use RFC 2047 encoded words — a cumbersome scheme that Base64 or Q-encodes the text and wraps it in =?charset?encoding?...?= markers.

This system works but produces headers that are unreadable in their raw form, difficult to implement correctly, and limited in where they can appear. For example, RFC 2047 encoded words cannot be used inside the local-part of an email address.

RFC 6532 updates RFC 5322 to allow UTF-8 directly in email headers. Combined with RFC 6531 (SMTPUTF8 for the SMTP envelope), this enables fully internationalized email where addresses, display names, subjects, and other headers can all use native scripts without encoding workarounds.

How It Works

  1. The message is transmitted via SMTP with the SMTPUTF8 extension (RFC 6531), signaling that the message uses internationalized content.
  2. Header fields may contain raw UTF-8 octets anywhere that RFC 5322 allows text.
  3. Email addresses in headers (From, To, Cc, Reply-To) may contain UTF-8 characters in the local-part and the domain part.
  4. The Content-Type for the message header block is implicitly message/global (defined in RFC 6532) instead of the traditional message/rfc822.
  5. Receiving mail clients that support RFC 6532 display the headers natively. Clients that don't may display raw UTF-8 bytes or fail to parse the headers.

Header Examples

Traditional RFC 2047 encoded headers vs. RFC 6532 headers:

# Old way: RFC 2047 encoded words (unreadable on the wire) From: =?UTF-8?B?w4ltaWxpZSBEdXBvbnQ=?= <emilie@example.fr> Subject: =?UTF-8?B?5Lya6K2w44Gu56K66KqN?= To: =?UTF-8?B?5bGx55Sw5aSq6YOO?= <taro@example.jp> # New way: RFC 6532 with raw UTF-8 (human-readable) From: Émilie Dupont <émilie@exemple.fr> Subject: 会議の確認 To: 田太郎 <田太郎@例.jp>

A complete internationalized message:

From: 田太郎 <田太郎@例.jp> To: Émilie Dupont <émilie@exemple.fr> Subject: 会議の確認 Date: Wed, 11 Mar 2026 19:00:00 +0900 Message-ID: <20260311.001@例.jp> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit 来週の会議を確認します。

Key Technical Details

message/global vs. message/rfc822

RFC 6532 introduces a new MIME type: message/global. This is functionally identical to message/rfc822 but signals that the message may contain UTF-8 in headers. When a message is transmitted with SMTPUTF8, attached messages and forwarded messages should use message/global instead of message/rfc822:

MIME Type Header Encoding Usage
message/rfc822 ASCII only (RFC 2047 for non-ASCII) Traditional email
message/global UTF-8 allowed natively Internationalized email

Where UTF-8 Is Allowed

RFC 6532 permits UTF-8 in all header field positions where RFC 5322 allows text:

DKIM Signing Considerations

DKIM signatures cover specific headers. When those headers contain raw UTF-8, the signing and verification process must handle the bytes correctly. The DKIM h= tag lists headers to sign, and the canonicalization applies to the raw UTF-8 bytes. Both sender and verifier must process the same byte sequence for signatures to match.

Backward Compatibility

RFC 6532 messages are not backward compatible with mail systems that only understand RFC 5322. A server or client that receives a message/global message but doesn't support RFC 6532 may:

Common Mistakes

Deliverability Impact

Related RFCs