UTF-8 File Documentation


Overview

Feature                           Value
Format Name                       Unicode UTF-8 Encoded Text Document
File Extension                    .txt
MIME Type                         text/plain; charset=UTF-8
Encoding Type                     Variable-width encoding (1 to 4 bytes per code point)
Character Set                     Universal Character Set (Unicode)
Byte Order Mark (BOM)             Optional (EF BB BF)
Maximum Character Size            4 bytes
Number of Characters              1,112,064 encodable code points
Character Range                   U+0000 to U+10FFFF, excluding surrogates U+D800 to U+DFFF
Support for Surrogate Pairs       Not used; characters above U+FFFF are encoded directly as 4-byte sequences
Support for Combining Characters  Yes
Endian Independent                Yes
Backward Compatibility            With ASCII
Use in XML                        Default encoding
Use in JSON                       Default encoding
Supports All Unicode Characters   Yes
Normalization Forms               NFC, NFD, NFKC, NFKD
Common Usage                      Text files, source code, data interchange
Advantages                        Universal compatibility; supports all languages and emoji
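
Example

Several rows of the table above (variable width, the 4-byte maximum, ASCII compatibility, and the optional BOM) can be checked directly. The following is a minimal Python sketch for illustration only, not part of the format definition. The helper names utf8_summary and decode_utf8 are hypothetical, and the sample characters are arbitrary choices.

```python
import codecs


def utf8_summary(text: str) -> list[tuple[str, str, int]]:
    """Return (character, hex bytes, byte count) for each code point in text."""
    rows = []
    for ch in text:
        encoded = ch.encode("utf-8")
        rows.append((ch, encoded.hex(" ").upper(), len(encoded)))
    return rows


def decode_utf8(data: bytes) -> str:
    """Decode UTF-8 bytes, dropping a leading BOM (EF BB BF) if one is present."""
    # The "utf-8-sig" codec accepts input with or without the BOM.
    return data.decode("utf-8-sig")


if __name__ == "__main__":
    # Illustrative sample: A (1 byte), e-acute (2), euro sign (3), emoji (4).
    sample = "A\u00e9\u20ac\U0001F600"
    for ch, hex_bytes, size in utf8_summary(sample):
        print(f"U+{ord(ch):04X}  {hex_bytes:<11}  {size} byte(s)")

    # ASCII text is byte-for-byte identical when encoded as UTF-8.
    assert "plain ASCII".encode("utf-8") == "plain ASCII".encode("ascii")

    # The BOM is optional: both inputs decode to the same text.
    with_bom = codecs.BOM_UTF8 + "hi".encode("utf-8")
    assert decode_utf8(with_bom) == decode_utf8(b"hi") == "hi"
```

Decoding with the plain "utf-8" codec instead would keep the BOM as a leading U+FEFF character in the result, which is why the sketch uses "utf-8-sig" when the presence of a BOM is unknown.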