UTF8 File Documentation
Overview
Feature | Value |
---|---|
Format Name | Unicode UTF8-Encoded Text Document |
File Extension | .txt |
MIME Type | text/plain; charset=UTF-8 |
Encoding Type | Variable-width encoding |
Character Set | Universal Character Set (Unicode) |
Byte Order Mark (BOM) Optional | Yes (EF BB BF) |
Maximum Character Size | 4 Bytes |
Number of Characters | Over 1,112,064 |
Character Range | U+0000 to U+10FFFF |
Support for Surrogate Pairs | Yes |
Support for Combining Characters | Yes |
Endian Independent | Yes |
Backward Compatibility | With ASCII |
Use in XML | Default encoding |
Use in JSON | Default encoding |
Supports All Unicode Characters | Yes |
Normalization Forms | Supports NFC, NFD, NFKC, NFKD |
Common Usage | Text files, Source code, Data interchange |
Advantages | Universal compatibility, supports all languages and emojis |
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.