UTF8 File Documentation
Overview
| Feature | Value |
|---|---|
| Format Name | Unicode UTF8-Encoded Text Document |
| File Extension | .txt |
| MIME Type | text/plain; charset=UTF-8 |
| Encoding Type | Variable-width encoding |
| Character Set | Universal Character Set (Unicode) |
| Byte Order Mark (BOM) Optional | Yes (EF BB BF) |
| Maximum Character Size | 4 Bytes |
| Number of Characters | Over 1,112,064 |
| Character Range | U+0000 to U+10FFFF |
| Support for Surrogate Pairs | Yes |
| Support for Combining Characters | Yes |
| Endian Independent | Yes |
| Backward Compatibility | With ASCII |
| Use in XML | Default encoding |
| Use in JSON | Default encoding |
| Supports All Unicode Characters | Yes |
| Normalization Forms | Supports NFC, NFD, NFKC, NFKD |
| Common Usage | Text files, Source code, Data interchange |
| Advantages | Universal compatibility, supports all languages and emojis |
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.