U+00A8 Diaeresis
U+00A8 was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Modifier Symbol and is commonly used, that is, in no specific script.
The glyph is a compatibility composition of the glyphs
The CLDR project calls this character “diaeresis” for use in screen reading software. It assigns these additional labels, e.g. for search in emoji pickers: tréma, umlaut.
The Wikipedia has the following information about this codepoint:
Diacritical marks of two dots ¨, placed side-by-side over or under a letter, are used in several languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, ë represents a schwa. Such diacritics are also sometimes used for stylistic reasons (as in the family name Brontë or the band name Mötley Crüe).
In modern computer systems using Unicode, the two-dot diacritics are almost always encoded identically, having the same code point. For example, U+00F6 ö LATIN SMALL LETTER O WITH DIAERESIS represents both o-umlaut and o-diaeresis. Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.
The word trema (French: tréma), used in linguistics and also classical scholarship, describes the form of both the umlaut diacritic and the diaeresis rather than their function and is used in those contexts to refer to either.
Representations
System | Representation |
---|---|
Nº | 168 |
UTF-8 | C2 A8 |
UTF-16 | 00 A8 |
UTF-32 | 00 00 00 A8 |
URL-Quoted | %C2%A8 |
HTML hex reference | ¨ |
Wrong windows-1252 Mojibake | ◌¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
Encoding: BIG5HKSCS (hex bytes) | C6 D8 |
Encoding: CP037 (hex bytes) | BD |
Encoding: CP273 (hex bytes) | BD |
Encoding: CP424 (hex bytes) | BD |
Encoding: CP500 (hex bytes) | BD |
Encoding: CP850 (hex bytes) | F9 |
Encoding: CP852 (hex bytes) | F9 |
Encoding: CP856 (hex bytes) | F9 |
Encoding: CP857 (hex bytes) | F9 |
Encoding: CP858 (hex bytes) | F9 |
Encoding: CP863 (hex bytes) | A4 |
Encoding: CP869 (hex bytes) | F9 |
Encoding: CP875 (hex bytes) | 70 |
Encoding: CP932 (hex bytes) | 81 4E |
Encoding: CP949 (hex bytes) | A1 A7 |
Encoding: CP1026 (hex bytes) | BD |
Encoding: CP1140 (hex bytes) | BD |
Encoding: CP1250 (hex bytes) | A8 |
Encoding: CP1252 (hex bytes) | A8 |
Encoding: CP1253 (hex bytes) | A8 |
Encoding: CP1254 (hex bytes) | A8 |
Encoding: CP1255 (hex bytes) | A8 |
Encoding: CP1256 (hex bytes) | A8 |
Encoding: CP1257 (hex bytes) | 8D |
Encoding: CP1258 (hex bytes) | A8 |
Encoding: EUC_JP (hex bytes) | A1 AF |
Encoding: EUC_JIS_2004 (hex bytes) | A1 AF |
Encoding: EUC_JISX0213 (hex bytes) | A1 AF |
Encoding: EUC_KR (hex bytes) | A1 A7 |
Encoding: GB2312 (hex bytes) | A1 A7 |
Encoding: GBK (hex bytes) | A1 A7 |
Encoding: GB18030 (hex bytes) | A1 A7 |
Encoding: HZ (hex bytes) | 7E 7B 21 27 7E 7D |
Encoding: ISO2022_JP (hex bytes) | 1B 24 42 21 2F 1B 28 42 |
Encoding: ISO2022_JP_1 (hex bytes) | 1B 24 42 21 2F 1B 28 42 |
Encoding: ISO2022_JP_2 (hex bytes) | 1B 24 42 21 2F 1B 28 42 |
Encoding: ISO2022_JP_2004 (hex bytes) | 1B 24 42 21 2F 1B 28 42 |
Encoding: ISO2022_JP_3 (hex bytes) | 1B 24 42 21 2F 1B 28 42 |
Encoding: ISO2022_JP_EXT (hex bytes) | 1B 24 42 21 2F 1B 28 42 |
Encoding: ISO2022_KR (hex bytes) | 1B 24 29 43 0E 21 27 0F |
Encoding: LATIN_1 (hex bytes) | A8 |
Encoding: ISO8859_2 (hex bytes) | A8 |
Encoding: ISO8859_3 (hex bytes) | A8 |
Encoding: ISO8859_4 (hex bytes) | A8 |
Encoding: ISO8859_7 (hex bytes) | A8 |
Encoding: ISO8859_8 (hex bytes) | A8 |
Encoding: ISO8859_9 (hex bytes) | A8 |
Encoding: JOHAB (hex bytes) | D9 37 |
Encoding: MAC_GREEK (hex bytes) | 8C |
Encoding: MAC_ICELAND (hex bytes) | AC |
Encoding: MAC_LATIN2 (hex bytes) | AC |
Encoding: MAC_ROMAN (hex bytes) | AC |
Encoding: MAC_TURKISH (hex bytes) | AC |
Encoding: SHIFT_JIS (hex bytes) | 81 4E |
Encoding: SHIFT_JIS_2004 (hex bytes) | 81 4E |
Encoding: SHIFT_JISX0213 (hex bytes) | 81 4E |
Encoding: CP037 (hex bytes) | BD |
Encoding: CP1047 (hex bytes) | BB |
Encoding: CP1122 (hex bytes) | BD |
Encoding: CP1140 (hex bytes) | BD |
Encoding: CP1141 (hex bytes) | BD |
Encoding: CP1142 (hex bytes) | BD |
Encoding: CP1143 (hex bytes) | BD |
Encoding: CP1144 (hex bytes) | BD |
Encoding: CP1145 (hex bytes) | A1 |
Encoding: CP1146 (hex bytes) | BD |
Encoding: CP1147 (hex bytes) | A1 |
Encoding: CP1148 (hex bytes) | BD |
Encoding: CP1148MS (hex bytes) | BD |
Encoding: CP1149 (hex bytes) | BD |
Encoding: CP273 (hex bytes) | BD |
Encoding: CP277 (hex bytes) | BD |
Encoding: CP278 (hex bytes) | BD |
Encoding: CP280 (hex bytes) | BD |
Encoding: CP284 (hex bytes) | A1 |
Encoding: CP285 (hex bytes) | BD |
Encoding: CP297 (hex bytes) | A1 |
Encoding: CP424 (hex bytes) | BD |
Encoding: CP500 (hex bytes) | BD |
Encoding: CP500MS (hex bytes) | BD |
Encoding: CP870 (hex bytes) | BD |
Encoding: CP871 (hex bytes) | BD |
Encoding: CP875 (hex bytes) | 70 |
LATEX | \textasciidieresis |
AGL: Latin-1 | dieresis |
AGL: Latin-2 | dieresis |
AGL: Latin-3 | dieresis |
AGL: Latin-4 | dieresis |
AGL: Latin-5 | dieresis |
Adobe Glyph List | dieresis |
digraph | ': |
Related Characters
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
DIAERESIS | |
SPACING DIAERESIS | |
Latin-1 Supplement | |
Modifier Symbol | |
Common | |
Other Neutral | |
Not Reordered | |
compatibility | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
No | |
|
|
No | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✔ | |
|
|
None | |
ambiguous | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Ambiguous (Alphabetic or Ideographic) | |
none | |
not a number | |
|
|
R |