U+00A8 Diaeresis
U+00A8 was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Modifier Symbol and is commonly used, that is, in no specific script.
The glyph is a compatibility composition of the glyphs
The CLDR project calls this character “diaeresis” for use in screen reading software. It assigns these additional labels, e.g. for search in emoji pickers: tréma, umlaut.
The Wikipedia has the following information about this codepoint:
Diacritical marks of two dots ¨, placed side-by-side over or under a letter, are used in a number of languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, ë represents a schwa. Such diacritics are also sometimes used for stylistic reasons (as in the family name Brontë or the band name Mötley Crüe).
In modern computer systems using Unicode, the two-dot diacritics are almost always encoded identically, having the same code point. For example, U+00E4 ä LATIN SMALL LETTER A WITH DIAERESIS represents both a-umlaut and a-diaeresis. Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.
Representations
System | Representation |
---|---|
Nº | 168 |
UTF-8 | C2 A8 |
UTF-16 | 00 A8 |
UTF-32 | 00 00 00 A8 |
URL-Quoted | %C2%A8 |
HTML hex reference | ¨ |
Wrong windows-1252 Mojibake | ◌¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
HTML named entity | ¨ |
Encoding: EUC-KR (hex bytes) | A1 A7 |
Encoding: ISO-8859-2 (hex bytes) | A8 |
Encoding: ISO-8859-3 (hex bytes) | A8 |
Encoding: ISO-8859-4 (hex bytes) | A8 |
Encoding: ISO-8859-7 (hex bytes) | A8 |
Encoding: ISO-8859-8 (hex bytes) | A8 |
Encoding: JIS0208 (hex bytes) | A1 AF |
Encoding: MACINTOSH (hex bytes) | AC |
Encoding: WINDOWS-1250 (hex bytes) | A8 |
Encoding: WINDOWS-1252 (hex bytes) | A8 |
Encoding: WINDOWS-1253 (hex bytes) | A8 |
Encoding: WINDOWS-1254 (hex bytes) | A8 |
Encoding: WINDOWS-1255 (hex bytes) | A8 |
Encoding: WINDOWS-1256 (hex bytes) | A8 |
Encoding: WINDOWS-1257 (hex bytes) | 8D |
Encoding: WINDOWS-1258 (hex bytes) | A8 |
LATEX | \textasciidieresis |
AGL: Latin-1 | dieresis |
AGL: Latin-2 | dieresis |
AGL: Latin-3 | dieresis |
AGL: Latin-4 | dieresis |
AGL: Latin-5 | dieresis |
Adobe Glyph List | dieresis |
digraph | ': |
Related Characters
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
DIAERESIS | |
SPACING DIAERESIS | |
Latin-1 Supplement | |
Modifier Symbol | |
Common | |
Other Neutral | |
Not Reordered | |
compatibility | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
No | |
|
|
No | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✔ | |
|
|
None | |
ambiguous | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Ambiguous (Alphabetic or Ideographic) | |
none | |
not a number | |
|
|
R |