U+2010 Hyphen
U+2010 was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Dash Punctuation and is commonly used, that is, in no specific script.
The glyph is not a composition. Its width in East Asian texts is determined by its context. It can be displayed wide or narrow. In bidirectional text it acts as Other Neutral. When changing direction it is not mirrored. U+2010 offers a line break opportunity after its position. The glyph can be confused with one other glyph.
The CLDR project calls this character “hyphen” for use in screen reading software. It assigns these additional labels, e.g. for search in emoji pickers: dash.
The Wikipedia has the following information about this codepoint:
The hyphen ‐ is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. Son-in-law is an example of a hyphenated word.
The hyphen is sometimes confused with dashes (en dash –, em dash — and others), which are wider, or with the minus sign −, which is also wider and usually drawn a little higher to match the crossbar in the plus sign +.
As an orthographic concept, the hyphen is a single entity. In character encoding for use with computers, it is represented in Unicode by any of several characters. These include the dual-use hyphen-minus, the soft hyphen, the nonbreaking hyphen, and an unambiguous form known familiarly as the "Unicode hyphen". The character most often used to represent a hyphen (and the one produced by the key on a keyboard) is called the "hyphen-minus" by Unicode, deriving from the original ASCII standard, where it was called "hyphen (minus)".
Representations
System | Representation |
---|---|
Nº | 8208 |
UTF-8 | E2 80 90 |
UTF-16 | 20 10 |
UTF-32 | 00 00 20 10 |
URL-Quoted | %E2%80%90 |
HTML hex reference | ‐ |
Wrong windows-1252 Mojibake | †|
HTML named entity | ‐ |
HTML named entity | ‐ |
Encoding: JIS0208 (hex bytes) | A1 BE |
LATEX | - |
AGL: Latin-4 | uni2010 |
AGL: Latin-5 | uni2010 |
Adobe Glyph List | hyphentwo |
digraph | -1 |
Related Characters
Confusables
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
HYPHEN | |
— | |
General Punctuation | |
Dash Punctuation | |
Common | |
Other Neutral | |
Not Reordered | |
none | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Consonant_Placeholder | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
Yes | |
|
|
Yes | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
None | |
ambiguous | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Break After | |
none | |
not a number | |
|
|
R |