U+3001 Ideographic Comma
U+3001 was added in Unicode version 1.1 in 1993. It belongs to the block
This character is a Other Punctuation and is commonly used, that is, in no specific script. It is also used in the scripts Bopomofo, Hangul, Han, Hiragana, Katakana, Yi.
The glyph is not a composition. Its East Asian Width is wide. In bidirectional text it acts as Other Neutral. When changing direction it is not mirrored. It will not end a sentence. U+3001 prohibits a line break before it.
The CLDR project calls this character “ideographic comma” for use in screen reading software. It assigns these additional labels, e.g. for search in emoji pickers: comma, ideographic.
The Wikipedia has the following information about this codepoint:
The comma , is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark (’) in many typefaces, but it differs from them in being placed on the baseline of the text. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical. Other fonts give it the appearance of a miniature filled-in figure 9 on the baseline.
The comma is used in many contexts and languages, mainly to separate parts of a sentence such as clauses, and items in lists mainly when there are three or more items listed. The word comma comes from the Greek κόμμα (kómma), which originally meant a cut-off piece, specifically in grammar, a short clause.
A comma-shaped mark is used as a diacritic in several writing systems and is considered distinct from the cedilla. In Byzantine and modern copies of Ancient Greek, the "rough" and "smooth breathings" (ἁ, ἀ) appear above the letter. In Latvian, Romanian, and Livonian, the comma diacritic appears below the letter, as in ș.
In spoken language, a common rule of thumb is that the function of a comma is generally performed by a pause.
In this article, ⟨x⟩ denotes a grapheme (writing) and /x/ denotes a phoneme (sound).
Representations
System | Representation |
---|---|
Nº | 12289 |
UTF-8 | E3 80 81 |
UTF-16 | 30 01 |
UTF-32 | 00 00 30 01 |
URL-Quoted | %E3%80%81 |
HTML hex reference | 、 |
Wrong windows-1252 Mojibake | 〠|
Encoding: EUC-KR (hex bytes) | A1 A2 |
Encoding: JIS0208 (hex bytes) | A1 A2 |
Adobe Glyph List | ideographiccomma |
digraph | ,_ |
Related Characters
Elsewhere
Complete Record
Property | Value |
---|---|
1.1 (1993) | |
IDEOGRAPHIC COMMA | |
— | |
CJK Symbols and Punctuation | |
Other Punctuation | |
Common | |
Other Neutral | |
Not Reordered | |
none | |
|
|
✘ | |
|
|
|
|
✘ | |
|
|
|
|
|
|
|
|
|
|
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
Any | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
0 | |
0 | |
0 | |
✘ | |
None | |
— | |
NA | |
Other | |
— | |
✘ | |
✘ | |
✘ | |
✘ | |
Yes | |
Yes | |
|
|
Yes | |
|
|
Yes | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
✘ | |
✘ | |
Sentence Continue | |
✘ | |
✘ | |
✔ | |
✘ | |
✘ | |
Other | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
✘ | |
|
|
None | |
wide | |
Not Applicable | |
— | |
No_Joining_Group | |
Non Joining | |
Close Punctuation | |
none | |
not a number | |
|
|
Bopomofo Hangul Han Hiragana Katakana Yi | |
Tu |