
U+034F Combining Grapheme Joiner

U+034F was added to Unicode in version 3.2 (2002). It belongs to the block U+0300 to U+036F Combining Diacritical Marks in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Nonspacing Mark and inherits its script property from the preceding character.

The character is not a composition. Its width in East Asian text is determined by context: it can be wide or narrow. In bidirectional text it acts as a Nonspacing Mark. It is not mirrored when the text direction changes. U+034F prohibits a line break before or after itself.
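
The properties described above can be checked with Python's standard `unicodedata` module. The following is a minimal sketch, not part of the original page: the helper name `describe` is hypothetical, and the reported values depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

CGJ = "\u034f"  # COMBINING GRAPHEME JOINER


def describe(ch: str) -> dict:
    """Collect a few Unicode Character Database properties for one character."""
    return {
        "name": unicodedata.name(ch),
        "general_category": unicodedata.category(ch),          # "Mn" = Nonspacing Mark
        "bidi_class": unicodedata.bidirectional(ch),           # "NSM" = Nonspacing Mark
        "combining_class": unicodedata.combining(ch),          # 0 = Not Reordered
        "east_asian_width": unicodedata.east_asian_width(ch),  # "A" = ambiguous
        "mirrored": bool(unicodedata.mirrored(ch)),            # False = not mirrored
    }


if __name__ == "__main__":
    for key, value in describe(CGJ).items():
        print(f"{key}: {value}")
```

Line-breaking behavior and the inherited script are not exposed by `unicodedata`, so this sketch does not cover them.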

Wikipedia has the following information about this codepoint:

The combining grapheme joiner (CGJ), U+034F ͏ COMBINING GRAPHEME JOINER is a Unicode character that has no visible glyph and is "default ignorable" by applications. Its name is a misnomer and does not describe its function: the character does not join graphemes. Its purpose is to semantically separate characters that should not be considered digraphs as well as to block canonical reordering of combining marks during normalization.

For example, in a Hungarian language context, adjoining letters c and s would normally be considered equivalent to the cs digraph. If they are separated by the CGJ, they will be considered as two separate graphemes. However, in contrast to the zero-width joiner and similar characters, the CGJ does not affect whether the two letters are rendered separately or as a ligature or cursively joined—the default behavior for this is determined by the font.

The CGJ is also needed for complex scripts. For example, in most cases the Hebrew cantillation accent metheg is supposed to appear to the left of the vowel point, and by default most display systems will render it like this even if it is typed before the vowel. But in some words in Biblical Hebrew the metheg appears to the right of the vowel, and to tell the display engine to render it properly on the right, CGJ must be typed between the metheg and the vowel.

In the case of several consecutive combining diacritics, an intervening CGJ indicates that they should not be subject to canonical reordering.
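
The effect on canonical reordering can be observed with Python's `unicodedata.normalize`. This is an illustrative sketch only: the base letter and the two combining marks (dot above and dot below) are chosen here as an assumption for the example and do not come from the text above.

```python
import unicodedata

CGJ = "\u034f"        # COMBINING GRAPHEME JOINER, combining class 0
DOT_ABOVE = "\u0307"  # COMBINING DOT ABOVE, combining class 230
DOT_BELOW = "\u0323"  # COMBINING DOT BELOW, combining class 220


def codepoints(text: str) -> str:
    """Render a string as a space-separated list of U+XXXX codepoints."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


plain = "a" + DOT_ABOVE + DOT_BELOW
guarded = "a" + DOT_ABOVE + CGJ + DOT_BELOW

# Without CGJ, NFD sorts adjacent marks by combining class (220 before 230).
nfd_plain = unicodedata.normalize("NFD", plain)

# CGJ has combining class 0, so it splits the marks into separate runs
# and the original order survives normalization.
nfd_guarded = unicodedata.normalize("NFD", guarded)

print(codepoints(nfd_plain))    # U+0061 U+0323 U+0307
print(codepoints(nfd_guarded))  # U+0061 U+0307 U+034F U+0323
```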

In contrast, the "zero-width non-joiner" (at U+200C in the General Punctuation range) prevents two adjacent characters from turning into a ligature.

Representations

System Representation
No. 847
UTF-8 CD 8F
UTF-16 03 4F
UTF-32 00 00 03 4F
URL-encoded %CD%8F
HTML hex reference &#x034F;
Wrong windows-1252 Mojibake ◌͏
Abbreviation CGJ
Encoding: GB18030 (hex bytes) 81 30 C4 35
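
Most of the representations listed above can be reproduced with the Python standard library. This is a sketch under a few assumptions: the helper `hex_bytes` and the dictionary name `representations` are hypothetical, and the big-endian codecs are used so that no byte order mark is emitted.

```python
from urllib.parse import quote

CGJ = "\u034f"  # COMBINING GRAPHEME JOINER


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{b:02X}" for b in data)


representations = {
    "No.": ord(CGJ),                                   # 847
    "UTF-8": hex_bytes(CGJ.encode("utf-8")),           # CD 8F
    "UTF-16": hex_bytes(CGJ.encode("utf-16-be")),      # 03 4F
    "UTF-32": hex_bytes(CGJ.encode("utf-32-be")),      # 00 00 03 4F
    "URL-encoded": quote(CGJ),                         # %CD%8F
    "HTML hex reference": f"&#x{ord(CGJ):04X};",       # &#x034F;
    "GB18030": hex_bytes(CGJ.encode("gb18030")),       # 81 30 C4 35
}

for system, value in representations.items():
    print(f"{system}: {value}")
```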

Complete Record

Property Value
Age (age) 3.2 (2002)
Unicode Name (na) COMBINING GRAPHEME JOINER
Unicode 1 Name (na1)
Block (blk) Combining Diacritical Marks
General Category (gc) Nonspacing Mark
Script (sc) Inherited
Bidirectional Category (bc) Nonspacing Mark
Combining Class (ccc) Not Reordered
Decomposition Type (dt) none
Decomposition Mapping (dm) U+034F Combining Grapheme Joiner
Lowercase (Lower)
Simple Lowercase Mapping (slc) U+034F Combining Grapheme Joiner
Lowercase Mapping (lc) U+034F Combining Grapheme Joiner
Uppercase (Upper)
Simple Uppercase Mapping (suc) U+034F Combining Grapheme Joiner
Uppercase Mapping (uc) U+034F Combining Grapheme Joiner
Simple Titlecase Mapping (stc) U+034F Combining Grapheme Joiner
Titlecase Mapping (tc) U+034F Combining Grapheme Joiner
Case Folding (cf) U+034F Combining Grapheme Joiner
ASCII Hex Digit (AHex)
Alphabetic (Alpha)
Bidi Control (Bidi_C)
Bidi Mirrored (Bidi_M)
Composition Exclusion (CE)
Case Ignorable (CI)
Changes When Casefolded (CWCF)
Changes When Casemapped (CWCM)
Changes When NFKC Casefolded (CWKCF)
Changes When Lowercased (CWL)
Changes When Titlecased (CWT)
Changes When Uppercased (CWU)
Cased (Cased)
Full Composition Exclusion (Comp_Ex)
Default Ignorable Code Point (DI)
Dash (Dash)
Deprecated (Dep)
Diacritic (Dia)
Emoji Modifier Base (EBase)
Emoji Component (EComp)
Emoji Modifier (EMod)
Emoji Presentation (EPres)
Emoji (Emoji)
Extender (Ext)
Extended Pictographic (ExtPict)
FC NFKC Closure (FC_NFKC) U+034F Combining Grapheme Joiner
Grapheme Cluster Break (GCB) Extend
Grapheme Base (Gr_Base)
Grapheme Extend (Gr_Ext)
Grapheme Link (Gr_Link)
Hex Digit (Hex)
Hyphen (Hyphen)
ID Continue (IDC)
ID Start (IDS)
IDS Binary Operator (IDSB)
IDS Trinary Operator (IDST)
IDSU (IDSU) 0
ID_Compat_Math_Continue (ID_Compat_Math_Continue) 0
ID_Compat_Math_Start (ID_Compat_Math_Start) 0
Ideographic (Ideo)
InCB (InCB) Extend
Indic Matra Category (InMC)
Indic Positional Category (InPC) NA
Indic Syllabic Category (InSC) Other
Jamo Short Name (JSN)
Join Control (Join_C)
Logical Order Exception (LOE)
Modifier Combining Mark (MCM)
Math (Math)
Noncharacter Code Point (NChar)
NFC Quick Check (NFC_QC) Yes
NFD Quick Check (NFD_QC) Yes
NFKC Quick Check (NFKC_QC) Yes
NFKD Quick Check (NFKD_QC) Yes
Other Alphabetic (OAlpha)
Other Default Ignorable Code Point (ODI)
Other Grapheme Extend (OGr_Ext)
Other ID Continue (OIDC)
Other ID Start (OIDS)
Other Lowercase (OLower)
Other Math (OMath)
Other Uppercase (OUpper)
Prepended Concatenation Mark (PCM)
Pattern Syntax (Pat_Syn)
Pattern White Space (Pat_WS)
Quotation Mark (QMark)
Regional Indicator (RI)
Radical (Radical)
Sentence Break (SB) Extend
Soft Dotted (SD)
Sentence Terminal (STerm)
Terminal Punctuation (Term)
Unified Ideograph (UIdeo)
Variation Selector (VS)
Word Break (WB) Extend
White Space (WSpace)
XID Continue (XIDC)
XID Start (XIDS)
Expands On NFC (XO_NFC)
Expands On NFD (XO_NFD)
Expands On NFKC (XO_NFKC)
Expands On NFKD (XO_NFKD)
Bidi Paired Bracket (bpb) U+034F Combining Grapheme Joiner
Bidi Paired Bracket Type (bpt) None
East Asian Width (ea) ambiguous
Hangul Syllable Type (hst) Not Applicable
ISO 10646 Comment (isc)
Joining Group (jg) No_Joining_Group
Joining Type (jt) Transparent
Line Break (lb) Non-breaking (“Glue”)
Numeric Type (nt) none
Numeric Value (nv) not a number
Simple Case Folding (scf) U+034F Combining Grapheme Joiner
Script Extension (scx)
Vertical Orientation (vo) R
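
Several entries in this record say that the character maps to itself: the case mappings, the decomposition mapping, and the four normalization quick checks. The sketch below shows one way to confirm this with Python's standard library. The variable names are hypothetical, and Python's `str.casefold` implements full case folding, which is assumed here to stand in for the Case Folding (cf) entry.

```python
import unicodedata

CGJ = "\u034f"  # COMBINING GRAPHEME JOINER

# Lowercase, uppercase, titlecase and case folding all leave CGJ unchanged.
unchanged_by_case = all(
    convert(CGJ) == CGJ
    for convert in (str.lower, str.upper, str.title, str.casefold)
)

# An empty decomposition string means there is no decomposition mapping.
has_decomposition = unicodedata.decomposition(CGJ) != ""

# A lone CGJ is already in every normalization form.
stable_under = {
    form: unicodedata.normalize(form, CGJ) == CGJ
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# No numeric value is assigned, so the supplied default (None) is returned.
numeric_value = unicodedata.numeric(CGJ, None)

print(unchanged_by_case)   # True
print(has_decomposition)   # False
print(stable_under)
print(numeric_value)       # None
```

Properties such as Grapheme Cluster Break, Word Break, and Line Break are not available in `unicodedata`; checking them would need a third-party library, which this sketch deliberately avoids.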