Home CJK Unified Ideograph-8335 U+4E00 to U+9FFF CJK Unified Ideographs CJK Unified Ideograph-8337

Source: Noto CJK

U+8336 CJK Unified Ideograph-8336

U+8336 was added in Unicode version 1.1 in 1993. It belongs to the block U+4E00 to U+9FFF CJK Unified Ideographs in the U+0000 to U+FFFF Basic Multilingual Plane.

This character is a Other Letter and is mainly used in the Han script. The Unihan Database defines it as tea. Its Pīnyīn pronunciation is chá.

The glyph is not a composition. Its East Asian Width is wide. In bidirectional text it is written from left to right. When changing direction it is not mirrored. U+8336 offers a line break opportunity at its position, except in some numeric contexts. The glyph can be confused with one other glyph.

The Wikipedia has the following information about this codepoint:

The etymology of the various Chinese words for the English word tea reflects the history of transmission of tea drinking culture and trade from China to countries around the world. In this context, tea generally refers to the plant Camellia sinensis and/or the aromatic beverage prepared by pouring hot boiling water over the leaves. Nearly all of the words for tea worldwide fall into three broad groups: te, cha and chai, present in English as tea, cha or char, and chai. The earliest of the three to enter English is cha, which came in the 1590s via the Portuguese, who traded in Macao and picked up the Cantonese pronunciation of the word. The more common tea form arrived in the 17th century via the Dutch, who acquired it either indirectly from the Malay teh, or directly from the tê pronunciation in Min Chinese. The third form chai (meaning "spiced tea") originated from a northern Chinese pronunciation of cha, which travelled overland to Central Asia and Persia where it picked up a Persian ending yi, and entered English via Hindustani in the 20th century.
The different regional pronunciations of the word in China are believed to have arisen from the same root, which diverged due to sound changes through the centuries. The written form of the Chinese word for tea was created in the mid-Tang dynasty by modifying the character 荼 pronounced tu, meaning a "bitter vegetable". Tu was used to refer to a variety of plants in ancient China, and acquired the additional meaning of "tea" by the Han dynasty. The Chinese word for tea was likely ultimately derived from the non-Sinitic languages of the botanical homeland of the tea plant in southwest China (or Burma), possibly from an archaic Austro-Asiatic root word *la, meaning "leaf".

Representations

System	Representation
Nº	33590
UTF-8	E8 8C B6
UTF-16	83 36
UTF-32	00 00 83 36
URL-Quoted	%E8%8C%B6
HTML hex reference	茶
Wrong windows-1252 Mojibake	èŒ¶
Encoding: BIG5 (hex bytes)	AF F9
Encoding: BIG5HKSCS (hex bytes)	AF F9
Encoding: CP932 (hex bytes)	92 83
Encoding: CP949 (hex bytes)	D2 FE
Encoding: CP950 (hex bytes)	AF F9
Encoding: EUC_JP (hex bytes)	C3 E3
Encoding: EUC_JIS_2004 (hex bytes)	C3 E3
Encoding: EUC_JISX0213 (hex bytes)	C3 E3
Encoding: EUC_KR (hex bytes)	D2 FE
Encoding: GB2312 (hex bytes)	B2 E8
Encoding: GBK (hex bytes)	B2 E8
Encoding: GB18030 (hex bytes)	B2 E8
Encoding: HZ (hex bytes)	7E 7B 32 68 7E 7D
Encoding: ISO2022_JP (hex bytes)	1B 24 42 43 63 1B 28 42
Encoding: ISO2022_JP_1 (hex bytes)	1B 24 42 43 63 1B 28 42
Encoding: ISO2022_JP_2 (hex bytes)	1B 24 42 43 63 1B 28 42
Encoding: ISO2022_JP_2004 (hex bytes)	1B 24 42 43 63 1B 28 42
Encoding: ISO2022_JP_3 (hex bytes)	1B 24 42 43 63 1B 28 42
Encoding: ISO2022_JP_EXT (hex bytes)	1B 24 42 43 63 1B 28 42
Encoding: ISO2022_KR (hex bytes)	1B 24 29 43 0E 52 7E 0F
Encoding: JOHAB (hex bytes)	E4 A0
Encoding: SHIFT_JIS (hex bytes)	92 83
Encoding: SHIFT_JIS_2004 (hex bytes)	92 83
Encoding: SHIFT_JISX0213 (hex bytes)	92 83
Pīnyīn	chá

Related Characters

CJK Compatibility Ideograph-F9fe

Confusables

CJK Compatibility Ideograph-F9fe

Elsewhere

Complete Record

Property	Value
Age (age)	1.1 (1993)
Unicode Name (na)	CJK UNIFIED IDEOGRAPH-8336
Unicode 1 Name (na1)	—
Block (blk)	CJK Unified Ideographs
General Category (gc)	Other Letter
Script (sc)	Han
Bidirectional Category (bc)	Left To Right
Combining Class (ccc)	Not Reordered
Decomposition Type (dt)	none
Decomposition Mapping (dm)	CJK Unified Ideograph-8336
Lowercase (Lower)	✘
Simple Lowercase Mapping (slc)	CJK Unified Ideograph-8336
Lowercase Mapping (lc)	CJK Unified Ideograph-8336
Uppercase (Upper)	✘
Simple Uppercase Mapping (suc)	CJK Unified Ideograph-8336
Uppercase Mapping (uc)	CJK Unified Ideograph-8336
Simple Titlecase Mapping (stc)	CJK Unified Ideograph-8336
Titlecase Mapping (tc)	CJK Unified Ideograph-8336
Case Folding (cf)	CJK Unified Ideograph-8336
ASCII Hex Digit (AHex)	✘
Alphabetic (Alpha)	✔
Bidi Control (Bidi_C)	✘
Bidi Mirrored (Bidi_M)	✘
Composition Exclusion (CE)	✘
Case Ignorable (CI)	✘
Changes When Casefolded (CWCF)	✘
Changes When Casemapped (CWCM)	✘
Changes When NFKC Casefolded (CWKCF)	✘
Changes When Lowercased (CWL)	✘
Changes When Titlecased (CWT)	✘
Changes When Uppercased (CWU)	✘
Cased (Cased)	✘
Full Composition Exclusion (Comp_Ex)	✘
Default Ignorable Code Point (DI)	✘
Dash (Dash)	✘
Deprecated (Dep)	✘
Diacritic (Dia)	✘
Emoji Modifier Base (EBase)	✘
Emoji Component (EComp)	✘
Emoji Modifier (EMod)	✘
Emoji Presentation (EPres)	✘
Emoji (Emoji)	✘
Extender (Ext)	✘
Extended Pictographic (ExtPict)	✘
FC NFKC Closure (FC_NFKC)	CJK Unified Ideograph-8336
Grapheme Cluster Break (GCB)	Any
Grapheme Base (Gr_Base)	✔
Grapheme Extend (Gr_Ext)	✘
Grapheme Link (Gr_Link)	✘
Hex Digit (Hex)	✘
Hyphen (Hyphen)	✘
ID Continue (IDC)	✔
ID Start (IDS)	✔
IDS Binary Operator (IDSB)	✘
IDS Trinary Operator and (IDST)	✘
IDSU (IDSU)	0
ID_Compat_Math_Continue (ID_Compat_Math_Continue)	0
ID_Compat_Math_Start (ID_Compat_Math_Start)	0
Ideographic (Ideo)	✔
InCB (InCB)	None
Indic Mantra Category (InMC)	—
Indic Positional Category (InPC)	NA
Indic Syllabic Category (InSC)	Other
Jamo Short Name (JSN)	—
Join Control (Join_C)	✘
Logical Order Exception (LOE)	✘
Modifier Combining Mark (MCM)	✘
Math (Math)	✘
Noncharacter Code Point (NChar)	✘
NFC Quick Check (NFC_QC)	Yes
NFD Quick Check (NFD_QC)	Yes
NFKC Casefold (NFKC_CF)	CJK Unified Ideograph-8336
NFKC Quick Check (NFKC_QC)	Yes
NFKC_SCF (NFKC_SCF)	CJK Unified Ideograph-8336
NFKD Quick Check (NFKD_QC)	Yes
Other Alphabetic (OAlpha)	✘
Other Default Ignorable Code Point (ODI)	✘
Other Grapheme Extend (OGr_Ext)	✘
Other ID Continue (OIDC)	✘
Other ID Start (OIDS)	✘
Other Lowercase (OLower)	✘
Other Math (OMath)	✘
Other Uppercase (OUpper)	✘
Prepended Concatenation Mark (PCM)	✘
Pattern Syntax (Pat_Syn)	✘
Pattern White Space (Pat_WS)	✘
Quotation Mark (QMark)	✘
Regional Indicator (RI)	✘
Radical (Radical)	✘
Sentence Break (SB)	Other Letter
Soft Dotted (SD)	✘
Sentence Terminal (STerm)	✘
Terminal Punctuation (Term)	✘
Unified Ideograph (UIdeo)	✔
Variation Selector (VS)	✘
Word Break (WB)	Other
White Space (WSpace)	✘
XID Continue (XIDC)	✔
XID Start (XIDS)	✔
Expands On NFC (XO_NFC)	✘
Expands On NFD (XO_NFD)	✘
Expands On NFKC (XO_NFKC)	✘
Expands On NFKD (XO_NFKD)	✘
Bidi Paired Bracket (bpb)	CJK Unified Ideograph-8336
Bidi Paired Bracket Type (bpt)	None
East Asian Width (ea)	wide
Hangul Syllable Type (hst)	Not Applicable
ISO 10646 Comment (isc)	—
Joining Group (jg)	No_Joining_Group
Joining Type (jt)	Non Joining
Line Break (lb)	Ideographic
Numeric Type (nt)	none
Numeric Value (nv)	not a number
Simple Case Folding (scf)	CJK Unified Ideograph-8336
Script Extension (scx)
Vertical Orientation (vo)	U
Big Five Mapping (kBigFive)	AFF9
CCCII Mapping (kCCCII)	21547B
CNS 11643-1986 Mapping (kCNS1986)	1-577D
CNS 11643-1992 Mapping (kCNS1992)	1-577D
Cangjie Input Code (kCangjie)	TOD
kCantonese (kCantonese)	caa4
kCihaiT (kCihaiT)	1135.501
kCowles (kCowles)	409
kDaeJaweon (kDaeJaweon)	1488.010
Unihan Definition (kDefinition)	tea
kEACC (kEACC)	21547B
kFanqie (kFanqie)	宅加
kFenn (kFenn)	484A
kFennIndex (kFennIndex)	8.09
kFourCornerCode (kFourCornerCode)	4490.4
kGB0 (kGB0)	1872
kGB1 (kGB1)	1872
kGradeLevel (kGradeLevel)	1
kHKGlyph (kHKGlyph)	3447
kHanYu (kHanYu)	53207.020
kHangul (kHangul)	다:0E 차:0N
kHanyuPinlu (kHanyuPinlu)	chá(194)
kHanyuPinyin (kHanyuPinyin)	53207.020:chá
kIICore (kIICore)	AGTJHKMP
kIRGDaeJaweon (kIRGDaeJaweon)	1488.010
kIRGHanyuDaZidian (kIRGHanyuDaZidian)	53207.020
kIRGKangXi (kIRGKangXi)	1029.040
kIRG_GSource (kIRG_GSource)	G0-3268
kIRG_HSource (kIRG_HSource)	HB1-AFF9
kIRG_JSource (kIRG_JSource)	J0-4363
kIRG_KPSource (kIRG_KPSource)	KP0-D5AC
kIRG_KSource (kIRG_KSource)	K0-527E
kIRG_TSource (kIRG_TSource)	T1-577D
kIRG_VSource (kIRG_VSource)	V1-6475
kJapanese (kJapanese)	チャサタちゃ
kJapaneseKun (kJapaneseKun)	CHA
kJapaneseOn (kJapaneseOn)	CHA SA TA
kJis0 (kJis0)	3567
kJoyoKanji (kJoyoKanji)	2010
kKangXi (kKangXi)	1029.040
kKorean (kKorean)	TA CHA
kKoreanEducationHanja (kKoreanEducationHanja)	2007
kKoreanName (kKoreanName)	2015
kLau (kLau)	165
kMainlandTelegraph (kMainlandTelegraph)	5420
kMandarin (kMandarin)	chá
kMatthews (kMatthews)	101
kMeyerWempe (kMeyerWempe)	58
kMojiJoho (kMojiJoho)	MJ021834 MJ021834:E0101 MJ021835:E0102
kMorohashi (kMorohashi)	30915' 30915':E0101 30915:E0102
kNelson (kNelson)	3940
kPhonetic (kPhonetic)	14
Radical Stroke Count (Adobe Japan 1-6) (kRSAdobe_Japan1_6)	C+2977+140.3.6
Radical Stroke Count (Unicode) (kRSUnicode)	140.6
kSBGY (kSBGY)	168.33
kSMSZD2003Index (kSMSZD2003Index)	579.04
kSMSZD2003Readings (kSMSZD2003Readings)	chá粵caa4
kTGH (kTGH)	2013:1411
kTGHZ2013 (kTGHZ2013)	034.120:chá
Taiwanese Telegraph Code (kTaiwanTelegraph)	5420
kTang (kTang)	djha
Stroke Number (kTotalStrokes)	9
UnihanCore2020 Set (kUnihanCore2020)	GHJKMPT
Quốc ngữ Pronunciation (kVietnamese)	trà
kXHC1983 (kXHC1983)	0112.050:chá
Xerox Code (kXerox)	244:075