The Chinese or Sinitic language family includes a number of related lects which have very similar written forms, but different grammar, vocabulary and especially pronunciation. On Wiktionary, these lects are treated under the header ==Chinese== and the language code zh unless they natively use a non-Chinese script.

Key points

  • The various varieties of Chinese are subsumed under the header ==Chinese== and the language code zh (vote).
    The exception is when the variety of Chinese natively uses a non-Chinese script, e.g. Dungan. The layout of these entries is currently undecided.
  • A Traditional Chinese form of a Chinese word, usually the most commonly used Traditional Chinese form, is chosen as the lemma (vote).
    All other forms of exactly the same word should soft-redirect to the lemma form using {{zh-see}}.
  • Terms are defined in relation to Modern Standard Written Chinese.
    Senses limited to the literary language, certain dialects or certain regions should be marked accordingly using label ({{lb}}, in definitions) and qualifier ({{q}}, elsewhere) tags.
    For example, 簡訊 / 简讯 (jiǎnxùn) uses {{lb|zh|chiefly|Taiwan}} and 煠熟狗頭 / 煠熟狗头 uses {{lb|zh|Cantonese}} to show that these terms are mainly used in Taiwan and exclusively Cantonese, respectively.

Entry format

Chinese entries should follow format guidelines in Entry layout. 朋友 is a good example of how Chinese entries should ideally be formatted.

{{zh-new}} can be used to accelerate creation of entries.

Unless the entry is of a variant or simplified spelling, templates that are almost always obligatory are {{zh-forms}} and {{zh-pron}}.

Some useful Chinese-specific templates include {{zh-l}} and {{zh-x}}.

Variants and simplified forms use {{zh-see}} to redirect to the standard traditional form.

is a good example of how Chinese character entries should ideally be formatted.

Basic headers for single characters

Glyph origin
Describes how the character obtained its current shape. Templates used include {{Han etym}} and {{Han compound}}.
A character is not always of Chinese origin; see (shí) for an example of what to do in this case.
Etymology
Describes the origin of the character's pronunciation(s). (Or, used to host a {{zh-see}} box; see documentation for details and for an example.)
Note that Old Chinese and Middle Chinese have been subsumed under "Chinese", so indicating that Mandarin (hēi) is inherited from Old Chinese (*hmlɯːɡ /⁠*hmlɯːɡ⁠/) or Middle Chinese (xok) is redundant. However, indicating that a character is derived from a different character is fine, such as with (yòu) (from Old Chinese ).
It is preferred that an entry is split by etymology per Old Chinese and Middle Chinese ancestor.
Pronunciation
Hosts {{zh-pron}}; see documentation for details.
The |cat= parameter is responsible for sorting entries into categories such as Category:Chinese verbs, Category:Mandarin nouns, and Category:Cantonese chengyu and should be filled out when reasonable.
Definitions
Hosts definitions. Rationale for using a "Definitions" header instead of "Noun", "Verb", or other more specific part of speech headers can be found here.
The {{head|zh|hanzi}} template is found directly under this header as the headword template. It has little practical value but is currently part of the standard Wiktionary entry format.

Basic headers for words

Glyph origin
This is usually not needed unless it is convenient to discuss it together (like 彳亍 (chìchù), 旮旯 (gālá)).
Etymology
Etymologies for words may show change of pronunciations or spellings, and semantic change (change in meaning). However, it is not necessary to duplicate any content from {{zh-forms}}. For simple figurative or idiomatic meaning, the |lit= parameter of {{zh-forms}} may be used to show the literal meaning.
Pronunciation
This section is the same as the one for single character entries.
Definitions
Definitions for words should be defined using specific part of speech headers, and appropriate headword template. For noun entries, use {{zh-mw}} to indicate classifiers for each sense and categorize the entry by classifiers; classifiers in the headword template is deprecated.

Other templates

  • {{zh-forms}}: found either at the top of an entry or under an Etymology header. See documentation for details.

Headword-line templates

  • Wiktionary previously had several headword-line templates specifically for Chinese, but all have been deprecated except for {{zh-verb}}, used for marking morphological structure. In most cases, you can simply use {{head}}.

About specific lects

This section lists all Chinese dialects supported and proposed to be supported. Dialect names in italics are currently not supported (in to-do list); other content in italics is tentative and needs further discussion.

Subgroup
of
Chinese
Variety chosen Romanisation Help page Romanisation allowed as entries? Tone sandhi
Mandarin Standard Mandarin
(Beijing dialect/Taiwan Mandarin)
Hanyu Pinyin Allowed for (all as as non-lemmas, votes: 1, 2):
  • monosyllables with diacritics (zhāng)
  • monosyllables with tone numbers (zhang1)
  • monosyllables with no tone mark (zhang)
  • polysyllables with diacritics (yánlì)
Only original tones are indicated in Pinyin, e.g. consecutive third tones are shown as third tones. Phonetic pinyin is shown in the expanded mode on tone sandhi with characters and when the actual tone differs from the nominal, e.g. 一定 (yīdìng) [phonetic: yídìng], 不過不过 (bùguò) [phonetic: búguò]
Sichuanese (Chengdu dialect) Sichuanese Pinyin help No Indicated by hyphen -.
Xi'an (Xi'an dialect) Modified Hanyu Pinyin help No See the romanisation page.
Wuhannese (Wuhan dialect) Modified Hanyu Pinyin,
based on that of the Longren dictionary
No Indicated by hyphen - ?
Nankinese (Nanjing dialect) Modified Hanyu Pinyin help No Original tones are displayed.
Dungan Use Dungan term in Cyrillic script. help Allow for terms in Cyrillic script and Xiao'erjing as lemmas Indicated by hyphen -.
Northeastern Mandarin (Harbin or Shenyang dialect) Hanyu Pinyin to be determined to be determined
Cantonese Standard Cantonese
(Guangzhou dialect/Hong Kong dialect)
Jyutping help Allowed for monosyllables as non-lemmas (vote). Changed tones are indicated by hyphen -.
Dongguan
(Guancheng dialect)
adapted Jyutping++ help No See the romanisation page.
Taishanese
(Taicheng dialect)
Wiktionary help No See the romanisation page.
Yangjiang
(Jiangcheng dialect)
adapted Jyutping++ help No Does not exist.
Gan Nanchang dialect Wiktionary help No
Hakka Sixian dialect
(north and south)
Pha̍k-fa-sṳ help No Original tones are displayed.
Hailu dialect Hakka Romanization System No Original tones are displayed.
Meixian dialect Guangdong Romanization help No Original tones are displayed.
Tingzhou dialect to be determined No
Huizhou to be determined to be determined help No
Jin Taiyuan dialect Wiktionary help No
Central Min Yong'an dialect to be determined, but an in-house transliteration system could be similar to Kienning Colloquial Romanized help No to be determined
Eastern Min Fuzhou dialect Bàng-uâ-cê help No
Fu'an dialect to be determined, but an in-house transliteration system could be similar to Bàng-uâ-cê No to be determined
Northern Min Jian'ou dialect Kienning Colloquial Romanized help No
Puxian Min Putian dialect,
Xianyou dialect
Pouseng Ping'ing, Hinghwa Romanized help No See the romanisation page.
Southern Min Datian Min to be determined No
Hainanese
(multiple dialects)
to be determined help No to be determined
Hokkien
(multiple dialects)
Pe̍h-ōe-jī help Allowed as lemmas (e.g. put-khó-su-gī) Only original tones are used.
Leizhou Min Leizhou Pinyin No
Teochew
(multiple dialects)
Guangdong Ministry of Education's
Teochew Romanization Scheme
(not Gaginang's Peng'im)
No
Shaojiang Shaowu dialect to be determined, but an in-house transliteration system could be similar to Kienning Colloquial Romanized No to be determined
Northern Pinghua to be determined to be determined No
Southern Pinghua Nanning Pinghua
(Tingzi dialect)
adapted Jyutping++ help No Does not exist.
Wu Northern Wu
(multiple dialects)
Wugniu help No See WT:AZH/Wu#Tones
Jinhua dialect Wugniu No
Taizhou dialect Wugniu No
Wenzhou dialect (Lucheng dialect) Wugniu help No
Xiang Changsha dialect Wiktionary help No
Loudi dialect Wiktionary help No
Hengyang dialect Wiktionary help No

Category sorting

In general, categories should be sorted by radical–stroke sortkeys produced by Module:Hani-sortkey.

Chinese characters

Chinese characters should not be conflated with Chinese words or morphemes. General information about the characters themselves belong in the Translingual section, which appears before all other sections. See Wiktionary:About Chinese characters for discussion of its format.

Additional help

Help from the community

Sometimes, we know there is a problem, but don’t know what to do to correct the problem. If you should find a Chinese entry with a problem that you do not know how to correct, there are several ways to approach the situation.

  1. Mark the page with {{attention}}. This template adds the entry to a cleanup category, where another user can then find and correct the problem. It helps if you include comments explaining what the problem is or why you think the page needs attention.
  2. Raise the issue on Wiktionary talk:About Chinese. Note that this approach is primarily for issues of style, formatting, categorization, and not for specifics of content.
  3. Mark the page with {{rfc}}. This is a more general cleanup tag, and it allows the user to include reasons or concerns as an argument in the template. Be sure to also add an entry to WT:RFC concerning the word so that other editors will be made aware of the problem.

Translations into Chinese lects

  • All translations into Chinese lects must be grouped under * Chinese. Subdialects can be sub-nested. Regional variations can be flagged with {{qualifier}}.
* Chinese:
*: Hokkien: {{t|nan-hbl|雪文|tr=sat-bûn}} {{qualifier|Zhangzhou}}, {{t|nan-hbl|茶塊|tr=tê-kóe}} {{qualifier|Quanzhou}} ...
  • Mandarin: 肥皂 (féizào)
  • The traditional precedes the simplified version if they are different and the transliteration is provided with the simplified version. All Chinese varieties need both traditional and simplified forms.
* Chinese:
*: Mandarin: {{t|cmn|心理學}}, {{t|cmn|心理学|tr=xīnlǐxué}}
  • If a translation is both simplified and traditional, only one translation is given.
* Chinese:
*: Mandarin: {{t|cmn|三明治|tr=sānmíngzhì}}

Current tasks

Other Chinese aids

Selected previous discussions

  NODES
COMMUNITY 2
Idea 2
idea 2
Note 3
Project 1