Pinyin Conversion Tool
Online Chinese to Pinyin converter with tone marks, tone numbers, and no-tone modes
What is Chinese to Pinyin?
The Pinyin Converter turns Chinese characters into Hanyu Pinyin, the official romanization system for Standard Mandarin. Pinyin makes pronunciation, tones, and syllable structure visible for language learning, dictionaries, teaching material, name transliteration, sorting, input methods, and data preparation. This tool can display tones as diacritic marks, tone numbers, or no tones, and it can adjust separators and capitalization for different workflows. The main limitation is ambiguity: many Chinese characters have multiple readings, and the correct pronunciation often depends on word context. Names, technical terms, dialectal words, and historical texts should therefore be reviewed rather than converted blindly.
How to Use
How to use
- Paste or enter Chinese text in the left input box
- Set tone style, separator, capitalization in options below
- Pinyin results appear automatically in the right output box
- Click 'Copy' to save the result
Conversion Notes
- Check names and place names manually; pinyin for polyphonic characters can depend on context and may need correction.
- Choose tone marks, tone numbers, or no tones according to where the result will be used.
Use Cases
Technical Principle
The core challenge of Chinese-to-pinyin conversion is polyphones: the same character can have different readings in different words, e.g. 'zhong' is zhong4 in 'zhongyao' (important) but chong2 in 'chongfu' (repeat); 'xing' is hang2 in 'yinhang' (bank) but xing2 in 'xingzou' (walk). Naive per-character lookup gets a lot wrong, so modern tools use a 'dictionary first, single-character fallback' strategy. This tool is built on the pinyin-pro library, which ships with hundreds of thousands of dictionary entries. When processing a text, it first tokenizes using the longest-match algorithm; words that hit the dictionary use the dictionary's specified reading, and the remaining loose characters fall back to the single-character table and pick the most common reading. For words containing 'yi' (one), tone-change rules also apply: 'yi' reads yi2 before a 4th-tone character, yi4 before other tones, and yi1 when standing alone. There are two ways to display tones: tone marks (a, a, a, a) are written directly on the vowel and are the most readable; tone numbers (a1, a2, a3, a4) go at the end of the pinyin and are convenient for transmitting and storing with ASCII characters. In pinyin structure, pinyin = initial (23 of them, e.g. b, p, m, f) + final (24 of them, e.g. a, o, e, ai) + tone (4 + neutral), and these can be split apart for teaching and indexing. Hanyu Pinyin is the PRC's official romanization, published in 1958. Older or alternative systems include Wade-Giles (e.g. Beijing as 'Pei-ching'), postal romanization ('Peking'), and Zhuyin/Bopomofo (ㄅ ㄆ ㄇ ㄈ) used in Taiwan. These still show up in proper nouns, historical documents, and traditional publications, but everyday use is dominated by Hanyu Pinyin.
- Uses a 'dictionary first, single-character fallback' strategy: look up polyphone readings by word, then fall back to the most common reading for loose characters.
- pinyin-pro ships hundreds of thousands of dictionary entries, covering the polyphone cases for the vast majority of everyday vocabulary.
- Tone marks (a) are the most readable; tone numbers (a1) are easier to store and transmit - the two forms can be converted back and forth.
- Tone-change rules for 'yi' and 'bu', as well as er-hua (retroflex) and neutral tone, are applied automatically per phonetic rules, more accurately than a pure table lookup.
- Pinyin can be split into initial + final + tone to support different scenarios like annotation, search indexing, and teaching.
- Hanyu Pinyin, Wade-Giles, and Zhuyin are different romanization systems from different periods and regions; they are not interchangeable.
Examples
Tone Marks
Input: 中国 (two characters)
Mode: tone marks (diacritics)
Output: zhōng guó
Use: print materials, slides, language textbooksTone Numbers
Input: 中国 (two characters)
Mode: tone numbers
Output: zhong1 guo2
Use: plain-text systems that cannot store combining diacriticsPolyphone disambiguation
Input: 银行 (bank) / 行走 (walk)
Mode: tone marks, dictionary-first lookup
Output: yín háng / xíng zǒu
Note: the character 行 is xíng in 行走 but háng in 银行; the tool uses the word context to pick the right readingUppercase with custom separator
Input: 北京欢迎你 (Beijing Welcomes You)
Mode: uppercase, separator = "-"
Output: BEI-JING-HUAN-YING-NI
Use: URL slugs, file names, classroom vocabulary listsFAQ
Does it convert simplified or traditional Chinese?
Both. The page reads either form and outputs Hanyu Pinyin. Polyphonic characters (多音字) like 重, 长, 行, 着 are disambiguated by a phrase dictionary so 银行 → yín háng (bank), 行走 → xíng zǒu (walk), not vice versa.
How are tones marked?
Three styles are available: tone marks (zhōng wén), numerals (zhong1 wen2), and no-tone (zhong wen). The default is tone marks; switch to numerals for systems that don't render diacritics, or no-tone for romanised search keywords.
What about the neutral tone (轻声)?
Neutral tone words like 妈妈, 桌子, 漂亮 are emitted with no tone mark on the second syllable (māma, zhuōzi, piàoliang). This matches Mainland China standard pronunciation. Taiwan-style tone-5 marking is available as an option.
Will it handle proper names correctly?
Common surnames and place names are in the dictionary. Less common names with tone-changing characters (e.g. 区 as Ōu vs qū, 单 as Shàn vs dān) may pick the wrong reading - verify after conversion. Foreign names transliterated into Chinese (e.g. 史密斯) are read character-by-character.
Are tone sandhi rules applied?
Some basic sandhi - third-tone-followed-by-third becomes second-tone-third (你好 → níhǎo) - is applied if you enable it. Most other sandhi (一 / 不 / 七 / 八 changes) is more context-dependent and the page leaves the underlying tone mark visible to avoid hiding the structure.
Can I convert pinyin back to Chinese?
Not reliably. Pinyin-to-Hanzi is highly ambiguous (zhi can be 之/只/直/止/止…) and is the job of an input method, not a converter. Use a real IME or AI text generator if you need that direction.
Is the text uploaded?
No. The conversion uses a built-in dictionary and runs locally. Pasted text never leaves your browser.