ToolActToolAct

Pinyin Conversion Tool

Online Chinese to Pinyin converter with tone marks, tone numbers, and no-tone modes

Input
Characters: 0
Output
Characters: 0

Conversion Options

Tone Style
Separator
Capitalization
Non-Chinese

What is Chinese to Pinyin?

The Pinyin Converter turns Chinese characters into Hanyu Pinyin, the official romanization system for Standard Mandarin. Pinyin makes pronunciation, tones, and syllable structure visible for language learning, dictionaries, teaching material, name transliteration, sorting, input methods, and data preparation. This tool can display tones as diacritic marks, tone numbers, or no tones, and it can adjust separators and capitalization for different workflows. The main limitation is ambiguity: many Chinese characters have multiple readings, and the correct pronunciation often depends on word context. Names, technical terms, dialectal words, and historical texts should therefore be reviewed rather than converted blindly.

How to Use

How to use

  1. Paste or enter Chinese text in the left input box
  2. Set tone style, separator, capitalization in options below
  3. Pinyin results appear automatically in the right output box
  4. Click 'Copy' to save the result

Conversion Notes

  • Check names and place names manually; pinyin for polyphonic characters can depend on context and may need correction.
  • Choose tone marks, tone numbers, or no tones according to where the result will be used.

Use Cases

Convert Chinese text into pinyin for study or transliterationPaste Chinese text and generate pinyin using pinyin-pro with tone marks, tone numbers, or no tones. The result can preserve spaces, use commas, remove separators, or apply a custom separator for naming, study, or transliteration workflows. Tone marks are best for print and slide decks, while tone numbers are easier to type and store in plain-text systems.
Prepare names and labels in a chosen styleLowercase, uppercase, and capitalized output options help adapt pinyin to contact lists, filenames, product labels, classroom materials, and search keywords. The custom separator is useful for slugs or hyphenated romanization. Run a few test names through the converter first to confirm capitalization and separators match the file system or URL rules where the result will live.
Clean mixed Chinese and non-Chinese textChoose whether to keep original non-Chinese characters or remove them from the generated output. This is helpful when converting paragraphs that contain punctuation, numbers, Latin text, or symbols but the final result should focus on pronunciation. Removing non-Chinese noise makes the pinyin column easier to scan during study sessions.
Handle polyphonic characters and rare readings manuallyCharacters like 行, 长, and 乐 map to multiple readings that pinyin-pro picks from word context. Spot-check product names, place names, and historical terms manually rather than assuming the default reading matches your audience. A short review of names and titles catches the most common pinyin mistakes before publishing.
Disambiguate tone marks vs numeric tones and 多音字 (重/還)Pick tone marks for human-readable output or numeric tones (ma1, ma2, ma3, ma4, ma5) for systems that cannot store combining diacritics, and remember that the same character can be a true 多音字 — 重 is zhong4 in 重要 (important) but chong2 in 重新 (again), and 還 is hai2 in 還是 (still) but huan2 in 還原 (restore). The dictionary-first, single-character fallback strategy behind pinyin-pro still relies on context, so review readings for terms that change meaning depending on tone.

Technical Principle

The core challenge of Chinese-to-pinyin conversion is polyphones: the same character can have different readings in different words, e.g. 'zhong' is zhong4 in 'zhongyao' (important) but chong2 in 'chongfu' (repeat); 'xing' is hang2 in 'yinhang' (bank) but xing2 in 'xingzou' (walk). Naive per-character lookup gets a lot wrong, so modern tools use a 'dictionary first, single-character fallback' strategy. This tool is built on the pinyin-pro library, which ships with hundreds of thousands of dictionary entries. When processing a text, it first tokenizes using the longest-match algorithm; words that hit the dictionary use the dictionary's specified reading, and the remaining loose characters fall back to the single-character table and pick the most common reading. For words containing 'yi' (one), tone-change rules also apply: 'yi' reads yi2 before a 4th-tone character, yi4 before other tones, and yi1 when standing alone. There are two ways to display tones: tone marks (a, a, a, a) are written directly on the vowel and are the most readable; tone numbers (a1, a2, a3, a4) go at the end of the pinyin and are convenient for transmitting and storing with ASCII characters. In pinyin structure, pinyin = initial (23 of them, e.g. b, p, m, f) + final (24 of them, e.g. a, o, e, ai) + tone (4 + neutral), and these can be split apart for teaching and indexing. Hanyu Pinyin is the PRC's official romanization, published in 1958. Older or alternative systems include Wade-Giles (e.g. Beijing as 'Pei-ching'), postal romanization ('Peking'), and Zhuyin/Bopomofo (ㄅ ㄆ ㄇ ㄈ) used in Taiwan. These still show up in proper nouns, historical documents, and traditional publications, but everyday use is dominated by Hanyu Pinyin.

  • Uses a 'dictionary first, single-character fallback' strategy: look up polyphone readings by word, then fall back to the most common reading for loose characters.
  • pinyin-pro ships hundreds of thousands of dictionary entries, covering the polyphone cases for the vast majority of everyday vocabulary.
  • Tone marks (a) are the most readable; tone numbers (a1) are easier to store and transmit - the two forms can be converted back and forth.
  • Tone-change rules for 'yi' and 'bu', as well as er-hua (retroflex) and neutral tone, are applied automatically per phonetic rules, more accurately than a pure table lookup.
  • Pinyin can be split into initial + final + tone to support different scenarios like annotation, search indexing, and teaching.
  • Hanyu Pinyin, Wade-Giles, and Zhuyin are different romanization systems from different periods and regions; they are not interchangeable.

Examples

Tone Marks

Input:   中国 (two characters)
Mode:    tone marks (diacritics)
Output:  zhōng guó
Use:     print materials, slides, language textbooks

Tone Numbers

Input:   中国 (two characters)
Mode:    tone numbers
Output:  zhong1 guo2
Use:     plain-text systems that cannot store combining diacritics

Polyphone disambiguation

Input:   银行 (bank) / 行走 (walk)
Mode:    tone marks, dictionary-first lookup
Output:  yín háng  /  xíng zǒu
Note:    the character 行 is xíng in 行走 but háng in 银行; the tool uses the word context to pick the right reading

Uppercase with custom separator

Input:   北京欢迎你 (Beijing Welcomes You)
Mode:    uppercase, separator = "-"
Output:  BEI-JING-HUAN-YING-NI
Use:     URL slugs, file names, classroom vocabulary lists

FAQ

Does it convert simplified or traditional Chinese?

Both. The page reads either form and outputs Hanyu Pinyin. Polyphonic characters (多音字) like 重, 长, 行, 着 are disambiguated by a phrase dictionary so 银行 → yín háng (bank), 行走 → xíng zǒu (walk), not vice versa.

How are tones marked?

Three styles are available: tone marks (zhōng wén), numerals (zhong1 wen2), and no-tone (zhong wen). The default is tone marks; switch to numerals for systems that don't render diacritics, or no-tone for romanised search keywords.

What about the neutral tone (轻声)?

Neutral tone words like 妈妈, 桌子, 漂亮 are emitted with no tone mark on the second syllable (māma, zhuōzi, piàoliang). This matches Mainland China standard pronunciation. Taiwan-style tone-5 marking is available as an option.

Will it handle proper names correctly?

Common surnames and place names are in the dictionary. Less common names with tone-changing characters (e.g. 区 as Ōu vs qū, 单 as Shàn vs dān) may pick the wrong reading - verify after conversion. Foreign names transliterated into Chinese (e.g. 史密斯) are read character-by-character.

Are tone sandhi rules applied?

Some basic sandhi - third-tone-followed-by-third becomes second-tone-third (你好 → níhǎo) - is applied if you enable it. Most other sandhi (一 / 不 / 七 / 八 changes) is more context-dependent and the page leaves the underlying tone mark visible to avoid hiding the structure.

Can I convert pinyin back to Chinese?

Not reliably. Pinyin-to-Hanzi is highly ambiguous (zhi can be 之/只/直/止/止…) and is the job of an input method, not a converter. Use a real IME or AI text generator if you need that direction.

Is the text uploaded?

No. The conversion uses a built-in dictionary and runs locally. Pasted text never leaves your browser.