Chinese Pinyin Conversion (pypinyin)

0.55.0 · active · verified Sat Apr 11

pypinyin is a Python library for converting Chinese characters to Pinyin. It intelligently matches the most fitting Pinyin based on phrase occurrences, supports heteronyms (multi-pronunciation characters), simplified/traditional Chinese, Zhuyin, and various Pinyin styles (e.g., tone conventions). The library is actively maintained, with version 0.55.0 released recently, demonstrating a consistent release cadence.

Warnings

Install

Imports

Quickstart

Demonstrates basic conversion of Chinese characters to Pinyin using `pinyin` and `lazy_pinyin` functions, including handling different styles and heteronyms.

from pypinyin import pinyin, lazy_pinyin, Style

chinese_text = "你好,世界!"

# Convert to Pinyin with tone marks (default style)
pinyin_result_toned = pinyin(chinese_text)
print(f"Toned Pinyin: {pinyin_result_toned}")

# Convert to Pinyin without tone marks (lazy_pinyin)
pinyin_result_lazy = lazy_pinyin(chinese_text)
print(f"Lazy Pinyin: {pinyin_result_lazy}")

# Convert to Pinyin using first letter style
pinyin_result_first_letter = pinyin(chinese_text, style=Style.FIRST_LETTER)
print(f"First Letter Pinyin: {pinyin_result_first_letter}")

# Handle heteronyms (multi-pronunciation characters)
heteronym_text = "中心"
pinyin_heteronym = pinyin(heteronym_text, heteronym=True)
print(f"Heteronym Pinyin for '中心': {pinyin_heteronym}")

view raw JSON →