Open Chinese Convert (OpenCC) Python Bindings

1.2.0 · active · verified Thu Apr 16

OpenCC (Open Chinese Convert) is an open-source project for high-quality conversion between Traditional Chinese, Simplified Chinese, and Japanese Kanji (Shinjitai). The Python `opencc` package provides bindings to the underlying C++ library, supporting character-level and phrase-level conversion, character variant handling, and adaptation to regional idioms. The current version is 1.2.0; releases track updates to the C++ library.
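
To make the difference between character-level and phrase-level conversion concrete, here is a minimal pure-Python sketch. It does not call OpenCC: the two dictionaries and both function names are hypothetical, and the longest-match loop is only a simplified stand-in for what a real converter does with its own dictionary files.

```python
# Toy dictionaries for illustration only; these are NOT OpenCC's data files.
CHAR_MAP = {"标": "標", "键": "鍵", "盘": "盤"}
PHRASE_MAP = {"鼠标": "滑鼠"}  # regional idiom: Mainland "鼠标" vs. Taiwan "滑鼠"


def convert_chars(text: str) -> str:
    """Character-level conversion: map each character independently."""
    return "".join(CHAR_MAP.get(ch, ch) for ch in text)


def convert_phrases(text: str) -> str:
    """Phrase-level conversion: prefer the longest phrase match,
    then fall back to the per-character mapping."""
    max_len = max(len(phrase) for phrase in PHRASE_MAP)
    out = []
    i = 0
    while i < len(text):
        # Try candidate phrases from longest to shortest (at least 2 characters).
        for size in range(min(max_len, len(text) - i), 1, -1):
            chunk = text[i:i + size]
            if chunk in PHRASE_MAP:
                out.append(PHRASE_MAP[chunk])
                i += size
                break
        else:
            # No phrase matched at this position: convert a single character.
            out.append(CHAR_MAP.get(text[i], text[i]))
            i += 1
    return "".join(out)


print(convert_chars("鼠标键盘"))    # character-level result
print(convert_phrases("鼠标键盘"))  # phrase-level result
```

In this sketch the character-level pass leaves "鼠" untouched and only swaps the simplified glyphs, while the phrase-level pass replaces the whole word with the regional term. In OpenCC itself, the configuration file chosen (for example `s2tw.json` versus `s2twp.json`) decides whether idiom conversion is applied.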

Common errors

Warnings

Install

Imports

Quickstart

This quickstart imports the `OpenCC` class, initializes a converter with a specific configuration (here, Simplified Chinese to Traditional Chinese, Taiwan Standard), and converts a string. It then uses a second converter to go from Traditional Chinese to Simplified Chinese.

from opencc import OpenCC

# Initialize converter: Simplified Chinese to Traditional Chinese (Taiwan Standard)
# Commonly used configs include: 's2t.json', 't2s.json', 's2tw.json', 'tw2s.json', 's2hk.json', 'hk2s.json', 's2twp.json'
converter = OpenCC('s2tw.json')

# Text to convert
simplified_text = "鼠标键盘,开放中文转换。"

# Perform conversion
traditional_text = converter.convert(simplified_text)

print(f"Simplified: {simplified_text}")
print(f"Traditional (Taiwan): {traditional_text}")

# Example with another conversion: Traditional Chinese to Simplified Chinese
another_converter = OpenCC('t2s.json')
traditional_input = "滑鼠鍵盤,開放中文轉換。"
simplified_output = another_converter.convert(traditional_input)
print(f"Traditional: {traditional_input}")
print(f"Simplified: {simplified_output}")
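
The configuration names used above follow a `<source>2<target>` pattern, with a trailing `p` for variants that also convert regional phrases. As a rough sketch of that naming scheme, the helper below builds a config file name from variant codes. `config_name` and `KNOWN_CONFIGS` are hypothetical names introduced here, not part of the `opencc` package, and the set is an assumption about which configs ship with the library rather than an authoritative list.

```python
# Hypothetical helper, not part of opencc. The names follow OpenCC's
# "<source>2<target>[p]" naming scheme; the set below is assumed, not exhaustive.
KNOWN_CONFIGS = {
    "s2t", "t2s", "s2tw", "tw2s", "s2hk", "hk2s",
    "s2twp", "tw2sp", "t2tw", "tw2t", "t2hk", "hk2t", "t2jp", "jp2t",
}


def config_name(source: str, target: str, phrases: bool = False) -> str:
    """Return a config file name such as 's2twp.json'.

    source/target are variant codes: 's' (Simplified), 't' (Traditional),
    'tw' (Taiwan), 'hk' (Hong Kong), 'jp' (Japanese Shinjitai).
    """
    name = f"{source}2{target}"
    if phrases:
        name += "p"  # phrase-converting variant, e.g. s2twp
    if name not in KNOWN_CONFIGS:
        raise ValueError(f"no known OpenCC config for {name!r}")
    return f"{name}.json"


print(config_name("s", "tw"))                # s2tw.json
print(config_name("s", "tw", phrases=True))  # s2twp.json
```

The returned string can be passed straight to the constructor, as in `OpenCC(config_name("s", "tw"))`. Rejecting unknown combinations up front gives a clearer error than a failed config lookup at construction time.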
