Metadata-Version: 2.4
Name: pinyin_jyutping_sentence
Version: 1.3
Summary: Convert a Chinese sentence to Pinyin or Jyutping
Home-page: https://github.com/lucwastiaux/python-pinyin-jyutping-sentence
Author: Luc Wastiaux
Author-email: lucw@airpost.net
License: GPL
Classifier: Programming Language :: Python :: 3.7
Classifier: Topic :: Text Processing :: Linguistic
Requires-Dist: jieba
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: home-page
Dynamic: license
Dynamic: requires-dist
Dynamic: summary

python-pinyin-jyutping-sentence
===============================

.. image:: https://travis-ci.org/lucwastiaux/python-pinyin-jyutping-sentence.svg?branch=master
 :target: https://travis-ci.org/lucwastiaux/python-pinyin-jyutping-sentence

A Python module that converts a Chinese sentence, written in either Simplified or Traditional characters, to Mandarin Pinyin or Cantonese Jyutping. Output uses diacritics (accented characters) by default, or tone numbers on request. I designed this library to create Mandarin and Cantonese flashcards.

Want to support my work on this module? Become a supporter: https://www.patreon.com/lucw

Install
-------

.. code:: bash

    $ pip install pinyin_jyutping_sentence

Usage
-----

.. code:: python

    >>> import pinyin_jyutping_sentence
    >>> pinyin_jyutping_sentence.pinyin("提高口语")
    'tígāo kǒuyǔ'
    >>> pinyin_jyutping_sentence.jyutping("我出去攞野食")
    'ngǒ cēothêoi ló jěsik'
    >>> # the tone_numbers argument replaces diacritics with tone numbers
    >>> pinyin_jyutping_sentence.pinyin("忘拿一些东西了", tone_numbers=True)
    'wang4 na2 yi1xie1 dong1xi5 le5'
    >>> # the spaces argument adds a space between each syllable
    >>> pinyin_jyutping_sentence.pinyin("忘拿一些东西了", tone_numbers=True, spaces=True)
    'wang4 na2 yi1 xie1 dong1 xi5 le5'
    >>> pinyin_jyutping_sentence.jyutping("有啲好貴", tone_numbers=True)
    'jau5 di1 hou3 gwai3'
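
The relationship between the two Pinyin output styles above can be illustrated with a minimal sketch. It applies the conventional Pinyin tone-placement rule (mark ``a`` or ``e`` if present, the ``o`` in ``ou``, otherwise the last vowel) to numbered syllables. The names ``mark_syllable`` and ``mark_text`` are hypothetical and not part of this library, and this is not necessarily how the library converts internally. It covers Pinyin only; the Jyutping diacritics follow a separate, non-standard scheme.

```python
import re
import unicodedata

# Combining marks for tones 1-4; tone 5 (neutral) carries no mark.
TONE_MARKS = {1: "\u0304", 2: "\u0301", 3: "\u030c", 4: "\u0300"}


def mark_syllable(syllable: str) -> str:
    """Convert one numbered syllable such as 'dong1' to 'dōng' (hypothetical helper)."""
    if not syllable or not syllable[-1].isdigit():
        return syllable  # no tone number, leave untouched
    base, tone = syllable[:-1], int(syllable[-1])
    # Accept the common ASCII spellings of u-umlaut.
    base = base.replace("u:", "ü").replace("v", "ü")
    mark = TONE_MARKS.get(tone)
    if mark is None:
        return base  # neutral tone: drop the digit, add no mark
    lowered = base.lower()
    if "a" in lowered:
        index = lowered.index("a")
    elif "e" in lowered:
        index = lowered.index("e")
    elif "ou" in lowered:
        index = lowered.index("o")
    else:
        vowels = [i for i, ch in enumerate(lowered) if ch in "iouü"]
        if not vowels:
            return base
        index = vowels[-1]  # otherwise the last vowel takes the mark
    marked = base[: index + 1] + mark + base[index + 1 :]
    # Compose base letter + combining mark into a single accented character.
    return unicodedata.normalize("NFC", marked)


def mark_text(text: str) -> str:
    """Apply mark_syllable to every numbered syllable found in text."""
    return re.sub(r"[A-Za-zü:]+[1-5]", lambda m: mark_syllable(m.group()), text)


print(mark_text("wang4 na2 yi1xie1 dong1xi5 le5"))
```

Keeping the tone as a trailing digit until the last step makes the syllable boundaries explicit, which is why the sketch can process joined words such as ``yi1xie1`` one syllable at a time.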
    
Changelog
---------

* v1.1: improve conversion logic for single characters
* v0.9: removed stdout logging, added tox support
* v0.8: embed MDBG CC-CEDICT for more accurate Pinyin conversions
* v0.6: allow converting Traditional characters to Pinyin, and Simplified to Jyutping

Google Sheets add-on
--------------------

This library is also available as a Google Sheets add-on. You can read about it here: https://medium.com/@lucw/converting-chinese-characters-to-pinyin-or-jyutping-on-google-sheets-eb12cca669cb

How it works
------------

The module uses the Jieba library (https://github.com/fxsjy/jieba) to tokenize the sentence. Each word is then converted to Pinyin or Jyutping, either as a whole or character by character, using the CC-Canto dictionary (http://cantonese.org/about.html). Since v0.8, Pinyin conversion also uses the embedded MDBG CC-CEDICT data. The Jyutping diacritic notation is not standard; it was originally described here: http://www.cantonese.sheik.co.uk/phorum/read.php?1,127274,129006
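
The two-level lookup described above can be sketched as follows. This is an illustration under stated assumptions, not the module's actual code: ``romanize_tokens``, ``WORDS`` and ``CHARS`` are hypothetical names, the dictionaries are toy data rather than CC-Canto or CC-CEDICT entries, and the choice to fall back to characters when a word has no entry is assumed. The token list is hard-coded to keep the example dependency-free; in practice it would come from ``jieba.cut(sentence)``.

```python
from typing import Dict, Iterable, List


def romanize_tokens(
    tokens: Iterable[str],
    word_dict: Dict[str, str],
    char_dict: Dict[str, str],
) -> str:
    """Romanize each token as a whole word if possible, else per character."""
    parts: List[str] = []
    for token in tokens:
        if token in word_dict:
            # Whole-word entry: preferred, since it captures word-level readings.
            parts.append(word_dict[token])
        else:
            # Assumed fallback: look up each character, keeping unknown ones as-is.
            parts.append("".join(char_dict.get(ch, ch) for ch in token))
    return " ".join(parts)


# Hypothetical toy dictionaries, for illustration only.
WORDS = {"提高": "ti2gao1"}
CHARS = {"提": "ti2", "高": "gao1", "口": "kou3", "语": "yu3"}

# Stand-in for list(jieba.cut("提高口语")); the real segmentation may differ.
tokens = ["提高", "口语"]
print(romanize_tokens(tokens, WORDS, CHARS))  # ti2gao1 kou3yu3
```

Looking up whole words first matters for characters with several readings, where the correct reading depends on the word the character appears in.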


