SunPinyin-2.0 Project Plan and Status Tracking Page

M1: 2009/05 ~ 2009/06

  1. re-implement the CPinyinTrie classes to use encoded syllable
    (instead of pinyin character) as key to lookup the systeam lexicon.
  2. implement the static Double-Array-Trie class, to load pinyin suffix trie data, assisting the pinyin segmentation.
  3. define the IPySegmentor interface, and implement the QuanPin segmentor basing on the pinyin suffix trie.

M2: 2009/06 ~ 2009/07

  1. re-implement the CIMIContext class basing on the new data strcture which would be largely changed.
  2. implement the CUserDict class (using SQLite).
  3. re-implement the classic view, and update gtk-standalone accordingly, to test the classic view.

M3: 2009/07 ~ 2009/08

  1. re-implement the modern view.
  2. re-implement the user configuration.
  3. testing the functionalities of views with gtk-standalone, and resolve found issues/bugs.

M4: 2009/08 ~ 2009/09

  1. update the Mac OS porting (by Yong, jjgod or other community developers)
  2. update the scim-sunpinyin porting (by Kov Chai or other community developers)
  3. update the iiimf-cle-sunpinyin porting (low priority, dueto the
    local file access limitation of iiimf, userdict may not be enabled)

M5: 2009/09 ~ 2009/10

  1. port to ibus framework (by Yong or other community developers)
  2. add Double Pinyin support
  3. bug fixes

Future Plans

  1. add ZhuYin support
  2. add Simplified Chinese -> Traditional Chinese conversion on Mac OS porting (scim has this feature as a framework facility)
  3. add more new words in dictionary, and retrain the language model
    ... ...

10 thoughts on “SunPinyin-2.0 Project Plan and Status Tracking Page

  1. 希望Mac OS X的port可以做得更好一些

  2. Jeremy,sunpinyin的内部就是一个智能整句输入的算法,可以参考一下我们的代码导读文档 ...

  3. Hi Yong. I built a sun pinyin package for SCIM on linux. You and the rest of the developers did (and continue to do) a great job. This is a really nice input method for Chinese.
    I have a question, I am fine with simplified Chinese, but I really prefer traditional characters. My question is: how hard would it be to build the package to input traditional characters? I haven't spent too much time yet reading the source, but since the dictionaries are based on utf-8, would it be as easy as using traditional characters in the binary data files? or is it more "hard coded" than that?
    I would be happy to volunteer, but right now, I don't know much about the structure of the binary files, specifically, lm_sc.t3g and pydict_sc.bin. Thanks.

  4. Hi, Jon, you are so welcome to join the development, you could refer to the code tour document (the SLM part) for more details of lexicon and slm binary files. You could contact me anytime, MSN: findsun AT hotmail DOT com, or Gtalk: findsun AT gmail DOT com. I'm not sure if you are fine with HanYu pinyin or you prefer to ZhuYin. BTW, SCIM has a built-in filter, that could translate Simplified Chinese to Traditional Chinese.

  5. 可以考虑一下即将推出的 Snow Leopard 的兼容性问题?
    我是多少期待 Sun 能加入五笔功能

  6. 很不错的输入法~

  7. 最近在使用sunpinyin的时候,发现在打开"首选项",点击“检查更新”出现检查更新窗口时,要是在某个程序窗口继续输入文字,就会出现程序无响应,我在不同的程序中试了五六次,其中三次都无奈强行关闭了系统,我的是10.5.8系统4G内存,sunpinyin1.0;另外,建议添加双拼输入方式,谢谢

  8. wei,非常感谢您的反馈,我会设法联系jjgod来看看这个问题,双拼输入方式在2.0版本的计划中 ...

Leave a Reply

Your email address will not be published. Required fields are marked *

To submit your comment, click the image below where it asks you to...
Clickcha - The One-Click Captcha