博客首页 注册 建议与交流 排行榜 加入友情链接
推荐 投诉 搜索: 帮助

剑胆琴心

知我者谓我心忧 不知我者谓我何求
  zsc.cublog.cn

关于作者
姓名:freebsd13
职业:IT
年龄:23
位置:北京-中关村
个性介绍:尘世间一迷途小书童
主页:http://www.cipsc.org.cn/~zsc
|| << >> ||
我的分类


Something of these days
These days, i have completed a convertor from pinyin to hanzi, using trigram and Viterbi decoding algorithm. It is written in C and Perl, which(Perl) is used to parse the raw corpus from the WWW. It is diffcult to compress the Langusge Model(LM), i just cut off the trigrams which count is less than 3.
The next step is do some test of this system and do some back-off based Entropy so as to compress the LM. There are a lot of hard work to be done.

发表于: 2008-05-13,修改于: 2008-05-13 21:24,已浏览137次,有评论0条 推荐 投诉


网友评论
 发表评论