2008-05-13 21:24:39

Over the past few days I have finished a pinyin-to-hanzi converter built on a trigram language model and the Viterbi decoding algorithm. It is written in C and Perl; the Perl part parses the raw corpus collected from the web. Compressing the language model (LM) is difficult, so for now I simply cut off every trigram whose count is less than 3.
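To make the idea concrete, here is a minimal sketch of trigram Viterbi decoding with a count cutoff. It is written in Python for brevity rather than the C and Perl of the actual system, and everything in it is an assumption made for illustration: the toy `CANDIDATES` table, the `TRIGRAM_COUNTS` values, the fixed probability floor for unseen trigrams, and the function names. Each Viterbi state is the pair of the last two characters, and each state carries its best path directly instead of using back-pointers.

```python
import math
from collections import defaultdict

# Hypothetical toy data: candidate characters for each pinyin syllable.
CANDIDATES = {
    "ni": ["你", "泥"],
    "hao": ["好", "号"],
    "ma": ["吗", "马"],
}

# Hypothetical trigram counts, as if collected from a corpus.
TRIGRAM_COUNTS = {
    ("<s>", "<s>", "你"): 9,
    ("<s>", "<s>", "泥"): 1,
    ("<s>", "你", "好"): 8,
    ("<s>", "你", "号"): 1,
    ("<s>", "泥", "好"): 1,
    ("你", "好", "吗"): 7,
    ("你", "好", "马"): 2,
    ("泥", "好", "马"): 1,
}


def prune(counts, min_count=3):
    """Count cutoff: keep only trigrams seen at least min_count times."""
    return {k: c for k, c in counts.items() if c >= min_count}


def log_prob(counts, totals, w1, w2, w3, floor=1e-6):
    """Relative-frequency estimate; unseen trigrams get a fixed floor."""
    total = totals.get((w1, w2), 0)
    c = counts.get((w1, w2, w3), 0)
    if total == 0 or c == 0:
        return math.log(floor)
    return math.log(c / total)


def viterbi(syllables, counts):
    """Return the most probable character string for the syllables."""
    totals = defaultdict(int)
    for (w1, w2, _), c in counts.items():
        totals[(w1, w2)] += c
    # State = last two characters; value = (best log score, best path).
    best = {("<s>", "<s>"): (0.0, [])}
    for syl in syllables:
        nxt = {}
        for (w1, w2), (score, path) in best.items():
            for w3 in CANDIDATES[syl]:
                s = score + log_prob(counts, totals, w1, w2, w3)
                key = (w2, w3)
                # Keep only the best-scoring path into each state.
                if key not in nxt or s > nxt[key][0]:
                    nxt[key] = (s, path + [w3])
        best = nxt
    return "".join(max(best.values(), key=lambda v: v[0])[1])


print(viterbi(["ni", "hao", "ma"], prune(TRIGRAM_COUNTS)))
```

A real model would replace the fixed floor with proper smoothing or back-off, since a hard cutoff leaves many trigrams without an explicit probability.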
The next step is to test the system and to try a back-off, entropy-based method to compress the LM. There is still a lot of hard work to do.
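One possible reading of that plan is to drop a trigram when the back-off estimate from the bigram is already close to the explicit trigram probability. The Python sketch below scores each trigram with a count-weighted log difference, which is a simplification and not necessarily the criterion intended here. The probabilities, back-off weight, threshold, and function names are all hypothetical, and the sketch does not recompute back-off weights after pruning, which a real implementation would need to do.

```python
import math

# Hypothetical entries: trigram -> (count, explicit trigram probability).
TRIGRAMS = {
    ("你", "好", "吗"): (7, 0.70),
    ("你", "好", "马"): (2, 0.05),
}

# Hypothetical bigram probabilities p(w3 | w2).
BIGRAM_PROBS = {("好", "吗"): 0.60, ("好", "马"): 0.05}

# Hypothetical back-off weights for each two-character history.
BACKOFF_WEIGHTS = {("你", "好"): 0.9}


def pruning_score(trigram, trigrams, bigram_probs, backoff_weights):
    """Count-weighted log difference between explicit and backed-off estimates."""
    w1, w2, w3 = trigram
    count, p_tri = trigrams[trigram]
    p_backoff = backoff_weights[(w1, w2)] * bigram_probs[(w2, w3)]
    return count * (math.log(p_tri) - math.log(p_backoff))


def prune_by_score(trigrams, bigram_probs, backoff_weights, threshold):
    """Keep trigrams whose removal would cost at least `threshold`."""
    return {
        t: v
        for t, v in trigrams.items()
        if pruning_score(t, trigrams, bigram_probs, backoff_weights) >= threshold
    }


kept = prune_by_score(TRIGRAMS, BIGRAM_PROBS, BACKOFF_WEIGHTS, threshold=1.0)
print(sorted(kept))
```

Unlike a plain count cutoff, this kind of score can keep a rare trigram that the bigram predicts badly and drop a frequent one that the bigram already predicts well.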


chinaunix user, 2008-08-26 22:41:39

I spotted a familiar term here. So you work on Viterbi decoding too.