搜索资源列表
ICTCLAS
- 中科院分词系统VC++版本,在VS2005下编译通过,含有所有源代码,可以保证在中科院算法上,自己添加新的想法,或者对已有的算法进行优化。-Word Segmentation System, Chinese Academy of Sciences VC++ version at compile under VS2005 passed, containing all the source code, algorithms can guarantee at the Chinese Academy o
keyword-chouqu
- 基于逆向最大匹配算法的分词及基于HMM模型的词性标注系统,包括了未登录词的识别、数据库的添加等内容。(需要手动修改数据库的路径才可以运行)-Reverse Maximum Matching Algorithm Based on the sub-word HMM-based model and part of speech tagging system, including the unknown word identification, such as the contents of the d
fenci
- 中文分词,可将文件划分词性,有词库,可添加新词-Use LR analysis of the compiler, may realize the words and grammar, semantics analysi
MyAnalyzer
- 简单的中文分词及敏感词检索工具,其中可以任意添加敏感词,并且可以选择使用正向最大匹配算法,逆向最大匹配算法,双向最大匹配算法中的一种进行分词。-Simple Chinese word and sensitive word retrieval tool, which can add any sensitive words, and you can choose to use the forward maximum matching algorithm, reverse maximum matchi