搜索资源列表
HanziStatics.rar
- 汉字统计程序
mlct_public
- 这是一个基于Java的分词、N-gram统计、分段 、分句等功能的程序,支持多种语言-This is a Java-based segmentation, N-gram statistics, the sub-clause of the function procedures, multilingual support
Maxmin
- 一个简单的数字计算和数字统计的VB小程序,可以用来参考做一些复杂程序的方法-a simple calculation of the figures and statistics on the VB small program can be used to do some reference to the complex procedures
maxent-20041229[1].win32
- 文本分析中基于统计的方法中,最为常用的最大熵算法,该源码为Python版,广泛应用于词性标注,词义消岐等领域-text analysis based on statistical methods, the most commonly used of maximum entropy algorithm, the source code for Python version, widely used in tagging, Meaning Consumers divergent fields
xdgf
- 字符处理这是一个基于Java的分词、N-gram统计、分段 、分句等功能的程序,支持多种语-characters to deal with this is a Java-based segmentation, N-gram to statistics, subparagraph Clauses function procedures, multiple language support
TestCorpusyuliaoguanli
- 1. 这是一个简单的语料库管理系统 2. 可以添加和删除语料文件,统计语料中的字数 3. 可以查找语料中的汉字串以及重叠形式 4. 语料文件存放在corpus目录下,查询结果保存在跟语料库相同目录下 5. corpus目录下有4个文本文件(其中test1, test2是两个小文件)供测试用 6. 只能处理文本文件,GB内码-1. This is a simple Corpus management system 2. They can add and delete corpu
wenben.txt
- 在一个文件中找到给定单词出现的位置并统计出现次数-documents in a given word to find the location and frequency statistics
text2idngram
- 最注明的cmu语言模型工具箱中的将文本转化为trigram统计的工具。在linux下可用。用法可使用-help命令。-most of the annotated cmu language model kit of text into trigram statistics tool. Linux can be used in the next. Usage may use-help orders.
BiHZFreqCode
- 汉字二字组频度统计。可以统计汉字文本中二字组的频度。很好用。中文文本分词很有用的工具。-Chinese word frequency statistics group. Chinese statistics can text the word frequency group. Good use. Chinese text segmentation useful tool.
nsp-v0.71.tar
- N元组统计程序源代码,使用perl编写,作者是Ted Pedersen。-N group statistical source code, the use of perl preparation, the author is Ted Pedersen.
ProbWordSeg1
- 基于最大概率的分词,首先读入.mdb数据库(字典与其统计词频),然后读入你要分词的.txt-based on the maximum probability of the word, first read into. Mdb database (with dictionary word frequency statistics). Then you should read into the word. txt
TS300Src
- 从唐诗300首中统计作者和发表的诗篇,用perl语言实现-from the Tang Dynasty 300 Statistics published by the author and poetry, using perl language
Sohu.ZIP
- 统计http://www.sohu.cn/页面中有多少个静态的超链接,用perl语言实现-statistics http://www.sohu.cn/ pages static number of hyperlinks using perl language
Oasis(Beta)
- 解码器是基于短语的统计机器翻译系统的核心模块,本解码器是“丝路”1.0 版(SilkRoad V1.0)中由哈尔滨工业大学开发的“绿洲Oasis”解码器。研究统计机器翻译的研究者必备。-decoder is based on the weight of statistical machine translation system's core module, The decoder is the "Silk Road", Version 1.0 (V1.0 SilkR
CAMEL
- 解码器是基于短语的统计机器翻译系统的核心模块,本解码器是“丝路”1.0 版(SilkRoad V1.0)中由中科院计算所开发的“骆驼CAMEL”解码器。研究统计机器翻译的研究者必备。-decoder is based on the weight of statistical machine translation system's core module, The decoder is the "Silk Road", Version 1.0 (V1.0 SilkRo
moses-2007-01-10
- 解码器是基于短语的统计机器翻译系统的核心模块,本解码器MOSE由SMT权威开发的解码器。研究统计机器翻译的研究者必备。,-decoder is based on the weight of statistical machine translation system's core module, MOSE the decoder developed by the authority of the SMT decoder. Study of statistical machine tra
CARAVAN
- 解码器是基于短语的统计机器翻译系统的核心模块,本解码器是“丝路”1.0 版(SilkRoad V1.0)中由厦门大学开发的“商队Caravan”解码器。研究统计机器翻译的研究者必备。-decoder is based on the weight of statistical machine translation system's core module, The decoder is the "Silk Road", Version 1.0 (V1.0 SilkRo
MFC查词典、分词、词频统计程序
- MFC编程,功能是查词典(用户可自己导入文本),分词,统计词频,还可以保存结果!我们MFC课的期末作业,强烈推荐!-MFC programming function is to check dictionary (users can import their own version), participle, statistical, frequency, the results can be saved! We MFC class at the end operations, strongly
ngrams
- 自然语言处理相关程序,有关分词的和词频统计-Natural language processing procedures, the statistical segmentation and word frequency
地统计代码
- 地统计代码,小波分析,人工神经网络等算法