Search results: resource list
lingpipe-3.6.0
- An open-source Java toolkit for natural language processing. LingPipe already offers a rich feature set, including topic classification, named entity recognition, part-of-speech tagging, sentence detection, query spell checking, interesting phrase detection, clustering, character language modeling…
docParser
- Mapping words to concepts to categories. This code mines the Wikipedia knowledge base to support text understanding.
fp2
- fp2 finds frequent itemsets in string datasets. A text mining application.
JiaoChaShang
- An implementation of a cross-entropy algorithm for text mining; the cross entropy is computed from the probabilities of the words that appear to the left and right of a term. The task of part-of-speech identification is to automatically assign a part-of-speech tag to an unknown word with empty part-of-speech information. A part-of-speech
1111
- Text mining: a Chinese classifier search that extracts the main content of a text using a Bayesian algorithm.
cofinew
- A COFI-tree is used to mine frequent patterns from large text data.
tpTextMining
- tp text mining in Java (Eclipse)
TextMining-Tools
- Chapter 15 of Yang Jianwu's (Peking University) text mining courseware. It covers text mining tools and workflow in detail, enough to grasp the ins and outs of text mining within a day.
Text-mining
- A dozen or so papers on text mining, for example a survey of web content mining and research on web content mining techniques. Topics: text mining, data mining, web mining.
ARFFInputformat
- A custom file input-format class for Hadoop, very helpful for reading the special format of the training and test text used by data mining classification algorithms.
util
- Many useful text processing utilities, applicable to both NLP and data mining.
src
- Source for a naive Bayes classifier for text mining.
Indexing
- A typical text mining example, a plug-in for Java development platforms (dragontool). A typical route for the development of text retrieval and mining applications is illustrated in Figure 1. First of all, a collection of machine-readable documents must be prepared.
NaiveBayes
- A text classification program based on the naive Bayes algorithm; a good learning reference for data mining beginners.
ictclas4j
- Java source code for the Chinese Academy of Sciences Chinese word segmentation system. It segments Chinese text well and provides a foundation for text mining.
class
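- Chinese text classification for text that has already been word-segmented. Import your own data first; classification and prediction use the SVM from libsvm, with TF-IDF features and chi-square feature selection under a user-configurable threshold. Text mining.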
text_example
- Just source code for text mining.
cs224n-pa1-master
- Java text mining, IBM model
NLPIRS
- Chinese Academy of Sciences word segmentation tools (IKAnalyzer2013), suitable for short-text mining; classifies sentiment orientation.
project3
- Text mining: Bayes, k-means