搜索资源列表
Classifier4J
- Classifier4J这个Java类库为文本自动分类提供了一个API。缺省(目前)实现的API是一个贝叶斯分类器。这个类库可以用于多个目的-可能作为一个垃圾邮件过滤器或一个blog-Classifier4J Java class library for the automatic text classification provided an API. Default (current) to achieve the API is a Bayesian classifier. This lib
BayesClassifier.rar
- 贝叶斯分类算法,构造朴素贝叶斯分类器,对给定的中文文本进行分类,Bayesian classification algorithm, Naive Bayes classifier structure of a given Chinese text classification
kmeans
- kmeans文本聚类算法的简单实现,是用JAVA实现的-kmeans a simple realization of the text clustering algorithm is implemented using JAVA
SVM
- 这是SVM文本分类程序源码,既有java版本,也有C++版本。 -This is a SVM text classification program source code, both java versions, there are C++ version.
svm-EM
- SVM(支持向量机)和EM(最大熵)文本分类算法,压缩包中包括了测试文本(环境类和计算机类),词典,停用词表等。-SVM (support vector machine) and EM (maximum entropy) text classification algorithm, compressed package includes test text (environmental and computer), dictionary, thesaurus, such as disabled.
BayesFileClassify
- 文本分类是在文档所包含的文本基础上, 把给定的文档分配到固定类别集合中某一个类别的任务。这个任务中常常用到朴素贝叶斯模型。-Text classification are contained in the document text, based on the given document category assigned to a fixed set of a certain type of mission. This task is often used Naive Bayes model
classfer
- 文本分类器 文本分类算法 详细介绍了算法过程-Text Classifier Algorithm for text classification algorithm described in detail the process of
text_classification
- 这是一个文本分类的简单算法,基于weka的,实现文本的分类。-This is a easy classification code.
trieTree
- 一个基于hashtable的理论而编程的程序,建立文本时根据keywordtree而建立的-based on hash table, build table by keyword tree
text_data_mining
- java编写的数据挖掘方面的代码,里面包含有文本分类,作者身份识别方面的java源码,本人亲自参与编写-java code about data mining;include:text cluster ,authorship identification,
svm
- SVM代码,适用于文本分类,为学习SVM的朋友提供一个参考,共同学习。-SVM code for text classification, the study provides a reference SVM friends, learning together. 提供更好的翻译建议
KNN0.0_0.0
- 这是用java实现的KNN算法分类器 实现了机器的自动学习与文本自动分类 仅供学习使用-This is a java implementation of the KNN algorithm with classifier achieved the automatic machine learning and text categorization learning to use only
NBClassify
- 人工智能。基于朴素贝叶斯的文本分类器,测试正确率较高。-Artificial intelligence. Naive Bayes text classification based on, test accuracy is higher.
byteHFMEncode_decodesrc
- 以字符为单位进行的哈弗曼压缩,界面比较简单,但是功能还算可以,速度还算可以压缩 77.3m 的 wmv 视频使用了 4.5秒 ,400k 的doc 用0.186秒 ,压缩率要看内容,文本大约有 60 。-In characters for the Ha Fuman compression, the interface is relatively simple, but the functionality is OK, speed still can compress wmv video usin
Kmeans
- K-means算法实现文本聚类,Java实现的版本-K-means algorithm for text clustering
vsm-and-cluster-kmeans
- 对文本聚类的向量空间模型机制VSM以及Kmeans等讲解比较详细,是研究文本聚类的好书籍-Clustering of text vector space model to explain the mechanism of VSM and Kmeans and other more detailed, study of text clustering is a good book
FCM
- 基于遗传FCM算法的文本聚类,FCM算法运用到文本中-Based on genetic FCM clustering algorithm, FCM algorithm is applied to the text
java-diff
- diff查找出两文本的异同点 也可以进行2次开发进行异同筛选-diff can also find out the similarities and differences between the two text 2 development, the similarities and differences between screening
bayes
- 基于java,利用贝叶斯算法实现中文文本分类。(包含可调用的trainingset)(Text classification based on Java using Bayesian algorithm.)
CNN
- 卷积神经网络的源代码,用于微博博文文本情感分析的三分类。(Convolutional neural network source code for micro-blog Bowen text sentiment analysis of the three categories.)