Search resource list
demo
- Text similarity comparison; works well.
Text similarity calculation 2
- Text similarity calculation; worth downloading.
main. Program for computing similarity between texts
- A program that computes the similarity between texts, for use in text clustering. It works from the already-known feature vector of each text and measures similarity with the cosine of the angle between vectors.
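
The cosine measure described in this entry can be sketched as follows. This is a minimal illustration, not the listed program's code: the function name `cosine_similarity`, the sample vectors, and the choice to return 0.0 for an all-zero vector are assumptions made here.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Assumed convention: an all-zero vector is similar to nothing.
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical feature vectors for three texts.
doc_a = [1.0, 2.0, 0.0]
doc_b = [2.0, 4.0, 0.0]  # same direction as doc_a
doc_c = [0.0, 0.0, 3.0]  # shares no features with doc_a

print(cosine_similarity(doc_a, doc_b))
print(cosine_similarity(doc_a, doc_c))
```

Because the measure depends only on direction, two texts with proportional feature vectors score close to 1 regardless of length, which is one reason it is commonly used for clustering.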
TFIDF
- C# source code for the TF-IDF algorithm that computes text vectors, together with source code that computes text similarity using cosine distance.
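
For readers unfamiliar with TF-IDF text vectors, here is a small Python sketch (the listed source is C#, so this is not its code). The weighting used, term frequency as count divided by document length and IDF as `log(N / df)`, is one common variant and is an assumption here; `tfidf_vectors` and the sample documents are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, vectors) for a list of tokenized documents.

    Assumed weighting: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    vocab = sorted({term for doc in docs for term in doc})
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        length = len(doc) or 1  # avoid division by zero for empty documents
        vectors.append([
            (counts[term] / length) * math.log(n_docs / df[term])
            for term in vocab
        ])
    return vocab, vectors


# Hypothetical, already-tokenized documents.
docs = [
    "the cat sat".split(),
    "the dog sat".split(),
    "the cat ran".split(),
]
vocab, vectors = tfidf_vectors(docs)
for row in vectors:
    print([round(w, 3) for w in row])
```

Under this weighting a term that occurs in every document (here "the") gets weight 0, so it contributes nothing to a later cosine comparison.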
stex
- Performs string matching across all text files in a folder and reports the corresponding similarity for each.
simalar
- Python-based word similarity analysis: analyzes several large texts to assess the accuracy of the word similarity judgments given in a test file.
wordsimilar
- Word classification, similarity calculation, text corpus analysis, categorization, and HowNet (知网) data classification.
Measuring-the-SemanticSimilarity
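- This paper presents a knowledge-based method for measuring the semantic similarity of texts. Although there is a large body of previous work on the semantic similarity of concepts and words, applying these word-oriented methods to text similarity has not yet been explored. The paper introduces a method that combines word-to-word similarity metrics into a text-to-text metric and shows that it outperforms traditional text similarity metrics based on lexical matching.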
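
The idea of building a text-to-text score from a word-to-word metric can be illustrated with the sketch below. The listing does not give the paper's formula, so this is a simplified, unweighted stand-in: each word is matched to its most similar word in the other text, the best scores are averaged, and the two directions are averaged again. `text_similarity`, `toy_word_sim`, and the similarity values in `TOY_SIM` are all hypothetical.

```python
def text_similarity(words_a, words_b, word_sim):
    """Combine a word-to-word similarity into a symmetric text-to-text score."""
    def directed(src, dst):
        if not src or not dst:
            return 0.0
        # For each source word, keep its best match in the other text.
        return sum(max(word_sim(w, v) for v in dst) for w in src) / len(src)

    return 0.5 * (directed(words_a, words_b) + directed(words_b, words_a))


# Hypothetical stand-in for a knowledge-based word similarity measure.
TOY_SIM = {
    frozenset(("car", "automobile")): 0.9,
    frozenset(("fast", "quick")): 0.8,
}


def toy_word_sim(w, v):
    if w == v:
        return 1.0
    return TOY_SIM.get(frozenset((w, v)), 0.0)


a = ["fast", "car"]
b = ["quick", "automobile"]
score = text_similarity(a, b, toy_word_sim)
print(score)
```

The two sample texts share no words, so a purely lexical overlap measure would score them 0, while the combined metric still gives them a high score.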
CMDiff
- A diff tool implemented in C# that compares two text files for differences and computes their text similarity.
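
A rough Python analogue of "diff plus similarity" can be put together with the standard-library `difflib` module. This is not the C# tool's algorithm, which the listing does not describe; `compare_texts` and the sample lines are assumptions for illustration.

```python
import difflib


def compare_texts(old_lines, new_lines):
    """Return (unified diff lines, similarity ratio) for two lists of lines."""
    diff = list(difflib.unified_diff(old_lines, new_lines,
                                     fromfile="old", tofile="new"))
    # ratio() is 2 * matches / (len(old) + len(new)), computed over whole lines.
    ratio = difflib.SequenceMatcher(None, old_lines, new_lines).ratio()
    return diff, ratio


# Hypothetical file contents, one list item per line.
old = ["alpha\n", "beta\n", "gamma\n"]
new = ["alpha\n", "beta2\n", "gamma\n"]

diff, ratio = compare_texts(old, new)
print("".join(diff))
print(ratio)
```

Comparing line by line keeps the ratio coarse; passing the full file contents as strings to `SequenceMatcher` would give a character-level ratio instead.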
wenbenxiangsidujisuan
- Code for a text similarity calculation tool. This algorithm is much needed when building a search engine, and the code is a useful reference for anyone who wants to develop applications in this area.
TextSimilarity
- Vector-based text similarity calculation program with a graphical interface.
CosineSimilarAlgorithmzf
- Draws on mathematical and statistical techniques: TF/IDF weights, text similarity from the cosine of the angle between vectors, the Euclidean distance between two data points computed from the variance, and data clustering with k-means.
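
The clustering step named in this entry can be sketched with a plain Lloyd-style k-means over Euclidean distance. This is only an illustration under assumptions: the initial centroids are supplied by the caller to keep the result deterministic, the iteration count is fixed, and the function names and sample points are hypothetical rather than taken from the listed code.

```python
import math


def euclidean(p, q):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


def kmeans(points, centroids, iterations=10):
    """Lloyd's algorithm with caller-supplied initial centroids."""
    centroids = [list(c) for c in centroids]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: euclidean(p, centroids[k]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # an empty cluster keeps its previous centroid
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical 2-D data forming two well-separated groups.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
labels, centroids = kmeans(points, centroids=[points[0], points[2]])
print(labels)
print(centroids)
```

In a text setting the points would be TF/IDF vectors rather than 2-D coordinates, and the initial centroids would normally be chosen at random or by a seeding heuristic.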
xsd
- Easy Language (易语言) source code routine for fast text similarity calculation; the program demonstrates a method for comparing and computing text similarity.
cos
- Computes the cosine similarity between word vectors, for semantic text mining.
English
- Contains the original English documents together with the documents produced by text preprocessing steps such as special-symbol removal, tokenization, stemming, and similarity calculation; 500 English documents in total.
Chinese
- 500 Chinese documents collected with a web crawler for text preprocessing, including the word segmentation part, the special-symbol removal part, and the final similarity calculation.
EnglishChuLi
- A text preprocessing program written in Python, with implementation code for every step: punctuation removal, stop-word removal, similarity calculation, PCA dimensionality reduction, clustering, and visualization. It runs in PyCharm with a Python 3 development environment.
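
The first two preprocessing steps named in this entry, punctuation removal and stop-word removal, might look like the sketch below. It is not the listed program's code: `preprocess` is a hypothetical helper, the stop-word set is a tiny illustrative sample, and only ASCII punctuation from `string.punctuation` is stripped.

```python
import string

# Illustrative stop-word sample only; a real list would be much longer.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in"}


def preprocess(text):
    """Lowercase, strip ASCII punctuation, split on whitespace, drop stop words."""
    table = str.maketrans("", "", string.punctuation)
    tokens = text.lower().translate(table).split()
    return [t for t in tokens if t not in STOP_WORDS]


print(preprocess("The cat, in the hat, is back!"))
# → ['cat', 'hat', 'back']
```

The resulting token lists are the usual input to later steps such as TF-IDF weighting, dimensionality reduction, and clustering.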
文本相似度计算方法研究综述.pdf
- Text similarity; semantic similarity; ontology; bag-of-words model; neural network; literature review