搜索资源列表
WordCount2
- 基于hadoop1.x的wordcount程序,jar包是全的,只要设置一下即可使用-a word count program depend on hadoop 1.x with all jar files needed,easy to use
1
- 检测中文文章的相似度,首先对中文文章分词处理,然后提取特征,计算特征向量夹角。检验是否相似-Similarity detection Chinese article, the first article of the Chinese word processing and feature extraction, feature vector angle calculation. Test whether similar
distributed_skipgram_mixture-master
- The Distributed Multisense Word Embedding(DMWE) tool is a parallelization of the Skip-Gram Mixture [1] algorithm on top of the DMTK parameter server. It provides an efficient scaling to industry size solution for multi sense word embedding. -The Di