Search results
DeDuplication
- Source code for automatically deleting duplicate records from an Access database
MDment
- Research on and improvement of data deduplication technology based on the MD5 algorithm
sdfs-0.9.7.tar
- An open-source file system with built-in data deduplication
Aspects_of_Deduplication
- Aspects of deduplication
DeDuplication
- A database tool written in VB that removes duplicate records from Access databases
deduplication
- A C implementation of the simhash algorithm, used to detect duplicate articles
TiChong
- A tool for removing duplicate content from text files. At work, some files inevitably contain duplicate content, and handling them becomes tedious when the files are numerous and large, so this small tool was written. Testing showed that it becomes very slow once a file exceeds 30 million lines.
mapr
- Several introductory Hadoop MapReduce examples, including word count, top-K, single-table join, multi-table join, data sorting, data deduplication, and data grouping. Suited to newcomers to Hadoop who want to understand the MapReduce workflow.
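
For readers unfamiliar with the data deduplication case mentioned in the last entry, the idea can be sketched in plain Python. This is not the code from the listed package and does not use Hadoop. It only imitates the map, shuffle, and reduce stages in memory, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `deduplicate`) are hypothetical. The map step emits each record as a key with an empty value, the shuffle groups identical keys, and the reduce step writes each key once.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(lines: Iterable[str]) -> Iterator[Tuple[str, None]]:
    """Emit every non-empty record as a key with an empty value."""
    for line in lines:
        record = line.strip()
        if record:
            yield record, None


def shuffle(pairs: Iterable[Tuple[str, None]]) -> Dict[str, List[None]]:
    """Group values by key, as the framework does between map and reduce."""
    groups: Dict[str, List[None]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups: Dict[str, List[None]]) -> Iterator[str]:
    """Emit each key once, ignoring how many values were grouped under it."""
    for key in sorted(groups):
        yield key


def deduplicate(lines: Iterable[str]) -> List[str]:
    """Run the three stages in memory (an illustration, not a Hadoop job)."""
    return list(reduce_phase(shuffle(map_phase(lines))))


if __name__ == "__main__":
    sample = ["2012-3-1 a", "2012-3-2 b", "2012-3-1 a", "2012-3-3 c"]
    for record in deduplicate(sample):
        print(record)
```

Because identical records share a key, duplicates collapse during grouping and the reduce step never needs to compare records itself. In a real cluster the grouping is distributed across nodes, which this in-memory sketch does not attempt to model.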