资源列表
paoding-analysis-2.0.4
- Paoding中文分词是一个使用Java开发的,可结合到Lucene应用中的,为互联网、企业内部网使用的中文搜索引擎分词组件。 Paoding填补了国内中文分词方面开源组件的空白,致力于此并希翼成为互联网网站首选的中文分词开源组件。 Paoding中文分词追求分词的高效率和用户良好体验。-Paoding Chinese word is a Java development can be combined with Lucene applications for the word componen
ZeroCrawler
- 该程序用于抓取某一网页的所有链接,适合爬虫初学者使用-The procedure used to crawl all the links of a web page, suitable for reptiles beginners
heritrix-1.14.0-src
- 知名网络蜘蛛源码,可以下载整站内容,扩展性强,可以下载动态网页
searchengine
- This document includes the use of Web data mining expertise to carry out the search engine design, and personalized search engine based on the study of documents, rich, do not miss!
interleaver
- interleaver research
MyLucene
- 自己写的Lucene写搜引擎 简单搜索引擎的设计与实现-Writing the Writing their own search engine Lucene search engine easy design and implementation
Clucene
- CLucene是Lucene的一个C++移植,Lucene是一个基于java的高性能的全文搜索引擎。CLucene因为使用C++编写,所以理论上要比lucene快。-The CLucene of Lucene a C++ transplant, Lucene is a java-based high-performance full-text search engine. The CLucene because to use C++ write so theoretically than luc
heritrix-1.14.2-src
- heritrix-1.14.2-src是网络爬虫Heritrix最新版本的源码,希望对大家有帮助-heritrix-1.14.2-src is a network of reptiles Heritrix the latest version of source, in the hope that we have to help
heritrix-1.14.3-src
- 这是一个很好的网络爬虫,很适合一般的搜索引擎!-This is a good web crawler, it is suitable for general search engines!
LucenePINPACTION
- lucene in action 中文版; 学习lucene必备书籍-lucene in action Chinese version learning essential books for lucene
6
- 自己动手写搜索引擎第三章代码,随书光盘中的内容,整个太大,只能分别上传-Chapter code search engine to write himself, with the contents of the CD-ROM, the whole is too big, we were only able to upload