搜索资源列表
Spideroo
- C#写的一个搜索引擎,可以搜索、建立索引等。building a simple search engine that crawls the file system from a specified folder, and indexing all HTML (or other types) of documents. A basic design and object model was developed as well as a query/results page-C# to write
yider_0_5_3
- the Yider is an open source VBscr ipt spider that allows you to quickly add a search system to your site like the one at the top of this page. It stores data in a Microsoft Access or SQL 2000 database with full text searching. The Yider does not requ
weblucene
- Lucene Web interface, use XML as a lightweight protocol. developer can convert data source (text, DB, MS Word, PDF... etc) into xml format, indexing with lucene engine, and get full text search result via HTTP, with XML format output, user can easily
SearchDotnet
- DotLucene: Full-Text Search for Your Intranet or Website using 37 Lines of Code -DotLucene: Full-Text Search for Your Intranet or Website using 37 Lines of Code
20040409baidu
- 老独搜索 (Ver 1.0 build 20040127) 你以为这是百度?错,这只是老独!无需管理的搜索站,本程序和全球中文搜索门户网站baidu.com同步更新,一次安装,无需维护;本程序模仿baidu的风格界面,欢迎各位朋友开发出其他风格的skin! -old independence search (Ver 1.0 build 20040127) You thought it was Baidu? Right or wrong, this just old alone!
aspseek
- ASPSeek是一个C++编写的互联网搜索引擎,并使用了STL库。它主要包括一个检索机器人,一个搜索守护程序,和一个搜索前端(CGI或者是Apache模块)。它大概可以检索几百万个URLs,来查找给定的短语和单词,并使用通配符,进行布尔搜索。搜索结果可以限定在给定的时间或站点,站点空间,并按照相关性或者时间进行排序(这里面使用了一些非常酷的技术)。ASPSeek可以应用于很多语言和编码中(甚至包括多字节语言如中文)。它为多个站点做了优化。(多线程检索,同步DNS查询, 按站点将结果分组, Web
43554TheResearchandDesignofSearchEngine
- 搜索引擎的研究与设计.rar The Research and Design of Search Engine 吉 林 大 学 硕 士 学 位 论 文 搜索引擎(Search Engine)是一个对互联网上的信息资源进行搜集整理, 然后供用户查询的系统,它包括信息搜集、信息整理和用户查询三部分,以目 录分类或全文检索的方式来提供查询服务。本文提出了一种简化的向量空间检 索模型,通过统计主题词条对文档的贡献度来建立倒排序索引库,为用户提供 智能的检索服务。-search
delphi_searchengine
- Search over 200 internet search engines. will launch the users default browser and show the results.. This source uses TLinkLabel By Vitaly Zayko on a few of the tabs It is not needed by the search engine itself. however it is included in
search22
- 用C#编写的一个多线程搜索引擎的源代码,能够并行或串行从多个位置进行搜索。-C# prepared in a multi-threaded search engine source code to parallel or serial number from the location of the search.
luke-src-0.7
- Lucene is an Open Source, mature and high-performance Java search engine. It is highly flexible, and scalable from hundreds to millions of documents. Luke is a handy development and diagnostic tool, which accesses already existing Lucene indexes
802.16jModule
- Recently, IEEE 802.16j multi-hop relay network is proposed to increase data rate and coverage of the IEEE 802.16e networks. The Relay Station (RS) is introduced to relay the data from MR-BS to SS/MS or from SS/MS to MR-RS. We have studied the researc
web_search
- 站内查询小程序~~不错哦比如你键入:新思维网或者xsww.net 你网站里的所以含有新思维或者xsww.net的网页都会查询出来,特别方便实用。想在你的网站中实现查询功能的快下,机会不要错过。-stations for small programs ~ ~ Oh, good for you type : new thinking or xsww.net your network's website containing new thinking so xsww.net or inquir
12spider
- 网络蜘蛛源码。 Spider是搜索引擎的一个自动程序。它的作用是访问互联网上的html网页 ,建立索引数据库,使用户能在搜索引擎中搜索到贵网站的网页。 搜索引擎 派出“蜘蛛”程序检索现有网站一定IP地址范围内的新网站,而对现有网 站的更新则根据该网站的等级不同有快慢之分。一般来说,网站网页等级 越高,更新的频率就越快。搜索引擎的“蜘蛛”同一天会对某些网站或同 一网页进行多次爬行,知道蜘蛛的运动规律,对于更新网页、了解搜索引 擎收录的收录情况等等有相当重要的作用。-Spider-source ne
信息检索报告
- Information Retrieval (IR) is the discipline that deals with retrieval of unstructured data, especially textual documents, in response to a query or topic statement, which mayitselfbeunstructured,e.g.,asentenceorevenanotherdocument,orwhichmay be s
Webloup
- WebLoupe is a java-based tool for analysis, interactive visualization (sitemap), and exploration of the information architecture and specific properties of local or publicly accessible websites. Based on web spider (or web crawler) technology. 开源搜索爬
开放源代码的全文检索引擎Lucene .NET
- 开放源代码的全文检索引擎Lucene .NET Lucene是apache软件基金会[4] jakarta项目组的一个子项目,是一个开放源代码[5]的全文检索引擎工具包,即它不是一个完整的全文检索引擎,而是一个全文检索引擎的架构,提供了完整的查询引擎和索引引擎,部分文本分析引擎(英文与德文两种西方语言)。Lucene的目的是为软件开发人员提供一个简单易用的工具包,以方便的在目标系统中实现全文检索的功能,或者是以此为基础建立起完整的全文检索引擎。,open source Lucene tex
spider.rar
- python的网页爬虫源码,希望对正在学习python或研究爬虫的朋友有帮助,python reptiles page source, and they hope to learn python or research are reptiles friends help
tspider
- TSpider is a application source code library that you can use in your own programs to scrape information from websites. If can be used to download whole websites, or just select information from specific pages. Source code is in Delphi-TSpider is a
GetImage_Eng
- 类似网络爬虫,从一个网页“爬”到另一个网页,然后选择图片下载。多线程。 可以用来按照一定规则下载网页中的元素,如图片、网页、flash等,举例如下-download images or other stuffs by analyzing webpages, search for webpages like a spider. you can config the downloading and crawling strategy in the program
FlickrCrawler
- 用C#自行开发的Flickr爬虫代码,实现了一个HttpRequestHelper类来处理网络请求,调用Flickr的API库来搜索指定内容或者作者的照片,并将返回结果存储到excel文件中。-Flickr reptiles code developed in C#, a HttpRequestHelper class to handle network requests, call the Flickr API library to search for specific content or