Resource search results
VIPS
- A C++ implementation that extracts the content of ordinary web page files and classifies it by tag, as data preprocessing for the VIPS algorithm, with color and font size recorded as attributes.
WebPages_WordSplitting
- Automatically extracts the content of web pages (a simple HTTPAnalyzer class is included) and splits it into words using a built-in Chinese dictionary.
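
The listing does not say which segmentation method WebPages_WordSplitting uses. As a rough illustration of dictionary-based word splitting in general, the sketch below uses forward maximum matching, which is an assumption and not taken from the package. The names `segment`, `dict`, and `max_word_len` are hypothetical, and the text is assumed to be already decoded into UTF-32 code points.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <unordered_set>
#include <vector>

// Forward maximum matching (assumed technique, not the package's code):
// at each position take the longest dictionary word that starts there,
// and fall back to a single character when nothing matches.
std::vector<std::u32string> segment(
    const std::u32string& text,
    const std::unordered_set<std::u32string>& dict,
    std::size_t max_word_len) {
    std::vector<std::u32string> tokens;
    // Treat a limit of 0 as 1 so the scan always advances.
    const std::size_t limit = std::max<std::size_t>(max_word_len, 1);
    std::size_t pos = 0;
    while (pos < text.size()) {
        std::size_t len = std::min(limit, text.size() - pos);
        // Shrink the candidate until it is a dictionary word
        // or only one character is left.
        while (len > 1 && dict.count(text.substr(pos, len)) == 0) {
            --len;
        }
        tokens.push_back(text.substr(pos, len));
        pos += len;
    }
    return tokens;
}
```

With a dictionary containing 自动, 提取, 网页, and 内容 and a `max_word_len` of 4, the input 自动提取网页内容 is split into those four words. Characters that start no dictionary word come out as single-character tokens.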