搜索资源列表
Web-Extraction
- 该程序实现了将腾讯新闻网站的正文部分抽取出来,主要用到的是python的正则表达式处理包,功能简单实现良好-The program to achieve a body part extracted Tencent news sites, mainly used python regular expression processing package, the function is simple to achieve good
CatchNews
- 通过正则表达式分析网页内容,java编写的页面抓取程序-Regular expression analyzes web content, java written pages crawler