搜索资源列表
获取代理案例
- 利用scrapy框架写的python爬虫程序,使用爬取代理的案例来讲解的。(Scrapy framework is used to program crawler procedures in Python.)
gotoweb
- 利用python语言,实现从IP代理网站获取IP,并用此IP重复访问指定网页(Using the python language, the IP is obtained from the IP proxy site, and the specified page is repeatedly accessed with this IP)
xici_proxy
- 爬取西刺前10页(可自行修改参数total_page来管理爬取的页数)有效期大于1天的高匿代理IP,并测试其有效性,最后保存为Proxies.json文件(Unicode),使用时导入文件随机选取一个代理ip使用即可.(Crawl up to 10 pages before the Western thorn, which can modify the parameter total_page to manage the page number of climbing. The high hid
多线程爬虫
- 一个简单健壮的多线程爬虫程序,可以添加代理IP池、url池。并且已经构造好请求头和异常处理(A simple and robust crawler program, which can add agent IP pool and URL pool. And have constructed the request header and exception handling)