GoogleImageCrawler error: downloader reports "Response status code 404"

Posted 2024-05-02 07:36:53

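I am using the Google image crawler from the icrawler.builtin library. I used it four months ago and everything worked fine. Now the same code no longer works. I reduced it to a smaller snippet to test, but I still get a long error. My code is as follows: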

from icrawler.builtin import GoogleImageCrawler

google_crawler = GoogleImageCrawler(storage={'root_dir': 'G:/'})
google_crawler.crawl(keyword="Cool", max_num=1)

The error I get is as follows:

2021-07-30 13:17:41,330 - ERROR - downloader - Response status code 404, file 
https://www.pinterest.com/pin/762093568188447037/","▷ 1001+ ideas For Rebellious And Cool 
Wallpapers For Boys | Cool wallpaper,  Cool backgrounds, 
Cool backgroundss hd",null,null,null,null,null,false,null,null,"Pinterest",null,null,null,null,null,null,null,null,null,null,null,false,{"26":[null,2]}],"2008":[null,"Cool wallpaper, Cool backgrounds ..."],"183836587":["pinterest.com"]}],null,[145,185,9,null,null,128,228,null,null,0,null,null,true,true,null,null,[]],7,null,null,"6jxBx41SFlkx8M",1,null,null,null,null,[[1627643857334508,176546694,3238750392],null,9]],[1,[0,"Ws0JazBdS1EpqM",["https://encrypted-tbn0.gstatic.com/images?q\u003dtbn:ANd9GcSh0VFllWjPPFsQuMOKP3KuORrB2tXmCV1jQ-lYaMIIM2V485kyJgJYRX4Aw0I65wY5gM8\u0026usqp\u003dCAU",178,282],["https://st.depositphotos.com/1085342/2104/i/950/depositphotos_21043079-stock-photo-cool-five-years-old-boy.jpg

My question is whether this can be fixed and, if so, how.
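
One observation that may help narrow this down: the "file" value in the error is not a single URL. It contains several URLs mixed with JSON fragments, which suggests (this is an assumption, not something confirmed here) that the crawler is extracting too much text from the Google results page and then requesting that whole string. The sketch below uses only the standard library to pull the individual URLs out of such a string so they can be inspected one by one. The function name `extract_candidate_urls` is hypothetical and is not part of icrawler.

```python
import re

# Matches an http(s) URL up to the first quote, whitespace, or bracket.
URL_PATTERN = re.compile(r'https?://[^\s"\'\[\]<>]+')


def extract_candidate_urls(logged_value):
    """Return the distinct URLs embedded in a malformed 'file' value, in order.

    Hypothetical helper for inspecting the log output; not an icrawler API.
    """
    # The page source escapes '=' and '&' as \u003d and \u0026; undo that first.
    cleaned = logged_value.replace('\\u003d', '=').replace('\\u0026', '&')
    urls = []
    for url in URL_PATTERN.findall(cleaned):
        if url not in urls:
            urls.append(url)
    return urls
```

Passing the text that follows "file" in the log to `extract_candidate_urls` should return the Pinterest, gstatic, and depositphotos addresses as separate entries, which makes it easier to see that the downloader was handed a concatenated string rather than one image link.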