在网页上抓取一个jpg文件，然后使用python保存它

2条回答

网友

1楼 · 编辑于 2024-09-30 14:23:58

JPEG文件不是文本，而是二进制数据。所以您需要使用request.content属性来访问它。

下面的代码还包含一个get_headers()函数，当您浏览一个网站时，这个函数非常方便。

import requests

def get_headers(url):
    resp = requests.head(url)
    print("Status: %d" % resp.status_code)
    resp.raise_for_status()
    for t in resp.headers.items():
        print('%-16s : %s' % t)

def download(url, fname):
    ''' Download url to fname '''
    print("Downloading '%s' to '%s'" % (url, fname))
    resp = requests.get(url)
    resp.raise_for_status()
    with open(fname, 'wb') as f:
        f.write(resp.content)

def main():
    site = 'http://www.gucci.com/images/ecommerce/styles_new/201501/web_full/'
    basename = '277520_F4CYG_4080_001_web_full_new_theme.jpg'
    url = site + basename
    fname = 'qtest.jpg'

    try:
        #get_headers(url)
        download(url, fname)
    except requests.exceptions.HTTPError as e:
        print("%s '%s'" % (e, url))

if __name__ == '__main__':
    main()

我们调用.raise_for_status()方法，以便get_headers()和{}在出错时引发异常；我们在main()中捕获异常并打印相关信息。

网友

2楼 · 编辑于 2024-09-30 14:23:58

我不确定您使用encode的目的。你不是在处理文本，而是在处理图像。您需要以二进制数据而不是文本的形式访问响应，并使用图像处理函数而不是文本函数。试试这个：

from PIL import Image
from io import BytesIO
import requests

response = requests.get("http://www.gucci.com/images/ecommerce/styles_new/201501/web_full/277520_F4CYG_4080_001_web_full_new_theme.jpg")
bytes = BytesIO(response.content)
image = Image.open(bytes)
image.save("1.jpg")

注意使用response.content而不是{}。您需要安装PIL或枕头才能使用Image模块。BytesIO包含在python3中。

或者，您可以直接将数据保存到磁盘，而不必查看其中的内容：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

在网页上抓取一个jpg文件，然后使用python保存它

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >