获取从s托管的特定图像文件大小

2条回答

网友

1楼 · 编辑于 2024-10-03 00:17:17

如果您只是想通过URL获取文件的内容长度，可以通过只下载HTTP头并检查Content-Length字段来实现：

import requests
url='https://commons.wikimedia.org/wiki/File:Leptocorisa_chinensis_(20566589316).jpg'

http_response = requests.get(url)

print(f"Size of image {url} = {http_response.headers['Content-Length']} bytes")

但是，如果图像在发送之前由服务器压缩，^{}字段将包含压缩文件大小（实际下载的数据量），而不是未压缩的图像大小。你知道吗

要对给定页面上的所有图像执行此操作，可以使用BeautifulSoup HTML processing library提取页面上所有图像的URL列表，并检查文件大小，如下所示：

from time import sleep
import requests
from bs4 import BeautifulSoup as Soup

url='https://en.wikipedia.org/wiki/Agent_Orange'

html = Soup(requests.get(url).text)

image_links = [(url + a['href']) for a in html.find_all('a', {'class': 'image'})]

for img_url in image_links:
    response = requests.get(img_url)
    try:
        print(f"Size of image {img_url} = {response.headers['Content-Length']} bytes")
    except KeyError:
        print(f"Server didn't specify content length in headers for {img_url}")
    sleep(0.5)

您必须根据您的特定问题来调整它，并且可能必须将其他参数传递给^{}，以便将它缩小到您感兴趣的特定图像，但是类似的操作将实现您所要做的。你知道吗

网友

2楼 · 编辑于 2024-10-03 00:17:17

您可以尝试查看是否可以从浏览器中为每个图像发送HEAD请求。HTTP HEAD Request in Javascript/Ajax? 这取决于HTTP服务器是否正确支持它。我也不知道你如何得到的内容长度头，但这听起来像你想要的。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章

获取从s托管的特定图像文件大小

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >