使用Python请求从URL下载PDF

import requests url = 'https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf' response = requests.get(url) with open('C:\Users\User\PycharmProjects\PDFTest\FolderTest\dummy.pdf', 'wb') as f: f.write(response.content)

File "C:\Users\User\PycharmProjects\PDFTest\main.py", line 7 with open('C:\Users\User\PycharmProjects\PDFTest\FolderTest\dummy.pdf', 'wb') as f: ^ SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 2-3: truncated \UXXXXXXXX escape Process finished with exit code 1

2条回答

网友

1楼 · 编辑于 2024-10-02 02:30:50

这是因为您在一个未被转换的字符串中使用了\。尝试在字符串前面使用\\或put和r

您也可以使用pathlib，我发现这更容易：

from pathlib import Path
import requests
filename = Path(r'C:\Users\User\PycharmProjects\PDFTest\FolderTest\dummy.pdf')
url = 'https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf'
response = requests.get(url)
filename.write_bytes(response.content)

网友

2楼 · 编辑于 2024-10-02 02:30:50

出现您的问题是因为解释器难以解析文件的路径，因为它包含unicode转义字符

试一试

file_path = r'drive:\path\to\file'

这实际上是通过告诉解释器将其作为原始字符串读取来转义字符串中的特殊字符

用于替代实现

Tqdm为终端提供进度条

import os
from tqdm import tqdm
import requests

def download(lnk:str, fname:str):
    rq = requests.get(lnk,stream=True)
    totalsize = int(rq.headers['content-length'])
    chunksize = 1024
    if totalsize:
        print(f'\t{round(totalsize*10**-3,2):,} kb')
    with open(fname,'wb') as fobj:
        if totalsize:
            for b in tqdm(iterable=rq.iter_content(chunk_size=chunksize), total = totalsize/chunksize, unit = 'KB'):
                fobj.write(b)
        else:
            for b in tqdm(rq):
                fobj.write(b)
    os.startfile(fname)

相关问题更多 >

编程相关推荐

热门问题

热门文章