表单识别器自定义模型失败，文件类型无效`{“错误”：{“代码”：“1000”，“消息”：“输入文件无效”。}`

import requests import json import os import pathlib # path of file to evaluate floc = 'path/to/file' # extract file type file_type = pathlib.Path(floc).suffix[1:] # set headers with file type and our api key headers = { 'Content-Type': f'application/{file_type}', 'Ocp-Apim-Subscription-Key': os.environ["AZURE_FORM_RECOGNIZER_KEY"] } # read in the file as binary to send files = {'file': open(floc, 'rb')} # post the file to be analysed r = requests.post( f'https://eastus.api.cognitive.microsoft.com/formrecognizer/v2.1/custom/models/{os.environ["MODEL_ID"]}/analyze', headers=headers, files=files ) r

1条回答

网友

1楼 · 发布于 2024-09-24 22:20:53

我已经解决了我的问题，但将把我的答案留给其他人

有两个主要问题：

使用files而不是data发送pdf
使用默认端点（https://eastus.api.cognitive.microsoft.com）而不是我自己的端点

修复程序如下所示：

import requests
import json
import os
import pathlib

# path of file to evaluate
floc = 'path/to/file'

# extract file type
file_type = pathlib.Path(floc).suffix[1:]

# set headers with file type and our api key
headers = {
    'Content-Type': f'application/{file_type}',
    'Ocp-Apim-Subscription-Key': os.environ["AZURE_FORM_RECOGNIZER_KEY"]
}

# post the file to be analysed
r = requests.post(
    f'{endpoint}/formrecognizer/v2.1/custom/models/{os.environ["MODEL_ID"]}/analyze',
    headers=headers, 
    data=open(floc, 'rb') # send binary of your file
)

r

通过转到表单识别器的Azure实例，您可以找到自己的endpoint值：

相关问题更多 >

编程相关推荐

热门问题

热门文章