如何用pytess提高图像识别的可能性

for dImg in range(0, len(imgList)): url = imgList[dImg] local = "img" + str(dImg) + ".jpg" urllib.request.urlretrieve(url, local) imgOpen = Image.open(local) imgOpen.resize((500,500)) imgToString = pytesseract.image_to_string(imgOpen) newEmail.append(imgToString)

2条回答

网友

1楼 · 编辑于 2024-09-29 01:38:20

设置页面分段模式（psm）可能会有所帮助。在

要获得所有可用的psm，请在终端中输入tesseract help-psm。在

然后根据您的需要确定psm。假设您要将图像视为单个文本行，在这种情况下，您的ImgToString变成：

imgToString = pytesseract.image_to_string(imgOpen, config = ' psm 7')

希望这对你有帮助。在

网友

2楼 · 编辑于 2024-09-29 01:38:20

您可以在代码中执行几个预处理步骤。在

1）使用from PIL import Image和{}。您可以检查其他几个设置。在

2）一个稍微先进的方法：使用CNN。您可以使用一些预先培训的cnn。在这里您可以找到更详细的信息：https://www.cs.princeton.edu/courses/archive/fall00/cs426/lectures/sampling/sampling.pdf

畅通节能法

相关问题更多 >

编程相关推荐

热门问题

热门文章