Pytesseract with a custom font classifies digits incorrectly

def getnumber(self, img):
    grey = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    thresh, grey = cv2.threshold(grey, 50, 255, cv2.THRESH_BINARY_INV)
    filename = "{}.png".format(os.getpid())
    cv2.imwrite(filename, grey)
    text = pytesseract.image_to_string(
        Image.open(filename),
        lang='Droid',
        config='--psm 13 --oem 3 -c tessedit_char_whitelist=0123456789.$¢')
    os.remove(filename)
    return text

1 Answer

User

#1 · Posted on 2024-10-02 22:37:52

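You're on the right track. When preprocessing an image for OCR, you want black text on a white background. The idea is to enlarge the image, apply Otsu's threshold to get a binary image, and then run OCR. We use the --psm 6 configuration option to tell Pytesseract to assume a single uniform block of text. See the Tesseract documentation for more configuration options. Here is the processed image: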

OCR result:

2¢

Code

import cv2
import pytesseract
import imutils

# Path to the Tesseract executable (Windows install; adjust or remove on other systems)
pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files\Tesseract-OCR\tesseract.exe"

# Resize, grayscale, Otsu's threshold
image = cv2.imread('1.png')
image = imutils.resize(image, width=500)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
thresh = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)[1]

# Perform text extraction (--psm 6: assume a single uniform block of text)
data = pytesseract.image_to_string(thresh, lang='eng', config='--psm 6')
print(data)

cv2.imshow('thresh', thresh)
cv2.imwrite('thresh.png', thresh)
cv2.waitKey()
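
The config argument is easy to get subtly wrong, since Tesseract expects the page segmentation mode as `--psm N` with both dashes. As a minimal sketch, the helper below assembles such a string from its parts. The name `build_tesseract_config` and its parameters are hypothetical, not part of pytesseract. It only builds the string and does not call Tesseract.

```python
def build_tesseract_config(psm=6, oem=None, whitelist=None):
    """Assemble a config string for pytesseract's `config` argument.

    Hypothetical helper for illustration; it only formats text.
    """
    # Tesseract defines page segmentation modes 0 through 13.
    if not 0 <= psm <= 13:
        raise ValueError("psm must be between 0 and 13")
    parts = ["--psm {}".format(psm)]
    if oem is not None:
        parts.append("--oem {}".format(oem))
    if whitelist:
        # Restrict recognition to the given characters.
        parts.append("-c tessedit_char_whitelist={}".format(whitelist))
    return " ".join(parts)


# The setting used in the answer above.
print(build_tesseract_config())  # --psm 6

# The setting from the question, rebuilt from its parts.
print(build_tesseract_config(psm=13, oem=3, whitelist="0123456789.$¢"))
```

The result can be passed directly, for example `pytesseract.image_to_string(thresh, lang='eng', config=build_tesseract_config())`.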

Environment:

Windows 10
opencv-python==4.2.0.32
pytesseract==0.2.7
numpy==1.14.5
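
The answer's requirement of black text on a white background can be checked after thresholding, because `cv2.THRESH_BINARY_INV` only gives that result when the source image has light text on a dark background. The sketch below is one possible check, and it assumes that text covers less than half of the image, which may not hold for tightly cropped glyphs. The function name `ensure_dark_text_on_light` is hypothetical. It takes a binary image containing only 0 and 255, such as the `thresh` array above.

```python
import numpy as np


def ensure_dark_text_on_light(binary):
    """Return a binary image with black text on a white background.

    Assumption: text pixels are the minority, so the majority value
    is the background. Inverts the image when the background is black.
    """
    binary = np.asarray(binary, dtype=np.uint8)
    if binary.size == 0:
        raise ValueError("empty image")
    # Fraction of pixels that are white (255).
    white_fraction = np.count_nonzero(binary == 255) / binary.size
    if white_fraction < 0.5:
        # Mostly black: background is dark, so flip 0 <-> 255.
        return 255 - binary
    return binary


# Tiny synthetic example: one white "text" pixel on a black background.
sample = np.zeros((4, 4), dtype=np.uint8)
sample[1, 1] = 255
fixed = ensure_dark_text_on_light(sample)
print(fixed[1, 1], fixed[0, 0])  # 0 255
```

In the answer's script this would be applied as `thresh = ensure_dark_text_on_light(thresh)` before calling `pytesseract.image_to_string`.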
