用numpy将列附加到空列表以获得基本的OCR?

2024-09-29 19:20:00 发布

您现在位置:Python中文网/ 问答频道 /正文

我试图制作一个程序,通过一个图像模拟一行文本,从中获取每个字母。考虑到像素的2D数组的图像,如果在连续的列中存在黑色像素,那么这些列将被写入缓冲区。一旦到达没有黑色像素的列(即字母之间的间隔),缓冲区将转入检测到的字母图像。然而,我得到一个编译器错误,我不明白,我希望你们能帮助我。希望你也能理解我没有认识到的任何逻辑错误。你知道吗

总之,代码:

from PIL import Image
import numpy as np

class ShapeDetect:
    def __init__(self):
        self.img = Image.open("test3.jpg")
        self.pixels = self.img.load()
        self.l = np.empty([0, 0])
        self.valid_pixels = ['(0, 0, 0)', '(1, 1, 1)', '(2, 2, 2)', '(3, 3, 3)', '(4, 4, 4)', '(5, 5, 5)']

def printPixels(self):
    for i in range(self.img.size[0]):
        for j in range(self.img.size[1]):
            print(self.pixels[i, j])

def processPixels(self):
    n = 1

    # If a black pixel is in the current column, add it to l
    for i in range (self.img.size[0]):
        for j in range(self.img.size[1]):
            if str(self.pixels[i, j]) in self.valid_pixels:
                self.writeColumn(i)
                break

            # Once a whole shape has been written to l, make l into an image
            else:
                 if self.l.size > 0 and j == self.img.size[1] - 1:
                    new_img = Image.new(self.img.mode, (self.l.size, 100))
                    new_img.putdata(self.l)
                    new_img.save(str(n) + ".jpg")

                    n = n + 1
                    self.l = np.empty([0], [0])

def writeColumn(self, n):
    # put column in pixels in temp, then append temp to l
    temp = np.zeros((1, self.img.size[1]), dtype=np.int64)
    for i in range(self.img.size[1]):
        temp[0, i], = (self.pixels[n, i])

    np.append(self.l, temp, axis = 1)

if __name__ == "__main__":
    shapeDetect = ShapeDetect()
    ShapeDetect.processPixels(shapeDetect)

我得到的错误是:

Traceback (most recent call last):
  File "SD.py", line 46, in <module>
    ShapeDetect.processPixels(shapeDetect)
  File "SD.py", line 23, in processPixels
    self.writeColumn(i)
  File "SD.py", line 40, in writeColumn
    temp[0, i], = (self.pixels[n, i])
ValueError: too many values to unpack

Tags: toinselfimgnewforsizedef
1条回答
网友
1楼 · 发布于 2024-09-29 19:20:00

发生错误的原因是self.pixels[n, i]返回一个像素,该像素有3个值。 看起来您实际上想要所有3个值,但是您在temp[0,i]之后错误地放置了逗号。去掉逗号可以解决这个问题。你知道吗

但是,有一种提取列的更快方法。你可以替换

temp = np.zeros((1, self.img.size[1]), dtype=np.int64)
for i in range(self.img.size[1]):
    temp[0, i], = (self.pixels[n, i])

temp = self.pixels[n, :]

对于numpy数组

相关问题 更多 >

    热门问题