如何找到具体的文字和打印后，我的下两个字

from PIL import Image import pytesseract pytesseract.pytesseract.tesseract_cmd = r'C:\Users\gzi\AppData\Roaming\Python\Python37\site-packages\tesseract.exe' img=Image.open('C:/Users/gzi/Desktop/work/lux.jpg') text = pytesseract.image_to_string(img, lang = 'eng') if 'INGREDIENTS' in text: print("True") else: print("False")

3条回答

网友
1楼 · 编辑于 2024-05-19 09:15:40

因此，假设我们使用pytesseract提取了以下文本：
text = '''Ground Almonds INGREDIENTS: Ground Almonds(100%). 1kg'''
我们可以通过以下方式实现预期结果：
first_index = text.find('INGREDIENTS') second_index = text.find('(') my_string = f'{text[first_index:second_index]}' print(my_string)
输出为：
INGREDIENTS: Ground Almonds
因此在代码片段中，我们使用find方法来定位INGREDIENTS单词和(符号（假设它总是在主成分之后，指定它的百分比）。你知道吗
然后对上述索引使用string切片并打印结果，用f-string将其格式化为所需的输出。你知道吗

网友
2楼 · 编辑于 2024-05-19 09:15:40

使用正则表达式查找所有匹配项：
import re txt = "INGREDIENTS: Ground Almonds(\"100\");" x = re.findall("INGREDIENTS:\s(\w+)\s(\w+)", txt) print(x) # [('Ground', 'Almonds')]

网友
3楼 · 编辑于 2024-05-19 09:15:40

如果您不关心百分比并希望避免regex：

string = 'INGREDIENTS: Ground Almonds(100%).'

tokens = string.split()
for n,i in enumerate(tokens):
    if 'INGREDIENTS' in i:
        print(' '.join(tokens[n:n+3]))

输出：

INGREDIENTS: Ground Almonds(100%).

相关问题更多 >

编程相关推荐

热门问题

热门文章