在python中提取字符串

...... (other text goes here)..... <TD align="left" class=texttd>AAA</TD> ..... (useless text here)..... <TD align="left" class=texttd>BBB</TD> ....(more text)..... <TD align="left" class=texttd>CCC</TD> <TD align="left" class=texttd>DDD</TD> ......(more text).....

3条回答

网友

1楼 · 编辑于 2024-09-30 06:33:26

你可以写一个REGEX，但它在某种程度上是在“解析”HTML。为HTML编写正则表达式的问题是HTML一团糟。它很少是完美的，当您依赖它获取数据时，这会导致问题。在

我个人会用美容素。它确实做了比你要求的更多的事情，但也超出了你的努力。在

网友

2楼 · 编辑于 2024-09-30 06:33:26

您想要BeautifulSoup：

from BeautifulSoup import BeautifulSoup
soup = BeautifulSoup(your_file)

soup.find("font", "textfont")

网友

3楼 · 编辑于 2024-09-30 06:33:26

def foo():
    input_file = open("myfile.txt", 'r')
    input = ''.join(input_file.readlines())

    looking_for = ['AAA', 'BBB', 'CCC', 'DDD']
    have = []

    for thing in looking_for:
        if thing in input:
            have.append(thing)
    return have

相关问题更多 >

编程相关推荐

热门问题

热门文章

在python中提取字符串

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >