Extracting values from an HTML table with BeautifulSoup

User

Reply 1 · edited 2024-10-01 07:17:26

Another solution:

from simplified_scrapy import SimplifiedDoc
html = '''
<td class="celda400" vAlign="center" align="right" width="100" bgColor="#DFEDFF" style="color:Black">
575,42
</td>
<td class="celda400" vAlign="center" align="right" width="100" bgColor="#DFEDFF" style="color:Black">
575,43
</td>
'''
doc = SimplifiedDoc(html)
# Select every <td> with class "celda400" and collect the text of each
texts = doc.selects('td.celda400').text
print(texts)

Result:

['575,42', '575,43']
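
For comparison, the same CSS selector can be used with BeautifulSoup itself, which is the library the question asks about. This is only a sketch under assumptions: the HTML is a trimmed copy of the sample above wrapped in a table row, and the built-in `html.parser` is chosen so that no extra parser has to be installed.

```python
from bs4 import BeautifulSoup

html = """
<table><tr>
<td class="celda400" align="right">
575,42
</td>
<td class="celda400" align="right">
575,43
</td>
</tr></table>
"""

soup = BeautifulSoup(html, "html.parser")

# select() takes a CSS selector and returns a list of every matching tag
cells = soup.select("td.celda400")

# get_text(strip=True) drops the newlines around each value
texts = [cell.get_text(strip=True) for cell in cells]
print(texts)
```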

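User

Reply 2 · edited 2024-10-01 07:17:26

You can extract by any attribute. For example, using the class="celda400" attribute (here `response` is presumably the parsed BeautifulSoup object):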

response.find('td', {'class':"celda400"}).string
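
A minimal, self-contained sketch of that lookup, assuming `response` is a BeautifulSoup object built from the page's HTML (the reply does not show how it was created). Two details are worth noting: `find` returns only the first matching cell, or `None` if there is none, and `.string` keeps the surrounding whitespace, so the value is stripped here. The HTML is a trimmed copy of the sample cells.

```python
from bs4 import BeautifulSoup

html = """
<table><tr>
<td class="celda400" align="right">
575,42
</td>
<td class="celda400" align="right">
575,43
</td>
</tr></table>
"""

# Assumption: `response` is the parsed document, not an HTTP response object
response = BeautifulSoup(html, "html.parser")

# find() returns the first <td> whose class is "celda400", or None
cell = response.find("td", {"class": "celda400"})

# .string is the cell's single text node, including its newlines
first_value = cell.string.strip() if cell is not None and cell.string else None
print(first_value)
```

To get every matching cell rather than just the first one, use `find_all` instead of `find`.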

User

Reply 3 · edited 2024-10-01 07:17:26

You can try this; I think it is self-explanatory:

from bs4 import BeautifulSoup

html_doc = """
    <td class="celda400" vAlign="center" align="right" width="100" bgColor="#DFEDFF" style="color:Black">
    575,42
    </td>
    <td class="celda400" vAlign="center" align="right" width="100" bgColor="#DFEDFF" style="color:Black">
    875,42
    </td>
    """
soup = BeautifulSoup(html_doc, 'lxml')

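# find_all returns every matching <td>, not just the first one
all_td = soup.find_all('td', {'class':"celda400"})

for td in all_td:
    # .text includes the surrounding newlines and indentation, so strip them
    value = td.text.strip()
    print(value)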
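
If the extracted strings are needed as numbers, they have to be converted after extraction. The sketch below assumes the comma in values such as `575,42` is a decimal separator, which the thread does not state, and `parse_decimal_comma` is a hypothetical helper name introduced only for this illustration.

```python
from decimal import Decimal

from bs4 import BeautifulSoup

html_doc = """
<table><tr>
<td class="celda400" align="right">
575,42
</td>
<td class="celda400" align="right">
875,42
</td>
</tr></table>
"""


def parse_decimal_comma(text):
    """Hypothetical helper: treat ',' as the decimal separator (an assumption)."""
    return Decimal(text.strip().replace(",", "."))


soup = BeautifulSoup(html_doc, "html.parser")

# Extract the stripped text of every matching cell, then convert each one
values = [
    parse_decimal_comma(td.get_text(strip=True))
    for td in soup.find_all("td", class_="celda400")
]
print(values)
```

`Decimal` is used rather than `float` so that the values keep exactly the digits shown in the page.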
