使用Beautiful Soup时出现“预期的字符串或缓冲区”错误

import re import urllib from BeautifulSoup import * htm1 = urllib.urlopen('https://pr4e.dr-chuck.com/tsugi/mod/python-data/data/comments_42.html').read() soup = BeautifulSoup(htm1) tags = soup('span') for tag in tags: y = re.findall ('([0-9]+)',tag.txt) print sum(y)

1条回答

网友

1楼 · 发布于 2024-09-29 22:00:17

我建议使用bs4而不是{}（这是旧版本）。您还需要更改以下行：

y = re.findall ('([0-9]+)',tag)

像这样的事情：

^{pr2}$

看看这是否能让你走得更远：

sum = 0  #initialize the sum
for tag in tags:
    y = re.findall ('([0-9]+)',tag.text)  #get the text from the tag                                                                                                                                    
    print(y[0])  #y is a list, print the first element of the list                                                                                                                                      
    sum += int(y[0])  #convert it to an integer and add it to the sum                                                                                                                                   

print('the sum is: {}'.format(sum))

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用Beautiful Soup时出现“预期的字符串或缓冲区”错误

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >