返回BeautifulGroup中不确定数量的段落

def BrightstormPageTest(): soup = Soup(urllib.urlopen('http://brightstorm.com/science/chemistry/chemical-reaction-rates/collision-theory/').read()) relevantTagText = "" for element in soup.findAll("section"): print element.nextSibling

1条回答

网友

1楼 · 发布于 2024-09-26 22:50:01

您需要迭代部分，然后迭代段落。为了演示，我修改了您的代码以打印每个段落的文本。在

from bs4 import BeautifulSoup as Soup

def BrightstormPageTest():
    soup = Soup(urllib.urlopen('http://brightstorm.com/science/chemistry/chemical-reaction-rates/collision-theory/').read())
    sections = soup.findAll("section")
    for section in sections:
        ps = section.findAll("p")
        for p in ps:
            print p.text

def BrightstormPageTest2():
    soup = Soup(urllib.urlopen('http://brightstorm.com/science/chemistry/chemical-reaction-rates/collision-theory/').read())
    sections = soup.findAll("section")
    for section in sections:
        while True:
             try:
                 print section.nextSibling.text
             except TypeError:
                 # .text is a valid method on a <p> element, but not a NavigableString.  
                 break

相关问题更多 >

编程相关推荐

热门问题

热门文章

返回BeautifulGroup中不确定数量的段落

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >