擅长:python、mysql、java
<p>beauthulsoup有自己的regex功能</p>
<pre><code>html = '<html><span>Volume: </span><span class="data_bold"><value>20</value></span></html>'
soup = BeautifulSoup(html)
matches = soup.findAll(text=re.compile('Volume'))
for match in matches:
element = match.parent
#o/p: <span>Volume: </span>
sibling_tag = element.findNextSibling()
#o/p: <span class="data_bold"><value>20</value></span>
print sibling_tag.find('value').text
#o/p: u'20'
</code></pre>
<p>注意:这只是一个没有实际html访问权限的例子</p>