擅长:python、mysql、java
<p>这只是简单的字符串操作,并不是真正的BS4问题。可以做如下的事情。请注意,下面的方法可能不是最好的方法,但我这样做是为了冗长。在</p>
<pre><code>from bs4 import BeautifulSoup as bsoup
ofile = open("test.html")
soup = bsoup(ofile)
soup.prettify()
tds = soup.find_all("td")
templist = [td.get_text() for td in tds]
newlist = []
for temp in templist:
whole = temp.split(":") # Separate by ":" first.
half = whole[0].split("(") # Split the first half using open parens.
first = half[0].strip() # First of three elements.
second = half[1].replace(")","").strip() # Second of three elements.
third = whole[1].strip() # Use the second element for the first split to get third of three elements.
newlist.append([first, second, third])
for lst in newlist:
print lst # Just print it out.
</code></pre>
<p>结果:</p>
^{pr2}$
<p>让我们知道这是否有帮助。在</p>