擅长:python、mysql、java
<pre><code>import bs4
html = '''<tr>
<td class="num cell-icon-string" data-sort-value="6">
<td class="cell-icon-string"><a class="ent-name" href="/pokedex/charizard" title="View pokedex for #006 Charizard">Charizard</a></td>
</tr>
<tr>
<td class="num cell-icon-string" data-sort-value="6">
<td class="cell-icon-string"><a class="ent-name" href="/pokedex/charizard" title="View pokedex for #006 Charizard">Charizard</a><br>
<small class="aside">Mega Charizard X</small></td>
</tr>'''
soup = bs4.BeautifulSoup(html, 'lxml')
</code></pre>
<p>在:</p>
<pre><code>[tr.get_text(strip=True) for tr in soup('tr')]
</code></pre>
<p>输出:</p>
<pre><code>['Charizard', 'CharizardMega Charizard X']
</code></pre>
<p>您可以使用<code>get_text()</code>来连接标记中的所有文本,<code>strip=Ture</code>将删除字符串中的所有空间</p>