擅长:python、mysql、java
<p>您可以使用<code>bs4</code>对象的<code>__getitem__</code>方法访问数据:</p>
<pre><code>import re
from bs4 import BeautifulSoup as soup
s = """
<li>this is li</li>
<li class="c1" data="this is data">ineinieni </li>
<li class="c1" >ineinieni </li>
<li data="this is the data1">ineinieni </li>
<li data="this is the data2">ineinieni </li>
"""
s = soup(s, 'lxml')
final_data = [re.sub('the\s', '', i['data']) for i in s.find_all('li') if re.findall('data\=', str(i))]
</code></pre>
<p>输出:</p>
<pre><code>['this is data', 'this is data1', 'this is data2']
</code></pre>