擅长:python、mysql、java
<p>这适用于发布的数据,但我建议您调整用于刮取的代码,以便在找不到评级时返回“无评级”</p>
<pre class="lang-py prettyprint-override"><code>import pandas as pd
data = [
"Samsung Galaxy A12 ",
"5 out of 5(6)",
"Screenguard",
"Samsung Galaxy Mos / A02s ",
"4 out of 5(1)",
"Pillow",
]
products = [product for product in data if not "out of" in (product)]
ratings = []
idx = 1
for product in products:
idx = data.index(product)
if idx>=len(data)-1:
ratings.append('No rating')
elif not 'out of' in data[idx+1]:
ratings.append('No rating')
else:
ratings.append(data[idx+1])
df = pd.DataFrame({'product':products, 'rating': ratings})
</code></pre>
<pre><code>Sample Output
product rating
0 Samsung Galaxy A12 5 out of 5(6)
1 Screenguard No rating
2 Samsung Galaxy Mos / A02s 4 out of 5(1)
3 Pillow No rating
</code></pre>