<p>我正在使用get_text()获取unicode格式。
如何将Unicode更改为DataFrame中的字符串?在</p>
<p>需要正确的文本格式来整理数据。。。。。
下面是我的代码。。。。在</p>
<pre><code>import requests
from pattern import web
from bs4 import BeautifulSoup
from pandas import *
url = 'http://www.mouthshut.com/product-reviews/amazonin-reviews-925670774-srch'
r = requests.get(url)
bs = BeautifulSoup(r.text)
mouthrev = []
Title = []
for revlist in bs.find_all("li","reviewdetails openshare"):
title = revlist.find_all('div','reviewtitle fl')
title = [g.get_text(strip=True) for g in title]
for parent in revlist.find_all("div", itemprop='description'):
review = parent.find_all('p')
review = [g.get_text(strip=True) for g in review]
mouthrev.<a href="https://www.cnpython.com/list/append" class="inner-link">append</a>(review)
Title.append(title)
mouth1 = DataFrame({'Title' : Series(Title),'Review' : Series(mouthrev)})
mouth1.to_csv('D:\\Review.csv')
</code></pre>
<p>我得到的结果是:</p>
^{pr2}$