<p>这是一个(糟糕的)例子,说明如何做到这一点。。。不过,一种更优雅的方法是将字符串编码为iso8859-1(我认为这就是\xa0的来源)。你知道吗</p>
<pre><code>my_list = [('11 August 1902\xa0(1902-08-11)Paris, France', None),
('29 July 1991(1991-07-29) (aged\xa088)Paris, France', None),
('\xa0France', None),
('\xa0French Army', None),
('1921-1959', None),
('General de brigade', None),
('Mobile Group 2Mobile Group 1Operational Group North-West', None),
('World War IIFirst Indochina War*Battle of Dien Bien Phu', None)]
my_new_list = []
for my_item in my_list:
tuple_first = my_item[0]
tuple_first = tuple_first.replace('\xa0', ' ') # I think really this should be
# encoded with the ISO 8859-1 and
# in this encoding \xa0 is a non
# breaking space... but for now
# I just replace it with a space char
my_new_list.append(tuple_first)
</code></pre>
<p>这是输出(每项新行)</p>
<pre><code>['11 August 1902 (1902-08-11)Paris, France',
'29 July 1991(1991-07-29) (aged 88)Paris, France',
'France',
'French Army',
'1921-1959',
'General de brigade',
'Mobile Group 2Mobile Group 1Operational Group North-West',
'World War IIFirst Indochina War*Battle of Dien Bien Phu'
]
</code></pre>