<pre><code>from lxml import html
import requests
import csv
page = requests.get('http://www.google.com/finance?q=[%28exchange+%3D%3D+%22ABC%22%29]&restype=company&noIL=1&start=0&num=1500')
tree = html.fromstring(page.content)
#Scrape stocks companies and symbols
stocks = tree.xpath('//a [not(@class)][@id][@href]/text()')
#This will create a list of prices
stocks.remove('IRM Group Berhad');
stocks.remove('A & M Realty Berhad');
stocks.remove('BERJAYA FOOD BERHAD- A SHARES');
print 'Stocks= ', stocks
# open a file for writing.
csv_out = open('KLSE.csv', 'wb')
mywriter = csv.writer(csv_out)
rows = zip(stocks)
mywriter.writerows(rows)
csv_out.close()
</code></pre>
<p>我想删除所有的行,其中包含'贝哈德'一词,因为我不想删除它一个接一个。知道怎么做吗?你知道吗</p>
<p>假设股票只是一张普通的单子,你可以试试</p>
<pre><code>trimmed_stocks = [ x for x in stocks if not 'Berhad' in x ]
</code></pre>
<p>从你的帖子里还不清楚,比如说,贝哈德或者贝哈德是否也应该被排除在外,但是这些问题可以用类似的方法处理。你知道吗</p>