擅长:python、mysql、java
<p>使用gangabass中的xpath:</p>
<pre><code>import scrapy
class txt_filter:
txt= '<tr>\
<td><div>2018/2058</div></td>\
<td class="address"><div>Land North of 37 and 39 Hare Lane Claygate Esher Surrey KT10 9BT</div></td>\
<td class="proposal"><div>Confirmation of Compliance with Conditions: 6 (Tree Protection and Pre-Commencement Inspection) and 6 (Tree Protection) of planning permission 2017/0451.</div></td>\
<td><div style="min-width:90px">Claygate Ward</div></td>\
</tr>'
resp = scrapy.http.response.text.TextResponse(body=txt,url='abc',encoding='utf-8')
print(resp.xpath('//tr[1]/td/div/text()').extract())
</code></pre>
<p>只从td中删除了[1]以获取所有行。在</p>