In [38]: from lxml import etree
In [39]: import urllib2
In [40]: html = etree.fromstring(urllib2.urlopen('http://www.insiderpages.com/b/3721895833/central-kia-of-irving-irving').read(), parser)
In [41]: html.xpath('//abbr')[0].xpath('./@title')
Out[41]: ['3']
你试过xpath吗?在
Don't use regex to extract data from html.你有lxml,使用它的幂(XPath)。在
相关问题 更多 >
编程相关推荐