Python and lxml.html: "get element" output problem

import lxml.html
import lxml
import urllib2

webHTML = urllib2.urlopen('http://hobbyking.com/hobbyking/store/__39036__Turnigy_Multistar_2213_980Kv_14Pole_Multi_Rotor_Outrunner.html').read()
webHTML = lxml.html.fromstring(webHTML)
productDetails = webHTML.get_element_by_id('productDetails')
for element in productDetails:
    print element.text_content()

1 answer

User

#1 · Posted 2024-10-02 22:29:39

I'm afraid lxml.html cannot parse this particular HTML source correctly. It parses the h3 tag with id="productDetails" as an empty element (and this is in the default "recover" mode):

<h3 class="productDescription2" id="productDetails" itemprop="description"></h3>

Switch to a different, far more lenient parser:

^{pr2}$
Prints:
Looking for the ultimate power system for your next Multi-rotor project? Look no further!The Turnigy Multistar outrunners are designed with one thing in mind - maximising Multi-rotor performance! They feature high-end magnets, high quality bearings and all are precision balanced for smooth running, these motors are engineered specifically for multi-rotor use.These include a prop adapter and have a built in aluminium mount for quick and easy installation on your multi-rotor frame. outrunner ...
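For anyone who wants to try the "lenient parser" idea without installing anything, here is a minimal sketch using the standard library's html.parser, which only tokenizes the markup and does not move or close tags on its own. This is not necessarily the parser meant above. The class name `TextById`, the helper `text_by_id`, and the sample HTML are all made up for illustration, and the sample merely stands in for the real page. It is written for Python 3 and assumes the tags inside the target element are properly closed.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not change the nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class TextById(HTMLParser):
    """Collect the text nested inside the first element with a given id.

    Hypothetical helper: it trusts the nesting exactly as written in the
    source, so unclosed or stray tags inside the target will confuse it.
    """

    def __init__(self, target_id):
        super().__init__(convert_charrefs=True)
        self.target_id = target_id
        self.depth = 0      # > 0 while we are inside the target element
        self.done = False   # set once the target element has been closed
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if self.done or tag in VOID_TAGS:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if self.depth and tag not in VOID_TAGS:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


def text_by_id(html, target_id):
    """Return the whitespace-normalized text under the element with target_id."""
    parser = TextById(target_id)
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


# Made-up sample: an h3 that wraps block-level children, standing in for the real page.
sample = """
<div><h3 class="productDescription2" id="productDetails" itemprop="description">
<p>Looking for the ultimate power system?</p>
<div>Look no further!<br>Precision balanced.</div>
</h3><p>Unrelated footer</p></div>
"""

print(text_by_id(sample, "productDetails"))
```

Because the nesting is taken as written, the text of the p and div children is still attributed to the h3. The price is that the sketch has no error recovery at all, so for real-world pages a full lenient tree builder is likely the safer choice.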
