擅长:python、mysql、java
<p>像这样的东西应该可以。我删除了所有的隐式等待(顺便说一句,如果你想走这条路线,你应该在你的脚本顶部设置一次,当你恢复你的<code>driver</code>;而且它们很长!)</p>
<pre><code>listOfURLs=['https://oooo.com/Number=xxxxx', 'https://oooo.com/Number/yyyyyy', 'https://oooo.com/Number/zzzzzz']
result=[]
for url in listOfURLs:
driver.get(url)
content = driver.page_source
soup = BeautifulSoup(content, 'html.parser')
contact_text = soup.findAll("div", {"id": "renderContacInfo"})
for tx in contact_text:
output1=tx.find(text="Contact").findNext('h5').text
output2=tx.find(text="Email").findNext('h5').text
output=f"{url} {output1} {output2}"
result.append(output)
driver.quit()
</code></pre>
<p><code>result</code>是一个列表,以url+联系人+电子邮件的形式包含所有收集的输出</p>