<p>You can use Selenium for this task:</p>
<pre><code>import os

from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By

# Path to the chromedriver binary (assumed to sit in the current working directory)
chrome_driver = os.path.join(os.getcwd(), 'chromedriver')
# Initialise the Chrome driver (Selenium 4 style; Selenium 3 took the path directly)
browser = webdriver.Chrome(service=Service(chrome_driver))
# Load the URL
url = 'http://www.mortgagenewsdaily.com/directory/mortgage/alabama'
browser.get(url)
# Find all matching elements
content = browser.find_elements(By.XPATH, '//*[@id="CommonContentInner"]/div/div/div/div/div')
# Get the text from each element
collected_data = []
for item in content:
    personal_data = item.get_attribute("innerText")
    collected_data.append(personal_data)
# Drop empty entries
collected_data = list(filter(None, collected_data))
# Close the browser when done
browser.quit()
</code></pre>
<p>Output:</p>
<pre><code> -
Tyler Tullis
-
Montgomery, Alabama 36117
| (334) 322-3707
-
Nathan Stotlar
Mortgage Production Manager - PrimeLending, a PlainsCapital
Company
Fitchburg, Wisconsin 53717
phone: (608) 467-4249
nathanstotlar.com
-
.
.
.
</code></pre>
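<p>The raw text still contains stray <code>-</code> separators and padding whitespace. A minimal sketch of one way to tidy it follows, assuming each item in <code>collected_data</code> is a single element's <code>innerText</code> with its fields separated by newlines. The helpers <code>clean_entry</code> and <code>clean_entries</code> and the <code>sample</code> list are hypothetical names introduced here for illustration; they are not part of Selenium, and the sample only mimics the output above rather than being fetched from the site.</p>

```python
def clean_entry(raw_text):
    """Split one innerText blob into stripped, non-empty lines.

    Lines consisting only of '-' are treated as separators and dropped
    (an assumption based on the output shown above).
    """
    lines = (line.strip() for line in raw_text.splitlines())
    return [line for line in lines if line and line != "-"]


def clean_entries(collected_data):
    """Clean every entry and discard entries that end up empty."""
    cleaned = (clean_entry(text) for text in collected_data)
    return [entry for entry in cleaned if entry]


# Hypothetical sample shaped like the output above
sample = [
    " -\nTyler Tullis\n -\nMontgomery, Alabama 36117\n | (334) 322-3707",
    "",
    " - ",
]

for entry in clean_entries(sample):
    print(entry)
```

<p>Each cleaned entry is a list of lines, so the name, location and phone fields can be picked out by position or pattern afterwards, depending on how consistent the page's markup turns out to be.</p>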