找到所需的元素后,我得到:
[<div class="statsValue">$1,615,422</div>, <div class="statsValue">1</div>, <div class="statsValue">2</div>]
我只想得到数字$1,615,422
。我该怎么做?我在网上找不到任何有用的东西
这是我的密码:
from selenium import webdriver
from selenium.webdriver.remote import webelement
import pandas as pd
import time
from bs4 import BeautifulSoup
driver = webdriver.Chrome('chromedriver.exe')
driver.get('https://www.redfin.com/')
search_box = driver.find_element_by_name('searchInputBox')
search_box.send_keys('693 Bluebird Canyon Drive, Laguna Beach, CA 92651')
search_box.submit()
time.sleep(2)
def get_address_url(address):
url_list = []
search_box = driver.find_element_by_name('searchInputBox')
search_box.send_keys('693 Bluebird Canyon Drive, Laguna Beach, CA 92651')
search_box.submit()
time.sleep(2)
url_list.append(driver.current_url)
# element = driver.find_elements_by_class_name('statsValue')
# print(element[0].get_attribute('innerHTML'))
soup = BeautifulSoup(driver.page_source, 'html.parser')
data = soup.find_all(lambda tag: tag.name == 'div' and tag.get('class') == ['statsValue'])
print(data)
print(len(data))
print(type(data))
driver.quit()
可以使用regex和
sub
删除not数字:结果:
如果你只想得到数字
$1,615,422
,我想requests
就足够了希望这个代码能帮助你
您需要
text
属性相关问题 更多 >
编程相关推荐