擅长:python、mysql、java
<p>欢迎来到SO!你知道吗</p>
<p>不幸的是,您将无法使用<code>requests</code>(也不能使用<code>requests_html</code>或类似的库)刮取该页面,因为您需要一个工具来处理动态页面,即基于javascript的页面。你知道吗</p>
<p>对于python,我强烈建议使用<code>selenium</code>及其<code>webdriver</code>。下面是一段打印所需输出的代码,即所有列出的作业(注意,需要安装<code>selenium</code>和Firefox webdriver,并使用正确的运行路径)</p>
<pre><code># Import libraries
from bs4 import BeautifulSoup
from selenium import webdriver
# Set the URL you want to webscrape from
url = 'https://germanamerican.csod.com/ux/ats/careersite/5/home?c=germanamerican'
browser = webdriver.Firefox() # initialize the webdriver. I use FF, might be Chromium or else
browser.get(url) # go to the desired page. You might want to wait a bit in case of slow connection
page = browser.page_source # this is the page source, now full with the listings that have been uploaded
soup = BeautifulSoup(page, "lxml")
jobs = soup.findAll('a', {'data-tag' : 'displayJobTitle'})
for j in jobs:
print(j.text)
browser.quit()
</code></pre>