<p>我试图从这个网站的超链接中提取URL:<a href="https://riwayat-file-covid-19-dki-jakarta-jakartagis.hub.arcgis.com/" rel="nofollow noreferrer">https://riwayat-file-covid-19-dki-jakarta-jakartagis.hub.arcgis.com/</a></p>
<p>我使用了以下Python代码:</p>
<pre><code>import requests
from bs4 import BeautifulSoup
url = "https://riwayat-file-covid-19-dki-jakarta-jakartagis.hub.arcgis.com/"
req = requests.get(url, headers)
soup = BeautifulSoup(req.content, 'html.parser')
print(soup.prettify())
links = soup.find_all('a')
for link in links:
if "href" in link.attrs:
print(str(link.attrs['href'])+"\n")
</code></pre>
<p>问题是这段代码没有返回任何URL</p>
<p>我想获取所有这些URL:<br/>
<a href="https://i.stack.imgur.com/WP8Xv.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/WP8Xv.png" alt="I want to get all of this urls"/></a></p>
<p>链接由javascript代码动态生成,数据可以在下面的结构中找到</p>
<pre><code><script id="site-injection">
window.__SITE="your data is here"
</script>
</code></pre>
<p>因此,您需要获取这个<code>script</code>元素并解析<code>window.__SITE</code>的值</p>