擅长:python、mysql、java
<p>尝试下面的脚本来获取它们。您愿意获取的数据包含在注释中,这就是为什么通常的方法不允许您收集这些数据:</p>
<pre><code>from urllib.request import urlopen
from bs4 import BeautifulSoup, Comment
content = urlopen("https://www.baseball-reference.com/leagues/MLB/2018-standard-pitching.shtml")
soup = BeautifulSoup(content.read(),"lxml")
for comment in soup.find_all(string=lambda text:isinstance(text,Comment)):
sauce = BeautifulSoup(comment,"lxml")
for tags in sauce.find_all('tr'):
name = [item.get("csk") for item in tags.find_all("td")[:1]]
print(name)
</code></pre>