<p>You don't need a regular expression at all. A CSS selector and a little string manipulation will get you there. Try the following script:</p>
<pre><code>import requests
from bs4 import BeautifulSoup

# Search results page: Alekhine (white) vs. Naegeli, 1932, drawn games
page_link = 'http://www.chessgames.com/perl/chess.pl?yearcomp=exactly&year=1932&playercomp=white&pid=&player=Alekhine&pid2=&player2=Naegeli&movescomp=exactly&moves=&opening=&eco=&result=1%2F2-1%2F2'
soup = BeautifulSoup(requests.get(page_link).text, 'lxml')
# Take the first link whose href contains "gid=" and keep the text after it
item_num = soup.select_one("[href*='gid=']")['href'].split("gid=")[1]
print(item_num)
</code></pre>
<p>Output:</p>
<pre><code>1012809
</code></pre>
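<p>One caveat with <code>split("gid=")[1]</code>: if the link ever carries further query parameters after <code>gid</code>, they end up in the result too. A minimal sketch of a stricter alternative, using the standard library's <code>urllib.parse</code>, is shown below. The helper name <code>extract_gid</code> and the sample hrefs are illustrative assumptions, not taken from the page itself.</p>

```python
from urllib.parse import urlparse, parse_qs


def extract_gid(href):
    """Return the value of the `gid` query parameter, or None if it is absent.

    `extract_gid` is a hypothetical helper name used for illustration.
    """
    query = urlparse(href).query          # e.g. "gid=1012809&comp=1"
    values = parse_qs(query).get("gid")   # parse_qs maps each key to a list
    return values[0] if values else None


# Sample hrefs are assumed shapes, not copied from the live site.
print(extract_gid("/perl/chessgame?gid=1012809"))
print(extract_gid("/perl/chessgame?gid=1012809&comp=1"))
print(extract_gid("/perl/chessgame"))
```

<p>In the script above, this would be called as <code>extract_gid(soup.select_one("[href*='gid=']")['href'])</code>. It is still not a regex, just a parse of the query string.</p>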