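<p>Here you go, mate. On this site the claims part turns out to be its own HTML section with its own identifier (the <code>itemprop="claims"</code> attribute), which makes things simpler. I just pulled out that section and left you with a string you can play with.</p>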
<pre><code>import requests
from bs4 import BeautifulSoup
# Download the patent page
page = requests.get("https://patents.google.com/patent/EP1208209A1/en?oq=medicinal+chemistry")
soup = BeautifulSoup(page.content, 'html.parser')
# The claims sit in a section element marked with itemprop="claims"
claim_sect = soup.find_all('section', attrs={"itemprop":"claims"})
print('This is the raw content: \n')
print(str(claim_sect))
print('This is the variable type: \n')
print(str(type(claim_sect)))
# find_all returns a ResultSet; take the first match and convert it to a string
str_sect = str(claim_sect[0])
</code></pre>
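<p>If you want the claim text without the surrounding tags, something along these lines might work. It is only a sketch: <code>SAMPLE_HTML</code> and the helper <code>claims_text</code> are made up for illustration, and the sample markup is a simplified stand-in, so the real page may be structured differently.</p>

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the downloaded page;
# the structure of the real page may differ.
SAMPLE_HTML = """
<html><body>
<section itemprop="claims">
  <div class="claim">1. A compound of formula (I).</div>
  <div class="claim">2. The compound of claim 1 for use as a medicament.</div>
</section>
</body></html>
"""


def claims_text(html):
    """Return the plain text of the first claims section, or None if there is none."""
    soup = BeautifulSoup(html, "html.parser")
    section = soup.find("section", attrs={"itemprop": "claims"})
    if section is None:
        return None
    # strip=True trims each text fragment and skips empty ones;
    # the separator puts each remaining fragment on its own line.
    return section.get_text(separator="\n", strip=True)


if __name__ == "__main__":
    print(claims_text(SAMPLE_HTML))
```

<p>With the real page you would pass <code>page.content</code> instead of the sample string. Returning <code>None</code> when the section is missing avoids the <code>IndexError</code> you would get from indexing an empty result.</p>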