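<p>You need to extract the title from the link inside each table data cell. Because every <code>&lt;td&gt;</code> here contains an <code>&lt;a&gt;</code>, you can simply call <code>td.find('a')['title']</code>.</p>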
<pre><code>import re

import requests
from bs4 import BeautifulSoup

nyaa_link = 'https://nyaa.si/'
response = requests.get(nyaa_link, headers={'User-Agent': 'Mozilla/5.0'})
soup = BeautifulSoup(response.content, 'lxml')

# Collect torrent names from the title attribute of the first link in each cell
titles = []
for cell in soup.find_all('td', colspan='2'):
    link = cell.find('a')
    if link is None or not link.has_attr('title'):
        continue  # skip cells without a titled link
    desired_title = link['title']
    # Skip the comment-count link, whose title mentions "comment"
    if 'comment' not in desired_title:
        titles.append(desired_title)

# Collect magnet links
magnets = []
for link in soup.find_all('a', attrs={'href': re.compile('^magnet')}):
    magnets.append(link.get('href'))

print(magnets)
</code></pre>
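<p>The code above builds the names and the magnet links as two separate lists, so they can drift out of step if a row is skipped. A minimal sketch of one way to keep them paired, by walking each table row and reading both values from it, might look like the following. The <code>SAMPLE_HTML</code> markup and the <code>extract_entries</code> helper are hypothetical and only assumed to resemble the real page; the built-in <code>html.parser</code> is used so the sketch runs without a network request.</p>

```python
from bs4 import BeautifulSoup

# Hypothetical markup, assumed to resemble the listing table; not fetched from the site.
SAMPLE_HTML = """
<table>
  <tr>
    <td colspan="2">
      <a href="/view/1#comments" class="comments" title="3 comments">3</a>
      <a href="/view/1" title="First Release">First Release</a>
    </td>
    <td><a href="magnet:?xt=urn:btih:aaa">magnet</a></td>
  </tr>
  <tr>
    <td colspan="2"><a href="/view/2" title="Second Release">Second Release</a></td>
    <td><a href="magnet:?xt=urn:btih:bbb">magnet</a></td>
  </tr>
</table>
"""


def extract_entries(html):
    """Return (title, magnet) pairs, one per table row that has both."""
    soup = BeautifulSoup(html, "html.parser")
    entries = []
    for tr in soup.find_all("tr"):
        cell = tr.find("td", colspan="2")
        magnet = tr.find("a", href=lambda h: h and h.startswith("magnet"))
        if cell is None or magnet is None:
            continue  # not a listing row
        # Keep titled links only, ignoring the comment counter.
        titles = [
            a["title"]
            for a in cell.find_all("a", title=True)
            if "comment" not in a["title"]
        ]
        if titles:
            entries.append((titles[0], magnet["href"]))
    return entries


entries = extract_entries(SAMPLE_HTML)
for title, magnet in entries:
    print(title, magnet)
```

<p>Under these assumptions, a row whose first link is the comment counter still yields its title, because every titled link in the cell is checked rather than only the first one.</p>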