Python：抓取游戏名

def ScrapeK10(): siteToScrape = 'http://www.kiz10.com/new-games' print '\n[!] Requesting Kiz10..' kizReq = requests.get(siteToScrape) print '\n[!] Scraping Newest Games...' kizTree - html.fromstring(kizReq.content) kizElement = kizTree.xpath('//strong[@class="bx-caption"]/text()') print 'Latest Games : ', kizElement, '\n' return

1条回答

网友

1楼 · 发布于 2024-09-29 21:30:11

你会用正则表达式吗？请注意，所有游戏名称都包含在名为“itemsGame”的JavaScript对象中。你知道吗

使用regex将其过滤掉，然后再次使用regex拆分每一行。你知道吗

这应该够了

def main():
    import re
    import requests
    url = "http://kiz10.com/index.php?page=newgames"
    raw = requests.get(url).content
    match = re.search("var itemsGame = \[(.*?)\];$", raw, re.M)
    for line in re.findall('\[(.*?)\]', match.group(1)):
        print(line.replace("'", "").split(",")[3].strip())

或者，您也可以对var itemsGame中的字符串调用eval（）= 到下一个\n字符。你知道吗

显然，eval总是很危险的，从来没有真正被推荐过

相关问题更多 >

编程相关推荐

热门问题

热门文章