How do I use pywikibot.Page(site, title).text when the title contains an unescaped apostrophe (')?

import pywikibot

# `graph` is assumed to be an existing graph-database handle defined elsewhere
cities = [n["name"] for n in graph.nodes.match("City")]
for city in cities:
    site = pywikibot.Site(code="en", fam="wikivoyage")
    page = pywikibot.Page(site, city)
    text = page.text

import re
import pywikibot

cities = [n["name"] for n in graph.nodes.match("City")]
city = "L'Aquila"
altered_city = re.sub("'", "\'", city)
print(altered_city)
site = pywikibot.Site(code="en", fam="wikivoyage")
page = pywikibot.Page(site, altered_city)
print(page)
print(page.text)

[[wikivoyage:en:L'Aquila]]
{{pagebanner|Pagebanner default.jpg}}
'''L'Aquila''' is the capital of the province of the same name in the region of [[Abruzzo]] in [[Italy]] and is located in the northern part of the..

import re
import pywikibot

cities = [n["name"] for n in graph.nodes.match("City")]
city_from_list = cities[0]
print(city_from_list)
print(type(city_from_list))
altered_city = re.sub("'", "\'", city_from_list)
site = pywikibot.Site(code="en", fam="wikivoyage")
page = pywikibot.Page(site, altered_city)
print(page)
print(page.text)

2 answers

User

Answer 1 · edited 2024-09-26 22:44:54

re.sub("'", "\'", city) does nothing:

>>> city = "L'Aquila"
>>> re.sub("'",  "\'", city)
"L'Aquila"
>>> city == re.sub("'",  "\'", city)
True

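Python treats "\'" as "'". See the table of escape sequences in the Python documentation under Lexical analysis # String and Bytes literals.

I don't know why the second part of your code isn't working for you, but it should work. Maybe you just didn't execute the last line. Even if page.text returned None, the print statement would still print None. Try print(type(page.text)).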
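A small standard-library check makes the point concrete. This is only a sketch: the names unchanged and escaped are illustrative, and the second half just shows what a literal backslash would look like, not something the page title needs.

```python
import re

city = "L'Aquila"

# "\'" is an escape sequence for a plain apostrophe, so this replaces
# each apostrophe with an identical apostrophe: a no-op.
unchanged = re.sub("'", "\'", city)
print("\'" == "'")        # the two literals are the same one-character string
print(unchanged == city)  # the title is unchanged

# For comparison only: "\\'" is a backslash followed by an apostrophe.
# A wiki title should NOT be altered like this; it merely shows what
# an actual backslash in the string would look like.
escaped = city.replace("'", "\\'")
print(repr(escaped), len(city), len(escaped))
```

Since the apostrophe needs no escaping inside a Python str, the title can be passed to pywikibot.Page unchanged.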

User

Answer 2 · edited 2024-09-26 22:44:54

Pywikibot works as expected for L'Aquila, for example:

>>> import pywikibot
>>> site = pywikibot.Site('wikivoyage:en')
>>> page = pywikibot.Page(site, "L'Aquila")
>>> print(page.text[:100])
{{pagebanner|Pagebanner default.jpg}}
'''L'Aquila''' is the capital of the province of the same name

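It seems that your cities[0] is not identical to "L'Aquila". Note that page.text always gives a str and never returns None. You can use the exists() method to check whether a page exists: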

>>> page = pywikibot.Page(site, "L'Aquila")
>>> page.exists()
True
>>>
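One way to see how cities[0] differs from "L'Aquila" is to print its repr() and the code point of each character. The sketch below assumes, purely for illustration, that the stored name contains a typographic apostrophe (U+2019) and a trailing space; the thread does not confirm the actual cause. describe_chars and normalize_apostrophes are hypothetical helpers, not part of Pywikibot.

```python
import unicodedata


def describe_chars(title):
    """Return (character, code point, Unicode name) for each character."""
    return [
        (ch, f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"))
        for ch in title
    ]


def normalize_apostrophes(title):
    """Replace common apostrophe look-alikes with ASCII ' and trim whitespace."""
    lookalikes = {"\u2018": "'", "\u2019": "'", "\u02bc": "'"}
    return title.translate(str.maketrans(lookalikes)).strip()


# Hypothetical value standing in for cities[0]; the real value may differ.
city_from_list = "L\u2019Aquila "

print(repr(city_from_list))               # repr() exposes trailing whitespace
print(describe_chars(city_from_list)[1])  # details of the second character

cleaned = normalize_apostrophes(city_from_list)
print(cleaned == "L'Aquila")
```

If the cleaned title matches, it can be passed to pywikibot.Page(site, cleaned), and exists() can confirm the page before reading page.text.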
