<p>由于已标记Selenium,您可以执行以下更改以消除上述错误:-</p>
<p>这是纯selenium解决方案,如果您有一个正在运行的驱动程序实例,请使用如下<code>FirefoxOptions</code>重新配置它:</p>
<pre><code>options = webdriver.FirefoxOptions()
options.binary_location = r"C:\Program Files\Mozilla Firefox\firefox.exe"
driver = webdriver.Firefox(executable_path=r'\geckodriver.exe full path here', firefox_options=options)
driver.get("https://www.montratec.com1")
</code></pre>
<p>用于爬虫程序(基于py3 asyncio和aiohttp库的Web抓取框架)</p>
<p>安装:</p>
<pre><code>pip install crawler
</code></pre>
<p><strong>示例代码:</strong></p>
<pre><code>import re
from itertools import islice
from crawler import Crawler, Request
RE_TITLE = re.compile(r'<title>([^<]+)</title>', re.S | re.I)
class TestCrawler(Crawler):
def task_generator(self):
for host in islice(open('var/domains.txt'), 100):
host = host.strip()
if host:
yield Request('http://%s/' % host, tag='page')
def handler_page(self, req, res):
print('Result of request to {}'.format(req.url))
try:
title = RE_TITLE.search(res.body).group(1)
except AttributeError:
title = 'N/A'
print('Title: {}'.format(title))
bot = TestCrawler(concurrency=10)
bot.run()
</code></pre>
<p>正式参考<a href="https://pypi.org/project/crawler/" rel="nofollow noreferrer">here</a></p>