Python asyncio/aiohttp: ValueError: too many file descriptors in select() on Windows

import asyncio
import aiohttp

MAXitems = 30

async def getHeaders(url, session, sema):
    async with session:
        async with sema:
            try:
                async with session.head(url) as response:
                    try:
                        if "html" in response.headers["Content-Type"]:
                            return url, True
                        else:
                            return url, False
                    except:
                        return url, False
            except:
                return url, False


def checkUrlsWithoutHtml(listOfUrls):
    headersWithoutHtml = set()
    while(len(listOfUrls) != 0):
        blockurls = []
        print(len(listOfUrls))
        items = 0
        for num in range(0, len(listOfUrls)):
            if num < MAXitems:
                blockurls.append(listOfUrls[num - items])
                listOfUrls.remove(listOfUrls[num - items])
                items +=1
        loop = asyncio.get_event_loop()
        semaphoreHeaders = asyncio.Semaphore(50)
        session = aiohttp.ClientSession()
        data = loop.run_until_complete(asyncio.gather(*(getHeaders(url, session, semaphoreHeaders) for url in blockurls)))
        for header in data:
            if False == header[1]:
                headersWithoutHtml.add(header)
    return headersWithoutHtml


listOfUrls = ['http://www.google.com', 'http://www.reddit.com']
headersWithoutHtml= checkUrlsWithoutHtml(listOfUrls)
for header in headersWithoutHtml:
    print(header[0])

2 Answers

User

#1 · edited 2024-10-01 09:22:04

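By default, Windows can use only 64 sockets in an asyncio loop. This is a limitation of the underlying select() API call.

To raise the limit, use ProactorEventLoop. See the setup instructions here.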
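As a rough illustration of that suggestion, the sketch below picks a ProactorEventLoop on Windows and falls back to the platform default elsewhere. The helper name `make_event_loop` and the placeholder coroutine `main` are assumptions made for this example, not part of the original answer. On Python 3.8 and later, ProactorEventLoop is already the default loop on Windows, so the explicit choice mainly matters on older versions.

```python
import asyncio
import sys


def make_event_loop():
    """Return a loop that does not rely on select() when running on Windows.

    Hypothetical helper for illustration only.
    """
    if sys.platform == "win32":
        # ProactorEventLoop is only defined on Windows; it uses I/O completion
        # ports instead of select().
        return asyncio.ProactorEventLoop()
    # On other platforms, keep whatever loop type is the default.
    return asyncio.new_event_loop()


async def main():
    # Placeholder coroutine; real code would issue its aiohttp requests here.
    await asyncio.sleep(0)
    return "done"


loop = make_event_loop()
try:
    result = loop.run_until_complete(main())
finally:
    loop.close()
```

In the question's code, this would stand in for the `asyncio.get_event_loop()` call, with the loop created once outside the `while` loop rather than on every iteration.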

User

#2 · edited 2024-10-01 09:22:04

I had the same problem. I'm not 100% sure this will work, but try replacing this:

session = aiohttp.ClientSession()

with this:

[replacement code not preserved in the source]

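By default, limit is set to 100 (docs), which means the client can have 100 connections open at the same time. As Andrew mentioned, Windows can only have 64 sockets open at a time, so we pass a number lower than 64 instead.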
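The answer's replacement snippet did not survive in this copy, so the following is only a sketch of what capping the connection limit might look like, assuming the cap is applied through `aiohttp.TCPConnector(limit=...)`. The value 60 and the helper names `open_limited_session`, `is_html`, and `check_urls` are illustrative choices, not taken from the answer.

```python
import asyncio
import aiohttp

# Assumption: any value below 64 matches the answer's reasoning; 60 is arbitrary.
CONNECTION_LIMIT = 60


async def open_limited_session():
    """Create a ClientSession whose connector caps simultaneous connections."""
    connector = aiohttp.TCPConnector(limit=CONNECTION_LIMIT)
    return aiohttp.ClientSession(connector=connector)


async def is_html(session, url):
    """Return True if a HEAD request reports an HTML Content-Type."""
    try:
        async with session.head(url) as response:
            return "html" in response.headers.get("Content-Type", "")
    except aiohttp.ClientError:
        # Treat connection and protocol errors as "not HTML".
        return False


async def check_urls(urls):
    """Check all URLs through one shared session, closed once at the end."""
    session = await open_limited_session()
    async with session:
        return await asyncio.gather(*(is_html(session, url) for url in urls))
```

It could be called as `asyncio.run(check_urls(['http://www.google.com', 'http://www.reddit.com']))`. A single session is shared across all requests here, so the connector's limit applies to the whole batch rather than to each URL separately.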
