在Python中梳理不同类的几个大数据结构；如何在减少内存使用的同时合并和存储所需的数据？

网友

1楼 · 编辑于 2024-09-27 21:29:38

如果设备响应按顺序排列并按主机分组，则不需要字典，只需要三个列表：

last_host = None
hosts = []                # the list of hosts
host_responses = []       # the list of responses for each host
responses = []
for output in expandresults:
    if output.val is not None:
        if output.hostname != last_host:    # new host
            if last_host:    # only append host_responses after a new host
                host_responses.append(responses)
            hosts.append(output.hostname)
            responses = [output.val]        # start the new list of responses
            last_host = output.hostname
        else:                               # same host, append the response
            responses.append(output.val)
host_responses.append(responses)

for host, responses in zip(hosts, host_responses):
    self.WriteOut(host, ','.join(responses))

网友

2楼 · 编辑于 2024-09-27 21:29:38

通过使用探查器，您可能更容易确定内存的去向：

https://pypi.python.org/pypi/memory_profiler

另外，如果您已经在调整fastsnmpy类，那么您只需更改实现来为您进行基于字典的结果合并，而不是让它先构造一个巨大的列表。在

你要坚持多久？如果重用结果列表，它将无限期增长。在

网友

3楼 · 编辑于 2024-09-27 21:29:38

内存消耗是由于以非绑定方式实例化了几个worker。在

I've updated fastsnmpy (latest is version 1.2.1 ) and uploaded it to PyPi. You can do a search from PyPi for 'fastsnmpy', or grab it directly from my PyPi page here at FastSNMPy

刚刚更新完文档，并将它们发布到位于fastSNMPy DOCS的项目页面

我在这里所做的基本上是用来自多处理的进程池替换早期的未绑定worker模型。它可以作为参数传入，也可以默认为1。在

为了简单起见，现在只有两种方法。 snmpwalk（进程=n）和snmpbulkwalk（进程=n）

你不应该再看到内存问题了。如果有，请在github上ping我。在

怎么回事

代码活动

我如何处理数据

罪魁祸首

相关问题更多 >

编程相关推荐

热门问题

热门文章