SumT中Python磁盘使用率高

2024-06-26 10:27:11 发布

您现在位置:Python中文网/ 问答频道 /正文

我在python程序中遇到了一些奇怪的行为。基本上,当我试图创建和填充一个长度大于1000的SumTree时,我的磁盘使用量增加了很多,达到了300MB/s,然后程序就死掉了。你知道吗

我很确定在这个过程中没有涉及到文件r/w,问题在于add函数。代码如下所示。你知道吗

import numpy as np

class SumTree():

    trans_idx = 0

    def __init__(self, capacity):
        self.num_samples = 0
        self.capacity = capacity
        self.tree = np.zeros(2 * capacity - 1)
        self.transitions = np.empty(self.capacity, dtype=object)

    def add(self, p, experience):
        tree_idx = self.trans_idx + self.capacity - 1
        self.transitions[self.trans_idx] = experience
        self.transitions.append(experience)
        self.update(tree_idx, p)

        self.trans_idx += 1
        if self.trans_idx >= self.capacity:
            self.trans_idx = 0

        self.num_samples = min(self.num_samples + 1, self.capacity)

    def update(self, tree_idx, p):
        diff = p - self.tree[tree_idx]
        self.tree[tree_idx] = p
        while tree_idx != 0:
            tree_idx = (tree_idx - 1) // 2
            self.tree[tree_idx] += diff

    def get_leaf(self, value):
        parent_idx = 0
        while True:  
            childleft_idx = 2 * parent_idx + 1  
            childright_idx = childleft_idx + 1
            if childleft_idx >= len(self.tree):  
                leaf_idx = parent_idx
                break
            else:  
                if value <= self.tree[childleft_idx]:
                    parent_idx = childleft_idx
                else:
                    value -= self.tree[childleft_idx]
                    parent_idx = childright_idx

        data_idx = leaf_idx - self.capacity + 1
        return leaf_idx, self.tree[leaf_idx], self.transitions[data_idx]

    @property
    def total_p(self):
        return self.tree[0]  # the root

    @property
    def volume(self):
        return self.num_samples  # number of transistions stored

下面是使用SumTree对象的示例:

def add(self, experience)
    max_p = np.max(self.tree.tree[-self.tree.capacity:])
    if max_p == 0:
        max_p = 1.0
    exp = self.Experience(*experience)
    self.tree.add(max_p, exp)  

其中Experience是一个命名元组,self.tree是一个Sumtree实例,当我删除最后一行时,高磁盘使用率就消失了。你知道吗

有人能帮我吗?你知道吗


Tags: selfaddtreetransdefnpnummax