Singleton Python generator? Or, pickle a Python generator?

# HYPERPARAMETERS and wordmap are defined elsewhere in the program.

def get_train_example():
    # Yield a sliding window of word ids for every line of the training file.
    for l in open(HYPERPARAMETERS["TRAIN_SENTENCES"]):
        prevwords = []
        for w in l.split():
            w = w.strip()
            prevwords.append(wordmap.id(w))
            if len(prevwords) >= HYPERPARAMETERS["WINDOW_SIZE"]:
                yield prevwords[-HYPERPARAMETERS["WINDOW_SIZE"]:]

def get_train_minibatch():
    # Group consecutive examples into fixed-size minibatches.
    minibatch = []
    for e in get_train_example():
        minibatch.append(e)
        if len(minibatch) >= HYPERPARAMETERS["MINIBATCH SIZE"]:
            assert len(minibatch) == HYPERPARAMETERS["MINIBATCH SIZE"]
            yield minibatch
            minibatch = []

3 Answers

User

#1 · edited 2024-10-01 15:43:41

This may not be an option for you, but Stackless Python (http://stackless.com) lets you pickle things like functions and generators, under certain conditions. This will work:

In foo.py:

def foo():
    with open('foo.txt') as fi:
        buffer = fi.read()
    del fi
    for line in buffer.split('\n'):
        yield line

In foo.txt (one entry per line; the session below reads the first two):

line1
line2

In the interpreter:

Python 2.6 Stackless 3.1b3 060516 (python-2.6:66737:66749M, Oct  2 2008, 18:31:31)
IPython 0.9.1 -- An enhanced Interactive Python.

In [1]: import foo

In [2]: g = foo.foo()

In [3]: g.next()
Out[3]: 'line1'

In [4]: import pickle

In [5]: p = pickle.dumps(g)

In [6]: g2 = pickle.loads(p)

In [7]: g2.next()
Out[7]: 'line2'

One thing to note: you must buffer the contents of the file and delete the file object. This means the contents of the file are duplicated inside the pickle.
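
For comparison, here is a minimal sketch of what happens without Stackless. It assumes a stock CPython interpreter, and `count_up` and `try_pickle_generator` are names made up for this illustration. The suspended generator is rejected by `pickle`, which is why the approach above needs a different interpreter.

```python
import pickle


def count_up(limit):
    """A trivial generator used only for this illustration."""
    for i in range(limit):
        yield i


def try_pickle_generator():
    gen = count_up(3)
    next(gen)  # advance once so there is suspended state worth saving
    try:
        pickle.dumps(gen)
    except TypeError as exc:
        # Stock CPython has no pickle support for generator objects.
        return "TypeError: %s" % exc
    return "pickled OK"


result = try_pickle_generator()
print(result)
```

On an interpreter that does support generator pickling, the same function would be expected to return "pickled OK" instead.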

User
#2 · edited 2024-10-01 15:43:41

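The code below should do more or less what you want. The first class defines something that acts like a file but can be pickled. (When you unpickle it, it re-opens the file and seeks to the position the file was at when you pickled it.) The second class is an iterator that generates word windows.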

class PickleableFile(object):
    def __init__(self, filename, mode='rb'):
        self.filename = filename
        self.mode = mode
        self.file = open(filename, mode)

    def __getstate__(self):
        # Pickle the name, mode and offset instead of the file object itself.
        state = dict(filename=self.filename, mode=self.mode,
                     closed=self.file.closed)
        if not self.file.closed:
            state['filepos'] = self.file.tell()
        return state

    def __setstate__(self, state):
        # Re-open the file and return to the saved offset.
        self.filename = state['filename']
        self.mode = state['mode']
        self.file = open(self.filename, self.mode)
        if state['closed']:
            self.file.close()
        else:
            self.file.seek(state['filepos'])

    def __getattr__(self, attr):
        # Delegate everything else (readline, seek, tell, ...) to the real file.
        return getattr(self.file, attr)


class WordWindowReader(object):
    def __init__(self, filenames, window_size):
        self.filenames = filenames
        self.window_size = window_size
        self.filenum = 0
        self.stream = None
        self.filepos = 0
        self.prevwords = []
        self.current_line = []

    def __iter__(self):
        return self

    def next(self):
        # Read through files until we have a non-empty current line.
        while not self.current_line:
            if self.stream is None:
                if self.filenum >= len(self.filenames):
                    raise StopIteration
                else:
                    self.stream = PickleableFile(self.filenames[self.filenum])
                    self.stream.seek(self.filepos)
                    self.prevwords = []
            line = self.stream.readline()
            self.filepos = self.stream.tell()
            if not line:
                # End of file ('' in text mode, b'' in binary mode).
                self.stream.close()
                self.stream = None
                self.filenum += 1
                self.filepos = 0
            else:
                # Reverse line so we can pop off words.
                self.current_line = line.split()[::-1]

        # Get the first word of the current line, and add it to
        # prevwords.  Truncate prevwords when necessary.
        word = self.current_line.pop()
        self.prevwords.append(word)
        if len(self.prevwords) > self.window_size:
            self.prevwords = self.prevwords[-self.window_size:]

        # If we have enough words, then return a word window;
        # otherwise, go on to the next word.
        if len(self.prevwords) == self.window_size:
            # Return a copy: the internal list is appended to on the next call.
            return list(self.prevwords)
        else:
            return self.next()

    __next__ = next  # Python 3 name for the same method
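
To see the save-the-offset idea in isolation, here is a condensed round-trip sketch. `ResumableLineReader` and `demo` are hypothetical names invented for this example, not part of the answer's code, and the temporary file stands in for a real training file. It assumes the file does not change between pickling and unpickling.

```python
import os
import pickle
import tempfile


class ResumableLineReader(object):
    """Hypothetical helper: reads lines and can be pickled mid-file."""

    def __init__(self, filename):
        self.filename = filename
        self.file = open(filename, 'rb')

    def __iter__(self):
        return self

    def __next__(self):
        line = self.file.readline()
        if not line:
            raise StopIteration
        return line.rstrip(b'\n')

    next = __next__  # Python 2 name for the same method

    def __getstate__(self):
        # Store the byte offset instead of the unpicklable file object.
        return {'filename': self.filename, 'filepos': self.file.tell()}

    def __setstate__(self, state):
        # Re-open the file and seek back to where the original had got to.
        self.filename = state['filename']
        self.file = open(self.filename, 'rb')
        self.file.seek(state['filepos'])


def demo():
    # Write a small sample file, read one line, then resume from a pickle.
    with tempfile.NamedTemporaryFile('wb', suffix='.txt', delete=False) as tmp:
        tmp.write(b'line1\nline2\nline3\n')
    try:
        reader = ResumableLineReader(tmp.name)
        first = next(reader)
        restored = pickle.loads(pickle.dumps(reader))
        second = next(restored)
        reader.file.close()
        restored.file.close()
        return first, second
    finally:
        os.remove(tmp.name)


first, second = demo()
```

Unlike the Stackless approach, only the file name and offset end up in the pickle, so the file contents are not duplicated, but the file has to still exist when the pickle is loaded.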

User
#3 · edited 2024-10-01 15:43:41

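You can create a standard iterator object instead; it just won't be as convenient as a generator. You need to store the iterator's state on the instance (so that it gets pickled) and define a next() function that returns the next object: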

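class TrainExampleIterator(object):
    def __init__(self):
        # set up internal state here
        pass

    def __iter__(self):
        # an iterator returns itself, so for loops accept it
        return self

    def next(self):
        # return next item here, or raise StopIteration when done
        pass

The iterator protocol is that simple: give the object a .next() method, plus an __iter__() method that returns self, and it can be passed to for loops and the like.

In Python 3, the iterator protocol uses the __next__ method instead (somewhat more consistent).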
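
As a filled-in sketch of that skeleton, the class below reproduces the word-window logic from the question with all state kept on the instance. `WindowIterator` is a hypothetical name, and it assumes the sentences are already held in memory as a list of strings (and yields the words themselves rather than ids), which sidesteps the open file handle entirely.

```python
import pickle


class WindowIterator(object):
    """Hypothetical picklable replacement for a word-window generator."""

    def __init__(self, sentences, window_size):
        self.sentences = sentences      # list of strings, kept in memory
        self.window_size = window_size
        self.sentence_idx = 0           # which sentence we are in
        self.word_idx = 0               # next word to consume in that sentence
        self.prevwords = []             # words seen so far in this sentence

    def __iter__(self):
        return self

    def __next__(self):
        while self.sentence_idx < len(self.sentences):
            words = self.sentences[self.sentence_idx].split()
            while self.word_idx < len(words):
                self.prevwords.append(words[self.word_idx])
                self.word_idx += 1
                if len(self.prevwords) >= self.window_size:
                    # Slicing returns a new list, so callers get a copy.
                    return self.prevwords[-self.window_size:]
            # Sentence exhausted: windows do not span sentences.
            self.sentence_idx += 1
            self.word_idx = 0
            self.prevwords = []
        raise StopIteration

    next = __next__  # Python 2 name for the same method


it = WindowIterator(["a b c d", "e f g"], window_size=3)
first = next(it)                         # consume one window
clone = pickle.loads(pickle.dumps(it))   # snapshot the iterator mid-stream
rest_original = list(it)
rest_clone = list(clone)
```

Because the position is stored as plain integers and lists, the default pickling of the instance dictionary is enough; no `__getstate__` or `__setstate__` is needed here.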
