spaCy GPU inference runs out of memory when processing multiple documents

Posted 2024-05-13 02:36:52


I'm using spaCy to process documents passed in through a REST API. More specifically, I use the transformer-based model en_core_web_trf for NER, running on a GPU. Below is a snippet of the spaCy-related class (it's wrapped in a basic Flask server, but I don't think that matters):

import spacy

class SpacyExtractor:
    def __init__(self):
        # Run the transformer pipeline on the GPU.
        spacy.require_gpu()
        self.model = spacy.load('en_core_web_trf',
                                disable=["tagger", "parser", "attribute_ruler", "lemmatizer"])

    def get_named_entities(self, text: str):
        # Run the pipeline and collect (text, label) pairs for each entity.
        doc = self.model(text)
        entities = []
        for ent in doc.ents:
            entities.append((ent.text, ent.label_))
        return entities

The problem is that with each call to get_named_entities, the amount of allocated GPU memory goes up, by 2-3 GB every time (I checked this by repeatedly running nvidia-smi while the application was processing documents). So after a few calls I hit this error:

RuntimeError: CUDA out of memory. Tried to allocate 2.35 GiB (GPU 0; 10.76 GiB total capacity; 5.02 GiB already allocated; 1.18 GiB free; 8.41 GiB reserved in total by PyTorch)

The documents are not huge at all, 1-100 pages of text each. I suppose I'm making some mistake, but I just can't see it.

Environment: Ubuntu 18.04, Python 3.8, spacy 3.1.3, CUDA 9.1, RTX 2080 Ti (11 GB)

Edit: Also, I ran into the OOM error when processing a single really long document, presented as one long string.


1 answer
User
#1 · Posted 2024-05-13 02:36:52

The problem is, with each call of get_named_entities, the amount of GPU memory allocated goes up.

You should detach your data, as described in the PyTorch FAQ:

Don’t accumulate history across your training loop. By default, computations involving variables that require gradients will keep history. This means that you should avoid using such variables in computations which will live beyond your training loops, e.g., when tracking statistics. Instead, you should detach the variable or access its underlying data.
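As a minimal sketch of what the FAQ means (the tensors here are hypothetical stand-ins, not spaCy internals): keeping a tensor that requires gradients also keeps its entire computation graph alive on the GPU, while detaching keeps only the value.

import torch

x = torch.randn(3, requires_grad=True)
total = 0.0
for _ in range(10):
    y = (x * 2).sum()
    # Bad: `total += y` would retain the graph built in every iteration.
    # Good: detach (or call .item()) so only the numeric value is kept.
    total += y.detach().item()
print(total)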


Edit

You can also use:

with torch.no_grad():
    doc = self.model(text)

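Applied to the class from the question, that would look roughly like this (a sketch, assuming torch can be imported alongside spacy; it is not the only possible fix):

import spacy
import torch

class SpacyExtractor:
    def __init__(self):
        spacy.require_gpu()
        self.model = spacy.load('en_core_web_trf',
                                disable=["tagger", "parser", "attribute_ruler", "lemmatizer"])

    def get_named_entities(self, text: str):
        # Inference only: disable gradient tracking so no autograd
        # history accumulates across calls.
        with torch.no_grad():
            doc = self.model(text)
        return [(ent.text, ent.label_) for ent in doc.ents]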
EDIT: Also, I found out the OOM error when processing a single really long document, presented as a single long string.

That is to be expected: the transformer's memory use grows with the length of the input, so a single sufficiently long string can exceed the GPU's capacity regardless of gradient tracking.
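A common workaround (a standard approach, not something stated in the FAQ) is to split such a document into smaller chunks and stream them through the pipeline, so the model never sees the whole string at once. A sketch, where max_chars is an arbitrary limit to tune, and entities spanning a chunk boundary may be missed:

import spacy
import torch

def extract_entities_chunked(nlp, text: str, max_chars: int = 10_000):
    # Split on newlines and regroup into chunks of at most max_chars,
    # so boundaries fall between lines rather than inside entities.
    chunks, current, size = [], [], 0
    for para in text.split("\n"):
        if size + len(para) > max_chars and current:
            chunks.append("\n".join(current))
            current, size = [], 0
        current.append(para)
        size += len(para) + 1
    if current:
        chunks.append("\n".join(current))

    entities = []
    with torch.no_grad():
        # nlp.pipe processes the chunks in batches instead of one giant doc.
        for doc in nlp.pipe(chunks):
            entities.extend((ent.text, ent.label_) for ent in doc.ents)
    return entities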
