空间替换令牌

import spacy nlp = spacy.load("en_core_web_lg") from spacy.tokens import Doc doc1 = nlp("Hi this is my dog.") new_words = [token.text if token.text!="dog" else "Simba" for token in doc1] Doc(doc1.vocab, words=new_words) # Hi this is my Simba .

3条回答

网友

1楼 · 编辑于 2024-09-27 07:20:12

看来你在找一个常规的替代品？我会的

string = "Hi this is my dog."
string = string.replace("dog","Simba")

网友

2楼 · 编辑于 2024-09-27 07:20:12

下面的函数替换任意数量的匹配项（使用spaCy查找），保持与原始文本相同的空格，并适当处理边缘情况（如匹配项位于文本开头时）：

import spacy
from spacy.matcher import Matcher

nlp = spacy.load("en_core_web_lg")

matcher = Matcher(nlp.vocab)
matcher.add("dog", None, [{"LOWER": "dog"}])

def replace_word(orig_text, replacement):
    tok = nlp(orig_text)
    text = ''
    buffer_start = 0
    for _, match_start, _ in matcher(tok):
        if match_start > buffer_start:  # If we've skipped over some tokens, let's add those in (with trailing whitespace if available)
            text += tok[buffer_start: match_start].text + tok[match_start - 1].whitespace_
        text += replacement + tok[match_start].whitespace_  # Replace token, with trailing whitespace if available
        buffer_start = match_start + 1
    text += tok[buffer_start:].text
    return text

>>> replace_word("Hi this is my dog.", "Simba")
Hi this is my Simba.

>>> replace_word("Hi this dog is my dog.", "Simba")
Hi this Simba is my Simba.

网友

3楼 · 编辑于 2024-09-27 07:20:12

text='你好，这是我的狗' 打印（text.replace（'dog'，'simba'））

相关问题更多 >

编程相关推荐

热门问题

热门文章

空间替换令牌

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >