使用Python解析字符串？

master = [] # Dictionary for us to look up words def is_word(inputstr): #returns True/False def processstring(fstr,secstr,li): if is_word(fstr): li.append(fstr) if len(secstr) == 0: if len(li) != 0: master.append(li) return processstring(fstr+secstr[0], secstr[1:len(secstr)],li) def wrapperprocess(inpstr): li = [] if len(inpstr) == 0: return processstring('',inpstr,li) wrapperprocess(inpstr[1:len(inpstr)]) wrapperprocess('helloyellowellow') print master

3条回答

网友

1楼 · 编辑于 2024-10-02 08:18:41

这是个好问题

使用Wordnet包

在解析给定字符串时，从一些索引开始，并不断地折磨索引值对于索引上的每一个增量，使用wordnet检查同一个单词的存在，它会告诉你特定的子字符串是否有意义！在

要安装wordnet：

https://pypi.python.org/pypi/Wordnet-bn/1.0

网友

2楼 · 编辑于 2024-10-02 08:18:41

既然您提到您正在寻找一个有效的算法，并且假设您提前获得了字典（而不仅仅是作为一个可调用的谓词），那么您可以使用Aho–Corasick算法。在

当然，如果输入的文本很短，一个更朴素的算法会更快，以避免字典的“昂贵”预处理。在

另外，python的另一个答案是：这里有一个简单的方法来检查每个子字符串：

def gen_words(txt):
    n = len(txt)
    for i in range(n):
        for j in range(i+1, n+1):
            subtxt = txt[i:j]
            if is_word(subtxt):
                yield subtxt

要获得独特性，请执行以下操作：

^{pr2}$

网友

3楼 · 编辑于 2024-10-02 08:18:41

你可以这样做：

tgt='helloyellowellow'

with open('/usr/share/dict/words') as f:
    for word in f:
        word=word.strip()
        if word in tgt and len(word)>1:
            print word

印刷品：

^{pr2}$

如果您只是在寻找尚未定义的函数is_word，那么可以使用如下方法：

def is_word(word, dic='/usr/share/dict/words'):
    if not hasattr(is_word, 'words'):
        with open(dic) as f:
            is_word.words={word.strip() for word in f}

    return word in is_word.words and len(word)>1

作为默认的数据结构，Python集合有一个平均值look-up time of O(1)。你不太可能自己写一些更快的东西。在

相关问题更多 >

编程相关推荐

热门问题

热门文章