Creating string combinations based on substitutions

Posted 2024-09-28 20:43:37


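Given a word and a dictionary of substitute characters, I need to generate every variant of the word that can be formed by applying those substitutions.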

Input

word = 'accompanying'  
substitutions={'c':['$'], 'a': ['4'], 'g': ['9']} 

Output

{'a$$ompanyin9', 'ac$ompanyin9','a$companyin9','4ccomp4nying', '4$$omp4nying', 
'4$comp4nying','4c$omp4nying', '4ccomp4nyin9', 'a$$ompanying', 'a$companying', 'ac$ompanying', 
'accompanyin9', 'accompanying', '4$$omp4nyin9', '4$comp4nyin9', '4c$omp4nyin9','etc.,'}

I wrote some code, but it does not produce all the combinations I expect.

Sample code

from itertools import product
substitutions={'c':['$'], 'a': ['4'], 'g': ['9']}
for key in substitutions.keys():
  if key not in substitutions[key]:
    substitutions[key].append(key)
wordPossibilities = []
word = 'accompanying'
for substitute in [zip(substitutions.keys(),ch) for ch in product(*substitutions.values())]:
  temp=word
  for replacement in substitute:
    temp=temp.replace(*replacement)
  wordPossibilities.append(temp)
print(set(wordPossibilities))

My output

{'4$$omp4nyin9', 'a$$ompanyin9', 'a$$ompanying', 'accompanyin9',
'accompanying', '4ccomp4nyin9', '4$$omp4nying', '4ccomp4nying'}

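Whenever a substitution applies, my code replaces every occurrence of that character in the string. How can I substitute by index so that I get all possible combinations?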
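
To illustrate the behaviour described in the question, here is a minimal sketch (the variable names all_replaced and first_only are only illustrative). It shows that str.replace changes every occurrence at once, and that its optional count argument only limits replacements from the left, so it cannot pick out a single later occurrence.

```python
word = 'accompanying'

# str.replace substitutes every occurrence of the character at once,
# so the two 'c's (and the two 'a's) can never be changed independently.
all_replaced = word.replace('c', '$')

# The optional third argument limits the number of replacements, but it
# always counts from the left, so it cannot target only the second 'c'.
first_only = word.replace('c', '$', 1)

print(all_replaced)  # a$$ompanying
print(first_only)    # a$companying
```

With three keys that are each either replaced everywhere or left alone, this approach can produce at most 2**3 = 8 strings, which matches the eight results shown above.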


2 answers

A recursive generator makes this clean and concise:

word = 'accompanying'  
subs={'c':['$'], 'a': ['4'], 'g': ['9']} 
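def get_subs(d, c=()):
    # d is the part of the word still to process, c the characters chosen so far
    if not d:
        yield ''.join(c)
    else:
        # Try the original character first, then each of its substitutes
        for i in [d[0], *subs.get(d[0], [])]:
            yield from get_subs(d[1:], c + (i,))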

print(list(get_subs(word)))

Output:

['accompanying', 'accompanyin9', 'accomp4nying', 'accomp4nyin9', 'ac$ompanying', 'ac$ompanyin9', 'ac$omp4nying', 'ac$omp4nyin9', 'a$companying', 'a$companyin9', 'a$comp4nying', 'a$comp4nyin9', 'a$$ompanying', 'a$$ompanyin9', 'a$$omp4nying', 'a$$omp4nyin9', '4ccompanying', '4ccompanyin9', '4ccomp4nying', '4ccomp4nyin9', '4c$ompanying', '4c$ompanyin9', '4c$omp4nying', '4c$omp4nyin9', '4$companying', '4$companyin9', '4$comp4nying', '4$comp4nyin9', '4$$ompanying', '4$$ompanyin9', '4$$omp4nying', '4$$omp4nyin9']

However, itertools.product can be used for a shorter solution:

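from itertools import product as prod
# Template with one '{}' placeholder per substitutable character
s = ''.join('{}' if i in subs else i for i in word)
# Fill the placeholders with every choice of original letter or substitute
result = [s.format(*i) for i in prod(*[[i, *subs[i]] for i in word if i in subs])]
print(result)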

Output:

['accompanying', 'accompanyin9', 'accomp4nying', 'accomp4nyin9', 'ac$ompanying', 'ac$ompanyin9', 'ac$omp4nying', 'ac$omp4nyin9', 'a$companying', 'a$companyin9', 'a$comp4nying', 'a$comp4nyin9', 'a$$ompanying', 'a$$ompanyin9', 'a$$omp4nying', 'a$$omp4nyin9', '4ccompanying', '4ccompanyin9', '4ccomp4nying', '4ccomp4nyin9', '4c$ompanying', '4c$ompanyin9', '4c$omp4nying', '4c$omp4nyin9', '4$companying', '4$companyin9', '4$comp4nying', '4$comp4nyin9', '4$$ompanying', '4$$ompanyin9', '4$$omp4nying', '4$$omp4nyin9']
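
As a variation on the itertools.product idea above, the following sketch builds one option list per position and joins the choices directly, so it does not rely on str.format (which would misread any literal '{' or '}' in the word). The function name substitution_variants is hypothetical and not part of the original answer.

```python
from itertools import product

def substitution_variants(word, subs):
    """Return every variant of word, choosing per position between the
    original character and each of its substitutes."""
    # One option list per position: the character itself first, then any
    # substitutes. Characters without an entry in subs get a single option.
    options = [[ch, *subs.get(ch, [])] for ch in word]
    return [''.join(choice) for choice in product(*options)]

variants = substitution_variants('accompanying', {'c': ['$'], 'a': ['4'], 'g': ['9']})
print(len(variants))   # 32
print(variants[:4])    # ['accompanying', 'accompanyin9', 'accomp4nying', 'accomp4nyin9']
```

The word has five substitutable positions with two options each, which gives the 2**5 = 32 results listed above.
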
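Clearly, you need to rewrite your logic to consider each instance of the desired letters rather than each letter. Find all occurrences of the desired letters; use itertools to get the power set; then make the indicated substitutions for each element of the power set. power_set comes from this SO answer. I left the code "broken out" in a few places to make the logic easier to follow. You may want to wrap the last loop into a one-line return expression.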

from itertools import chain, combinations

def power_set(iterable):
    s = list(iterable)
    return chain.from_iterable(combinations(s, r) for r in range(len(s)+1))
                               
substitutions={'c':['$'], 'a': ['4', 'a'], 'g': ['9']}
word = 'accordingly'

# Get the index of each desired letter and its possible substitutions
sub_idx = [(pos, letter, sub_letter) for pos, letter in enumerate(word)
            if letter in substitutions for sub_letter in substitutions[letter]]
print("Replacement set", sub_idx)

for possibility in power_set(sub_idx):
    # Make each of the substitutions indicated in the power set
    new_word = list(word)
    for pos, _, sub_letter in possibility:
        new_word[pos] = sub_letter
    print(''.join(new_word))

Output:

Replacement set [(0, 'a', '4'), (0, 'a', 'a'), (1, 'c', '$'), (2, 'c', '$'), (8, 'g', '9')]
accordingly
4ccordingly
accordingly
a$cordingly
ac$ordingly
accordin9ly
accordingly
4$cordingly
4c$ordingly
4ccordin9ly
a$cordingly
ac$ordingly
accordin9ly
a$$ordingly
a$cordin9ly
ac$ordin9ly
a$cordingly
ac$ordingly
accordin9ly
4$$ordingly
4$cordin9ly
4c$ordin9ly
a$$ordingly
a$cordin9ly
ac$ordin9ly
a$$ordin9ly
a$$ordingly
a$cordin9ly
ac$ordin9ly
4$$ordin9ly
a$$ordin9ly
a$$ordin9ly
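
The listing above contains repeated words, because different subsets of the replacement set can produce the same string (for example, replacing 'a' with 'a'). A minimal sketch of one way to avoid that, assuming duplicates are unwanted: wrap the loop in a function that collects results in a set and skips substitutions that do not change the letter. The name unique_variants is hypothetical.

```python
from itertools import chain, combinations

def power_set(iterable):
    s = list(iterable)
    return chain.from_iterable(combinations(s, r) for r in range(len(s) + 1))

def unique_variants(word, substitutions):
    # Record only (position, substitute) pairs that actually change the letter
    sub_idx = [(pos, sub) for pos, letter in enumerate(word)
               for sub in substitutions.get(letter, []) if sub != letter]
    results = set()
    for possibility in power_set(sub_idx):
        new_word = list(word)
        for pos, sub in possibility:
            # If one position appears more than once, the later pair wins
            new_word[pos] = sub
        results.add(''.join(new_word))
    return results

variants = unique_variants('accordingly', {'c': ['$'], 'a': ['4', 'a'], 'g': ['9']})
print(len(variants))  # 16
```

Four positions can change here (one 'a', two 'c's, one 'g'), so 2**4 = 16 distinct words remain. Note that the power set grows as 2**n in the number of recorded pairs, so this is only practical for short words.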
