从字符串中置换删除长度可变的已定义子字符串

AlCrHfMoNbN --> CrHfMoNbN AlHfMoNbN AlCrMoNbN AlCrHfNbN AlCrHfMoN AlCrHfMoTaN --> CrHfMoTaN AlHfMoTaN AlCrMoTaN AlCrHfTaN AlCrHfMoN

3条回答

网友

1楼 · 编辑于 2024-09-28 23:37:33

如果您有gawk，请将FPAT设置为[A-Z][a-z]*，这样每个元素都将被视为一个字段，并使用一个简单的循环来生成置换。同时将OFS设置为空字符串，这样输出记录中就不会有空格。你知道吗

$ gawk 'BEGIN{FPAT="[A-Z][a-z]*";OFS=""} {for(i=1;i<NF;++i){p=$i;$i="";print;$i=p}}' file
CrHfMoNbN
AlHfMoNbN
AlCrMoNbN
AlCrHfNbN
AlCrHfMoN
CrHfMoTaN
AlHfMoTaN
AlCrMoTaN
AlCrHfTaN
AlCrHfMoN
CrHfMoTiN
AlHfMoTiN
AlCrMoTiN
AlCrHfTiN
AlCrHfMoN
CrHfMoVN
AlHfMoVN
AlCrMoVN
AlCrHfVN
AlCrHfMoN
CrHfMoWN
AlHfMoWN
AlCrMoWN
AlCrHfWN
AlCrHfMoN

我还写了一个带有额外空间和解释性注释的便携版本：

awk '{
  # separate last element from others
  sub(/[A-Z][a-z]*$/, " &")
  # from the beginning of line
  # we will match each element and print a line where it is omitted
  for (i=0; match(substr($1,i), /[A-Z][a-z]*/); i+=RLENGTH)
    print substr($1,1,i)  substr($1,i+RLENGTH+1) $2
    #     ^ before match  ^ after match          ^ last element
}' file

网友

2楼 · 编辑于 2024-09-28 23:37:33

IIUC，你只需要str.replace：

input_list = ['AlCrHfMoNbN', 'AlCrHfMoTaN']
removals = ['Al', 'Cr', 'Hf', 'Mo', 'Nb', 'Ta', 'Ti', 'V', 'W', 'Zr']
result = {}
for i in input_list:
    result[i] = [i.replace(r,'') for r in removals if r in i]

输出：

{'AlCrHfMoNbN': ['CrHfMoNbN',
  'AlHfMoNbN',
  'AlCrMoNbN',
  'AlCrHfNbN',
  'AlCrHfMoN'],
 'AlCrHfMoTaN': ['CrHfMoTaN',
  'AlHfMoTaN',
  'AlCrMoTaN',
  'AlCrHfTaN',
  'AlCrHfMoN']}

网友

3楼 · 编辑于 2024-09-28 23:37:33

这并不使用您的尝试，但当我们假设您的元素总是以大写字母开头（否则仅由小写字母组成）时，它就起作用了：

def f(s):
    # split string by elements
    import re
    elements = re.findall('[A-Z][^A-Z]*', s)

    # make a list of strings, where the first string has the first element removed, the second string the second, ...
    r = []
    for i in range(len(elements)):
        r.append(''.join(elements[:i]+elements[i+1:]))

    # return this list
    return r

当然，这仍然只适用于一个字符串。所以，如果你有一个字符串列表l，你想把它应用到其中的每个字符串，只需使用一个for循环，如下所示：

# your list of strings
l = ["AlCrHfMoNbN", "AlCrHfMoTaN", "AlCrHfMoTiN", "AlCrHfMoVN", "AlCrHfMoWN"]

# iterate through your input list
for s in l:
    # call above function
    r = f(s)
    # print out the result if you want to
    [print(i) for i in r]

相关问题更多 >

编程相关推荐

热门问题

热门文章

从字符串中置换删除长度可变的已定义子字符串

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >