Python正则表达式，用于在字符串中查找和替换

lines = ['1 2 3*2.5 3 6 1*.3 8 \n', '! comment here\n', '1*1 2.0 2*2.1 3 6 0 8 \n'] for l, line in enumerate(lines): if line.strip() == '' or line.strip()[0] in ['#','!','C']: del lines[l] for l, line in enumerate(lines): repls = [word for word in line.strip().split() if word.find('*')>=0] print repls for repl in repls: print repl line = line.replace(repl, ' '.join([repl.split('*')[1] for n in xrange(int(repl.split('*')[0]))])) lines[l] = line print lines

编辑：

根据评论，我编辑了如下Python代码：

in_lines = ['1 2 3*2.5 3 6 1*.3 8 \n', '! comment here\n', '1*1 2.0 2*2.1 3 6 0 8 \n'] lines = [] for line in in_lines: if line.strip() == '' or line.strip()[0] in ['#','!','C']: continue else: repls = [word for word in line.strip().split() if word.find('*')>=0] for repl in repls: line = line.replace(repl, ' '.join([float(repl.split('*')[1]) for n in xrange(int(repl.split('*')[0]))])) lines.append(line) print lines

1条回答

网友

1楼 · 发布于 2024-05-04 13:35:48

Python道

使用python令人敬畏的功能特性和列表理解来代替：

#!/usr/bin/env python

lines = ['1 2 3*2.5 3 6 1*.3 8 \n', '! comment here\n', '1*1 2.0 2*2.1 3 6 0 8 \n']

#filter out comments
lines = [line for line in lines if  line.strip() != '' and line.strip()[0] not in ['#','!','C']]

#turns lines into lists of tokens
lines = [[word for word in line.strip().split()] for line in lines]

# turns a list of strings into a number generator, parsing '*' properly
def generate_numbers(tokens):
  for token in tokens:
    if '*' in token:
      n,m = token.split("*")
      for i in range(int(n)):
        yield float(m)
    else:
      yield float(token)

# use the generator to clean up the lines
lines = [list(generate_numbers(tokens)) for tokens in lines]

print lines

输出：

^{pr2}$

又快又小的Python道

此解决方案使用生成器而不是列表，这样您就不必在内存中加载整个文件。注意两个习语的用法：

with open("name") as file
这将在退出块后清理文件句柄。
for line in file
这将使用生成器迭代文件中的行，而无需在内存中加载整个文件。

这给了我们：

#!/usr/bin/env python

# turns a list of strings into a number generator, parsing '*' properly
def generate_numbers(tokens):
  for token in tokens:
    if '*' in token:
      n,m = token.split("*")
      for i in range(int(n)):
        yield float(m)
    else:
      yield float(token)

# Pull this out to make the code more readable
def not_comment(line):
  return line.strip() != '' and line.strip()[0] not in ['#','!','C']

with open("try.dat") as file:
  lines = ( 
    list(generate_numbers((word for word in line.strip().split()))) 
    for line in file if not_comment(line)
  ) # lines is a lazy generator

  for line in lines:
    print line

输出：

➤ ./try.py 
[1.0, 2.0, 2.5, 2.5, 2.5, 3.0, 6.0, 0.3, 8.0]
[1.0, 2.0, 2.1, 2.1, 3.0, 6.0, 0.0, 8.0]

编辑：

Python道

又快又小的Python道

相关问题更多 >

编程相关推荐

热门问题

热门文章