向与表达式（Python）匹配的行首和行尾添加字符

import os import sys import html2text file = raw_input("File to convert: ") h = html2text.HTML2Text() with open(file, 'r') as f: dataContent = f.read() dataConverted = h.handle(dataContent) with open('tempconvert', 'w') as f: f.write(dataConverted) #print dataConverted # Add the | to the line beginnings and endings with open('tempconvert', 'r') as f: tempContent = f.readlines() with open('finalconvert', 'w') as f: for line in tempContent: if '|' in line: f.write('|' + line.rstrip('\n') + '| \n')

2条回答

网友

1楼 · 编辑于 2024-07-04 17:16:51

1:通过使用^{}将单个字符串转换为字符串列表，可以消除临时文件。这样做的副作用是从每一行中删除\n，您必须在后面添加或说明这些内容。你知道吗

tempContent = dataConverted.split('\n')

2:有两种方法可以解决这个问题。首先是简单地使用else来编写您之前跳过的行。（如果使用上面的split提示，则不需要rstrip）。你知道吗

if '|' in line:
    f.write('|' + line + '| \n')
else:
    f.write(line + '\n')

另一种方法是，如果行需要更新，则更新它，然后在任何情况下都写入它。你知道吗

if '|' in line:
    line = '|' + line + '| '
f.write(line + '\n')

3:这更难，因为你不只是想把这些条添加到一个空白行之后的任何一行，你想检测到有一个表出现了。这意味着你需要向前看。这里有一个小功能，你可以用它来自动看前面。你知道吗

def lookahead(seq):
    current = None
    for upcoming in seq:
        if current is not None:
            yield current, upcoming
        current = upcoming
    if current is not None:
        yield current, None

你可以这样使用它：

for line, upcoming in lookahead(tempContent):
    if (upcoming and '|' in upcoming) or ('|' in line):
        line = '|' + line + '|'

网友

2楼 · 编辑于 2024-07-04 17:16:51

第一次尝试：

html = html2text.HTML2Text()
input_file = raw_input('File to convert: ')

with open(input_file, 'r') as input_, open('finalconvert', 'w') as output_:
    data = input_.read()
    data_converted = html.handle(data)

    for line in data_converted.split('\n'):
        if '|' in line:
            line = "|{}|\n".format(line.rstrip())
        output_.write(line.encode())

这段代码修复了1和2；但是我不理解3。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章