在Python中如何通过字符搜索跳到一行

#foeijfoijeoijoijfoiej ijfoiejoi jfeoijfoifj i jfoei joi jo ijf eoij oie jojf #feoijfoiejf ioj oij oi jo ij i joi jo ij oij #### oijroijf 3## # o #foeijfoiej i jo i iojf 3 ## #io joi joij oi j## io joi joi j3# 3i ojoi joij # The stuff I care about

2条回答

网友

1楼 · 编辑于 2024-10-02 20:30:34

在不知道垃圾数据的大小或扫描垃圾数据的情况下，您无法直接查找它。但是，将文件包装在^{}中以丢弃行并不难，直到您看到“良好”的数据，之后它将遍历所有剩余的行：

import itertools

# Or def a regular function that returns True until you see the line
# delimiting the beginning of the "good" data
not_good = '# The stuff I care about\n'.__ne__

with open(filename) as f:
    for line in itertools.dropwhile(not_good, f):
        ... You'll iterate the lines at and after the good line ...

如果您确实需要适当定位文件描述符，而不仅仅是行，那么这个变量应该可以工作：

^{pr2}$

如果您真的需要（而不是仅仅需要偏移量），您可以调整它以获得实际的行号。但是它的可读性稍差，因此如果需要，通过enumerate显式迭代可能更有意义（留作练习）。让Python为您工作的方法是：

from future_builtins import map  # Py2 only
from operator import itemgetter

with open(filename) as f:
    linectr = itertools.count()
    # Get first good line
    # Pair each line with a 0-up number to advance the count generator, but
    # strip it immediately so not_good only processes lines, not line nums 
    good_start = next(itertools.dropwhile(not_good, map(itemgetter(0), zip(f, linectr))))

    good_lineno = next(linectr) # Keeps the 1-up line number by advancing once

    # Seek back to undo the read of the first good line:
    f.seek(-len(good_start), io.SEEK_CUR)

    # f is now positioned at the beginning of the line that begins the good data

网友

2楼 · 编辑于 2024-10-02 20:30:34

尝试使用readlines函数。这将返回一个包含每一行的列表。您可以使用for循环来分析每一行，搜索所需内容，然后通过列表中的索引获得该行的编号。例如：

with open('some_file_path.txt') as f:
    contents = f.readlines()
object = '#the line I am looking for'
for line in contents:
    if object in line:
        line_num = contents.index(object)

要去掉井号，只需使用replace函数。例如new_line = line.replace('#','')

相关问题更多 >

编程相关推荐

热门问题

热门文章