嵌套forloop迭代停止

<html> <head> <title></title> </head> <body> blabla blablabla qwqwqw </body> </html>

from lxml import etree tree = etree.parse("file.html") filein = "file.css" def f1(): with open(filein, 'rU') as f: for span in tree.iterfind('//span'): for line in f: if span and span.attrib.has_key('id'): x = span.get('id') if "af" not in x and x in line: print x, line def main(): f1()

2条回答

网友

1楼 · 编辑于 2024-09-26 17:51:53

如果如我所想，树已完全加载到内存中，则可以尝试反转循环。这样，您只需浏览文件filein一次：

def f1():

    with open(filein, 'rU') as f:   
        for line in f:
            for span in tree.iterfind('//span'):   
                if span and span.attrib.has_key('id'):
                    x = span.get('id')
                    if "af" not in x and x in line:
                            print x, line

网友

2楼 · 编辑于 2024-09-26 17:51:53

这是因为在第二个外循环开始之前，您已经读取了所有filein行。要使其正常工作，您需要在filein上启动内部循环之前添加f.seek（0）：

with open(filein, 'rU') as f:   
    for span in tree.iterfind('//span'):
        f.seek(0)   
        for line in f:
            if span and span.attrib.has_key('id'):
                x = span.get('id')
                if "af" not in x and x in line:
                        print x, line

相关问题更多 >

编程相关推荐

热门问题

热门文章