Python中的Iterparse对象没有返回iter obj

ARTICLE_TAGS = ['OpportunitySynopsisDetail_1_0', 'OpportunityForecastDetail_1_0'] for _tag in ARTICLE_TAGS: f = open(xml_f) context = etree.iterparse(f, tag = _tag) for _, e in context: _id = e.xpath('.//OpportunityID/text()') text = e.xpath('.//OpportunityTitle/text()') f.close()

<?xml version="1.0" encoding="UTF-8"?> <Grants xsi:schemaLocation="http://apply.grants.gov/system/OpportunityDetail-V1.0 https://apply07.grants.gov/apply/system/schemas/OppotunityDetail-V1.0.xsd" xmlns="http://apply.grants.gov/system/OpportunityDetail-V1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instace"> <OpportunitySynopsisDetail_1_0> <OpportunityID>262148</OpportunityID> <OpportunityTitle>Establishment of the Edmund S. Muskie Graduate Internship Program</OpportunityTitle> </OpportunitySynopsisDetail_1_0> <OpportunityForecastDetail_1_0> <OpportunityID>284765</OpportunityID> <OpportunityTitle>PPHF 2015: Immunization Grants-CDC Partnership: Strengthening Public Health Laboratories-financed in part by 2015 Prevention and Public Health Funds</OpportunityTitle> </OpportunityForecastDetail_1_0> </Grants>

1条回答

网友

1楼 · 发布于 2024-10-01 00:14:15

首先，在解析包含名称空间的XML时，查看标记名时必须使用这些名称空间。
第二，iterparse不接受名为tag的参数，因此我不知道您的代码在发布时是如何工作的。最后，从iterparse返回的元素没有名为xpath的成员函数，因此这也不可能起作用

下面是如何使用iterparse解析XML的示例：

NS='{http://apply.grants.gov/system/OpportunityDetail-V1.0}'
ARTICLE_TAGS = [NS+'OpportunitySynopsisDetail_1_0', NS+'OpportunityForecastDetail_1_0']

with open(xml_f, 'r') as f:
    context = etree.iterparse(f)

    for _, e in context:
    if e.tag  in ARTICLE_TAGS:
        _id = e.find(NS+'OpportunityID')
        text = e.find(NS+'OpportunityTitle')
        print(_id.text, text.text)

正如我在评论中所说，Python documentation和Effbot page on ElementTree一样有用。有很多其他可用的资源；把xml.etree.elementtree放进谷歌，开始阅读吧

相关问题更多 >

编程相关推荐

热门问题

热门文章