正在检查ElementTree节点是否为空failu问题的回答

正在检查ElementTree节点是否为空failu

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我一直收到错误：<code>AttributeError: 'NodeList' object has no attribute 'data'</code>，但我只是试图检查该节点是否为空，如果是，只传递-1而不是值。我的理解是<code>temp_pub.getElementsByTagName("pages").data</code>应该返回<code>None</code>。我怎么解决这个问题？在 （注：我试过<code>!= None</code>和{<cd5>}） <pre><code>xmldoc = minidom.parse('pubsClean.xml') #loop through <pub> tags to find number of pubs to grab root = xmldoc.getElementsByTagName("root")[0] pubs = [a.firstChild.data for a in root.getElementsByTagName("pub")] num_pubs = len(pubs) count = 0 while(count < num_pubs): temp_pages = 0 #get data from each <pub> tag temp_pub = root.getElementsByTagName("pub")[count] temp_ID = temp_pub.getElementsByTagName("ID")[0].firstChild.data temp_title = temp_pub.getElementsByTagName("title")[0].firstChild.data temp_year = temp_pub.getElementsByTagName("year")[0].firstChild.data temp_booktitle = temp_pub.getElementsByTagName("booktitle")[0].firstChild.data #handling no value if temp_pub.getElementsByTagName("pages").data != None: temp_pages = temp_pub.getElementsByTagName("pages")[0].firstChild.data else: temp_pages = -1 temp_authors = temp_pub.getElementsByTagName("authors")[0] temp_author_array = [a.firstChild.data for a in temp_authors.getElementsByTagName("author")] num_authors = len(temp_author_array) count = count + 1 </code></pre> 正在处理的XML ^{pr2}$ 编辑中的完整代码（with to ElementTree） <pre><code>#for execute command to work import sqlite3 import xml.etree.ElementTree as ET con = sqlite3.connect("publications.db") cur = con.cursor() from xml.dom import minidom #use this to clean the foreign characters import re def anglicise(matchobj): if matchobj.group(0) == '&amp;': return matchobj.group(0) else: return matchobj.group(0)[1] outputFilename = 'pubsClean.xml' with open('test.xml') as inXML, open(outputFilename, 'w') as outXML: outXML.write('<root>\n') for line in inXML.readlines(): if (line.find("") or line.find("")): newline = line.replace("", "") newLine = newline.replace("", "") outXML.write(re.sub('&[a-zA-Z]+;',anglicise,newLine)) outXML.write('\n</root>') tree = ET.parse('pubsClean.xml') root = tree.getroot() xmldoc = minidom.parse('pubsClean.xml') #loop through <pub> tags to find number of pubs to grab root2 = xmldoc.getElementsByTagName("root")[0] pubs = [a.firstChild.data for a in root2.getElementsByTagName("pub")] num_pubs = len(pubs) count = 0 while(count < num_pubs): temp_pages = 0 #get data from each <pub> tag temp_ID = root.find(".//ID").text temp_title = root.find(".//title").text temp_year = root.find(".//year").text temp_booktitle = root.find(".//booktitle").text #handling no value if root.find(".//pages").text: temp_pages = root.find(".//pages").text else: temp_pages = -1 temp_authors = root.find(".//authors") temp_author_array = [a.text for a in temp_authors.findall(".//author")] num_authors = len(temp_author_array) count = count + 1 #process results into sqlite pub_params = (temp_ID, temp_title) cur.execute("INSERT OR IGNORE INTO publication (id, ptitle) VALUES (?, ?)", pub_params) cur.execute("INSERT OR IGNORE INTO journal (jtitle, pages, year, pub_id, pub_title) VALUES (?, ?, ?, ?, ?)", (temp_booktitle, temp_pages, temp_year, temp_ID, temp_title)) x = 0 while(x < num_authors): cur.execute("INSERT OR IGNORE INTO authors (name, pub_id, pub_title) VALUES (?, ?, ?)", (temp_author_array[x],temp_ID, temp_title)) cur.execute("INSERT OR IGNORE INTO wrote (name, jtitle) VALUES (?, ?)", (temp_author_array[x], temp_booktitle)) x = x + 1 con.commit() con.close() print("\nNumber of entries processed: ", count) </code></pre>

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

正在检查ElementTree节点是否为空failu

1 个回答

相关Python问题