将unicode输入转换为字符串进行比较问题的回答

将unicode输入转换为字符串进行比较

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

回到你原来的帖子： <pre><code>if tr1_find.search(str(ListTables[0].Cell(x,y))): print 'Found' value = ListTables[0].Cell(x,y+1) </code></pre> <code>ListTables[0].Cell(x,y)</code>从Word文档返回一个<code>Cell</code>实例。对其调用<code>str()</code>将检索其Unicode值，并尝试使用<code>ascii</code>编解码器将其编码为字节字符串。因为它包含非ASCII字符，所以失败时会出现<code>UnicodeEncodingError</code>。在 在以后的编辑中： ^{pr2}$ <code>unicode</code>检索Unicode值，将其转换为UTF-8字节字符串，并将其存储在<code>tyring</code>中。下一行尝试将字节字符串再次编码为“ascii”。这是无效的，因为只能对Unicode字符串进行编码，因此Python首先尝试使用默认的“ascii”编解码器将字节字符串转换回Unicode字符串。这将导致<code>UnicodeDecodingError</code>（不是编码）。在 最佳实践是用Unicode进行所有字符串处理。缺少的是<code>Range()</code>方法来获取单元格的值。下面是一个访问Word文档表的示例： <pre><code>PythonWin 2.7.1 (r271:86832, Nov 27 2010, 18:30:46) [MSC v.1500 32 bit (Intel)] on win32. Portions Copyright 1994-2008 Mark Hammond - see 'Help/About PythonWin' for further copyright information. >>> import win32com.client >>> word=win32com.client.gencache.EnsureDispatch('Word.Application') >>> word.ActiveDocument.Tables[0].Cell(1,1).Range() u'One\u4e00\r\x07' </code></pre> 请注意，它是一个Unicode字符串。Word似乎还使用<code>\r\x07</code>作为细胞系终止符。在 现在可以测试该值： <pre><code>>>> value = word.ActiveDocument.Tables[0].Cell(1,1).Range() >>> value == 'One' # NOTE: Python converts byte strings to Unicode via the default codec ('ascii' in Python 2.X) False >>> value == u'One' False >>> value == u'One马\r\x07' False >>> value == u'One一\r\x07' True >>> value == u'One\u4e00\r\x07' True >>> value == 'One\x92' # non-ASCII byte string fails to convert __main__:1: UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal False </code></pre>

将unicode输入转换为字符串进行比较

1 个回答

相关Python问题