如何测试编码类型python2.7？

1条回答

网友

1楼 · 发布于 2024-10-02 08:21:19

不要依赖默认的系统编码。相反，总是自己设置：

    # read in a string (a bunch of bytes the encoding of which you should know)
    str = sys.stdin.read();
    # decode the bytes into a unicode string
    u = unicode.decode(str, encoding='ISO-8859-1', errors=replace);
    # do stuff with the string
    # ...
    # always operate on unicode stuff inside your program.
    # make a 'unicode sandwhich'.
    # ...
    # encode the bytes in preparation for writing them out
    out = unicode.encode(u, encoding='UTF-8')
    # great, now you have bytes you can just write out
    with open('myfile.txt', 'wb') as f:
        rb.write(out)

注意，我对整个编码进行了硬编码。你知道吗

但是如果你不知道输入的编码呢？好吧，那是个问题。You need to know that。但我也明白unicode可能会很痛苦，python社区的一个家伙告诉你how to stop the pain (video)。你知道吗

注意，python3的一大变化是更好的unicode支持。与使用unicode包和混乱的py2str类型不同，在python 3中str类型正是python 2的unicode类型，您可以在更方便的地方指定编码：

with open('myfile.txt', 'w', encoding=UTF-8, errors='ignore') as f:
   # ...

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何测试编码类型python2.7？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >