字节类型上的UnicodeDecodeError

Traceback (most recent call last): File "c:.\SharqBot.py", line 1130, in <module> fullR=s.recv(1024).decode('utf-32').split('\r\n') UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: codepoint not in range(0x110000)

b':tmi.twitch.tv 001 absolutelyabot :Welcome, GLHF!\r\n:tmi.twitch.tv 002 absolutelyabot :Your host is tmi.twitch.tv\r\n:tmi.twitch.tv 003 absolutelyabot :This server is rather new\r\n:tmi.twitch.tv 004 absolutelyabot :-\r\n:tmi.twitch.tv 375 absolutelyabot :-\r\n:tmi.twitch.tv 372 absolutelyabot :You are in a maze of twisty passages, all alike.\r\n:tmi.twitch.tv 376 absolutelyabot :>\r\n'

3条回答

网友

1楼 · 编辑于 2024-10-05 11:01:10

如果decode作为UTF-8不起作用，则每个Unicode序数都可以用UTF-8表示，这是因为正在传输的字节采用不同的编码，或者数据是文本和二进制数据的混合，并且只有一部分是UTF-8。很可能是文本是UTF-8编码的（大多数网络协议都是），因此非UTF-8数据将是帧数据或类似数据，需要进行解析以提取文本数据。

任何试图在文本/二进制情况下掩盖此类错误的尝试都将只是消除问题，而不是修复它们。您需要知道数据的编码（以及格式，如果不是所有的文本数据都有一个编码），然后使用它。你收到的数据不会神奇地变成UTF-16或UTF-32，因为你想要它。

网友
2楼 · 编辑于 2024-10-05 11:01:10

您可以尝试使用decode/encode（'utf-16-le'）。我试过了，没关系。但我不太清楚为什么。：P页

网友
3楼 · 编辑于 2024-10-05 11:01:10

尝试使用编码='ISO-8859-1'

相关问题更多 >

编程相关推荐

热门问题

热门文章