我需要在我的python - 龙卷风服务器上接收协议缓冲区消息,并从二进制消息中获取内容。协议缓冲区python - unicode解码错误
postContent = self.request.body
message = prototemp.ReqMessage()
message.ParseFromString(postContent)
它可以完美的使用测试工具。当我在沙箱环境中运行,并模拟从我的客户1000个请求,它工作在某些情况下,但在大多数的请求,它抛出一个异常 -
File "server1.py", line 21, in post
message.ParseFromString(postContent)
File "/usr/lib/python2.6/site-packages/protobuf-2.4.1-py2.6.egg/google/protobuf/message.py", line 179, in ParseFromString
self.MergeFromString(serialized)
File "/usr/lib/python2.6/site-packages/protobuf-2.4.1-py2.6.egg/google/protobuf/internal/python_message.py", line 755, in MergeFromString
if self._InternalParse(serialized, 0, length) != length:
File "/usr/lib/python2.6/site-packages/protobuf-2.4.1-py2.6.egg/google/protobuf/internal/python_message.py", line 782, in InternalParse
pos = field_decoder(buffer, new_pos, end, self, field_dict)
File "/usr/lib/python2.6/site-packages/protobuf-2.4.1-py2.6.egg/google/protobuf/internal/decoder.py", line 544, in DecodeField
if value._InternalParse(buffer, pos, new_pos) != new_pos:
File "/usr/lib/python2.6/site-packages/protobuf-2.4.1-py2.6.egg/google/protobuf/internal/python_message.py", line 782, in InternalParse
pos = field_decoder(buffer, new_pos, end, self, field_dict)
File "/usr/lib/python2.6/site-packages/protobuf-2.4.1-py2.6.egg/google/protobuf/internal/decoder.py", line 410, in DecodeField
field_dict[key] = local_unicode(buffer[pos:new_pos], 'utf-8')
UnicodeDecodeError: 'utf8' codec can't decode byte 0xce in position 1: invalid continuation byte
在另一些情况下,它给出了这些错误 -
UnicodeDecodeError: 'utf8' codec can't decode byte 0xbf in position 3: invalid start byte
UnicodeDecodeError: 'utf8' codec can't decode byte 0xe7 in position 3: unexpected end of data
可能是什么原因?
你尝试使用try/except子句打印正在生成异常的字符串?或者使用'pdb'来查看这个变量是什么?因为它告诉你这个问题:字符串中指定位置的某些字符不能用utf-8编码。所以要么你需要处理这个角色。 (如果你能弄清楚它是什么以及你是否一般处理它,你将能够处理它) –
我的第一个猜测是测试客户端使用UTF-16,因为那些字节没有似乎与UTF-8或任何有意义的西方图表相匹配 –
听起来就像它正在接收一条它缺少协议定义的消息。一个或多个发射器是否使用不同的规格? – MarkHu