Fix the wrong Content-Length in python-server.py for non-ascii characters. #24480
Resolves: #24479
`python-server.py` currently uses `sys.stdin.read` to read the input, and `sys.stdin.read` takes its length in `str` characters (decoded UTF-8 text). ref: https://docs.python.org/3/library/sys.html
On the other hand, "Content-Length" is the size in bytes, so we should not pass `content_length` to `sys.stdin.read`. For example, `print("こんにちは世界")` has a length of 16 as a `str`, but 30 as UTF-8 bytes.

This PR makes two changes:
- Replace `sys.stdin.read(content_length)` with `sys.stdin.buffer.read(content_length).decode()`.
- Make `_send_message` calculate "Content-Length" from bytes, not `str`.

With these changes, the original issue #24479 is resolved.
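
To illustrate the mismatch and the byte-based approach, here is a minimal, self-contained sketch. It is not the code in `python-server.py`: the helper names `frame_message` and `read_message` are hypothetical, and the header parsing is simplified on the assumption that each message is a `Content-Length` header, a blank line, and a UTF-8 body.

```python
import io


def frame_message(body: str) -> bytes:
    """Frame a message (hypothetical helper): the header counts bytes, not characters."""
    payload = body.encode("utf-8")
    header = f"Content-Length: {len(payload)}\r\n\r\n".encode("ascii")
    return header + payload


def read_message(stream) -> str:
    """Read one framed message from a binary stream such as sys.stdin.buffer."""
    content_length = 0
    while True:
        line = stream.readline()
        # A blank line (or end of input) terminates the header section.
        if not line or line in (b"\r\n", b"\n"):
            break
        name, _, value = line.decode("ascii").partition(":")
        if name.strip().lower() == "content-length":
            content_length = int(value.strip())
    # Read exactly content_length bytes, then decode to str.
    return stream.read(content_length).decode("utf-8")


text = 'print("こんにちは世界")'
print(len(text))                  # number of characters
print(len(text.encode("utf-8")))  # number of bytes

# Round trip through an in-memory binary stream standing in for stdin.
framed = frame_message(text)
print(read_message(io.BytesIO(framed)) == text)
```

Reading from a text stream with the byte count instead would request more characters than the body contains whenever the body has multi-byte characters, so the read would run past the end of the message.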