Why tornado.web.RequestHandler.arguments.get is binary?

L

Laszlo Nagy

I believe most data passed in URLs are character data. RFC 2986 also
suggest that the standard should be percent encoded UTF-8:
The generic URI syntax mandates that new URI schemes that provide for
the representation of character data in a URI must, in effect,
represent characters from the unreserved set without translation, and
should convert all other characters to bytes according to UTF-8
<http://en.wikipedia.org/wiki/UTF-8>, and then percent-encode those
values. This requirement was introduced in January 2005 with the
publication of RFC 3986 <http://tools.ietf.org/html/rfc3986>. URI
schemes introduced before this date are not affected. [1]

It is somewhat confusing that URI may be used to represent binary data.
More specifically, http and https URLs contain textual data in almost
all cases. When it is textual, it must be in UTF-8 (as dictated by the
RFC). So what is the reason in arguments.get returning binary data?


[1] http://en.wikipedia.org/wiki/Percent-encoding#Percent-encoding_in_a_URI
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads

Crawling 1
XML 1.x: URIs' and IRIs' impact on well-formedness 2
Patricia trie vs binary search. 32
URI vs. URL vs. URN 1
Why Python3 12
Is char obsolete? 20
ChatBot 4
FAQ 4.72 How do I handle binary data correctly? 0

Members online

Forum statistics

Threads
474,001
Messages
2,570,251
Members
46,851
Latest member
CristineKo

Latest Threads

Top