Why tornado.web.RequestHandler.arguments.get is binary?

Laszlo Nagy · Nov 16, 2013

I believe most data passed in URLs are character data. RFC 2986 also
suggest that the standard should be percent encoded UTF-8:

The generic URI syntax mandates that new URI schemes that provide for
the representation of character data in a URI must, in effect,
represent characters from the unreserved set without translation, and
should convert all other characters to bytes according to UTF-8
<http://en.wikipedia.org/wiki/UTF-8>, and then percent-encode those
values. This requirement was introduced in January 2005 with the
publication of RFC 3986 <http://tools.ietf.org/html/rfc3986>. URI
schemes introduced before this date are not affected. [1]

It is somewhat confusing that URI may be used to represent binary data.
More specifically, http and https URLs contain textual data in almost
all cases. When it is textual, it must be in UTF-8 (as dictated by the
RFC). So what is the reason in arguments.get returning binary data?

[1] http://en.wikipedia.org/wiki/Percent-encoding#Percent-encoding_in_a_URI

Crawling	1	Mar 10, 2021
XML 1.x: URIs' and IRIs' impact on well-formedness	2	Dec 13, 2009
Patricia trie vs binary search.	32	May 25, 2012
URI vs. URL vs. URN	1	Jun 6, 2011
Why Python3	12	Jun 28, 2010
Is char obsolete?	20	Apr 8, 2011
ChatBot	4	Jan 19, 2021
FAQ 4.72 How do I handle binary data correctly?	0	Feb 13, 2011

Why tornado.web.RequestHandler.arguments.get is binary?

Laszlo Nagy

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads