Respect Content-Type header when encoding unicode bodies #422

kgriffs · 2015-01-27T23:40:45Z

Currently, if req.body is set to a unicode string, the framework always encodes it as UTF-8 in the response. For correctness, we should respect alternate encodings when they are specified in the Content-Type header for the response.

lwcolton · 2015-04-13T19:02:08Z

Can you explain this a little more? Would this just be if the content-type is set to text/, then charset is used to decide how to encode the charset? http://www.w3.org/Protocols/rfc1341/7_1_Text.html

kgriffs · 2016-05-05T22:44:41Z

According to https://tools.ietf.org/html/rfc2046#section-4.1.2 :

Other media types than subtypes of "text" might choose to employ the charset parameter as defined here

For example, RFC 7303 defines multiple charsets for the XML media type, e.g.:

Content-Type: application/xml; charset=utf-16

vytas7 · 2023-03-26T18:20:42Z

I'm going to close this issue since it hasn't really attracted much attention, and it feels like it should be responsibility of the respective media handler. For instance, JSON is standardized to almost always use UTF-8 (or just ASCII by escaping the entities).

resp.text is documented to encode in UTF-8, simple and clear.
For more advanced handling of text as media, see also #2037

kgriffs added the bug label May 5, 2016

kgriffs added this to the Backlog (Non-Breaking Changes) milestone May 5, 2016

kgriffs modified the milestones: Backlog (Non-Breaking Changes), Triaged (Non-Breaking Changes) Apr 25, 2017

vytas7 mentioned this issue Feb 20, 2022

When serializing errors, respect charset, if any, specified in the Accept header #775

Closed

vytas7 closed this as not planned Won't fix, can't repro, duplicate, stale Mar 26, 2023

vytas7 added the wontfix label Mar 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Respect Content-Type header when encoding unicode bodies #422

Respect Content-Type header when encoding unicode bodies #422

kgriffs commented Jan 27, 2015

lwcolton commented Apr 13, 2015

kgriffs commented May 5, 2016

vytas7 commented Mar 26, 2023

Respect Content-Type header when encoding unicode bodies #422

Respect Content-Type header when encoding unicode bodies #422

Comments

kgriffs commented Jan 27, 2015

lwcolton commented Apr 13, 2015

kgriffs commented May 5, 2016

vytas7 commented Mar 26, 2023