Why does the percent sign in a URL cause an HTTP 400 Bad Request error?

22

2

I stumbled upon this by accident when mistyping the URL for a web page in my web browser.

Why does visiting http://example.com/% cause an HTTP 400 Bad Request error to be thrown? Is the server expecting something else after or before the percent sign?

It seems to happen for Apache and Nginx servers.

iglvzx

Posted 2014-05-28T23:01:19.650

Reputation: 21 611

Answers

33

Short answer

As per RFC 3986, a bare % character is not a valid URI syntax; it should be followed by two meaningful hexadecimal digits.

Long answer

The HTTP status code you got belongs to the 4xx class:

4xx: Client Error - The request contains bad syntax or cannot be fulfilled

Source: Hypertext Transfer Protocol (HTTP) Status Code Registry

In particular, code 400 is defined by the Internet Engineering Task Force (IETF) in RFC 2616:

10.4.1 400 Bad Request

The request could not be understood by the server due to malformed syntax. The client SHOULD NOT repeat the request without modifications.

Source: RFC 2616 - Hypertext Transfer Protocol -- HTTP/1.1

Quoting Wikipedia (bold emphasis mine):

The characters allowed in a URI are either reserved or unreserved (or a percent character as part of a percent-encoding).

Source: Percent-encoding - Percent-encoding in a URI

If you want to insert a literal % symbol, you need to use its percent-encoded representation: %25.

Further reading

and31415

Posted 2014-05-28T23:01:19.650

Reputation: 13 382

I don't get it. If I mask the '%' sign with '%25' the file will still not be served but an error 400 will be thrown in our scenario (Apache -> JKMount -> Tomcat) – fiffy – 2017-06-08T13:03:40.257

If you have the following RewriteRule RewriteRule (.*) xyz/$1 (where xyz is any folder name) in .htaccess, you should double-encode % as %2525. – Marco Marsala – 2019-09-22T20:58:11.643

9

The percent sign is for inserting a character that is normally not supported in the url. For example %20 is the same as a space.

LPChip

Posted 2014-05-28T23:01:19.650

Reputation: 42 190

4And to insert a percent character itself, it's %25 – Robotnik – 2014-05-29T03:50:40.980

A + is a shortcut way to encode a space. If you want a real plus sign, use its hex code, %2B. – Phil Perry – 2014-05-29T13:44:04.757

3>

  • is the correct encoding for a space only within a query string. %20 is the correct encoding elsewhere within the url.
  • < – Eden Townsend – 2014-05-29T14:22:22.257