How does websocket frame masking protect against cache poisoning?

Question

I've been studying the Websocket protocol (RFC 6455). Section 10.3 specifically talks about frame masking, which is prevents cache poisoning from http proxy servers.

How does frame masking prevent cache poisoning? How is a proxies cache "poisoned"? Why is the masking only applied to messages from the client to the server and not also the other way around?

Can this attack also be applied to long polling methods as well (specifically XHR)?

tylerl · Accepted Answer · 2013-06-05T21:21:08.480

That's a lot of questions. We'll stick just to the core problem and mitigation technique:

There's the chance that my connection may be proxied or otherwise examined by a caching proxy. And there's a decent chance that I can fool the proxy if I can get the right data into my request and the desired data into the response. The basic idea was demonstrated in a paper called Talking to Yourself for Fun and Profit.

As an attacking client, what I want to do is push content up to the server that contains a seemingly valid request string:

... something ...

GET /jquery-latest.min.js HTTP/1.1
Host: code.jquery.com
...
... something that triggers the desired response from the server

The only requirement is that you figure out a way to get the server to send you a desired string, which typically isn't too hard. Then the server predictably parrots back the desired response:

HTTP/1.1 200 OK
content-type: text/javascript

... my trojaned javascript file

As far as the server and client are concerned, we're just passing strings back and forth. No real reason for concern, no harm done, no possible vector for exploit.

But a proxy watching the traffic would see what looks a lot like an HTTP request, and then what looks a lot like an HTTP response. And if the proxy is naive enough (and many are), it will update its cache accordingly, so that the next time someone requests http://code.jquery.com/jquery-latest.min.js through the proxy, they'll get my trojaned version instead of the real CDN-delivered code.

To guard against this, the traffic in one direction (in this case client-to-server) is mildly scrambled. It doesn't have to be cryptographically secure as you don't care if someone can decode it. You just need to make sure that the client can't predict what string he'll be transmitting over the network. And since he can't predict his network traffic, he can't use this mechanism to trigger a proxy server to update its cache.

The proxy will only see a cacheable event if BOTH the request AND response contain what looks like HTTP traffic. So only one direction need to be jumbled.

This does not apply to XHR, since XHR requests are "normal" HTTP requests, and are well-understood by proxies.

Updated the question to (hopefully) focus on the core problem, which you answered well, thanks! — Luke, Jun 05 '13 at 21:27

score -2 · Answer 2 · answered Jan 23 '14 at 22:48

-2

How does frame masking prevent cache poisoning?

The idea is to prevent content of the frame from being interpreted as some HTTP request/response in progress by mangling the frame data.

In reality I believe this is totally useless.

Assume I've written a simple WebSocket client (true actually), as the sender of the frame I get to choose the masking dword. All I have to do is make up a frame of random data with some HTTP headers in the middle and apply the xor masking algorithm. Then send that frame in my WebSocket using the same mask, the WebSocket's xor masking will simply cause the HTTP parts to be plain text on the wire.

Yours, TonyWilk

answered Jan 23 '14 at 22:48

TonyWilk

97
1

Why would you want to do that, causing your own application to be more prone to faulty HTTP proxies? The point is to prevent your Websockets traffic from being understood and mangled by 3rd party proxies. – jjmontes Aug 28 '19 at 15:08
It's not trying to protect anyone from your custom web socket client. If you control the client and the server you can in-fact still mess-with naive caching proxies in-between. But when web browsers properly implement masking, no matter how clever you are, now you can't use javascript websockets on my browser that poisons my naive caching proxy - so you can't attack my infrastructure - only your own. – DanO Jan 31 '20 at 00:34

How does websocket frame masking protect against cache poisoning?

2 Answers2