Does CURL cache requests?

Question

This is a pretty long question, so bear with me.

I wanted to stress-test my Akamai server from an AWS instance I was logged in to, so I started running the ab benchmark. However, the runs seemed ridiculously fast for downloading ~3 MB video files, so naturally I wanted to see what was going on. This is what I ran to fetch the file:

curl -v -o /dev/null

The above completed in ~5 seconds.

Next, I ran the same command again. This time, it completed in ~200ms! Naturally, my intuition says the file is being cached somewhere.

My questions:

Does curl cache files? If so, is there a way to bypass the cache?
If curl doesn't, does Ubuntu keep a cache beneath curl? If so, is there a way to bypass it?
Given these requirements, is there a benchmarking tool other than ab that would serve the purpose?

Thank you, Akshay

score 21 · Accepted Answer · answered Jun 11 '15 at 10:19

The curl client isn't caching files, but the remote server or its network (a CDN edge, for example) might well be. Try adding an arbitrary query string variable to the URL to see if you can reproduce it.

Josip Rodin

Thank you for your answer. I couldn't add arbitrary query string as the Akamai server that I use doesn't accept any query params! (forcing error as it relies on salted token digest of timestamp and URL). However I was able to generate multiple tokens for the same path (essentially multiple URLs) and you are absolutely right. curl wasn't caching any file - the remote server was. Go CDN! :) – Akshaya Shanbhogue Jun 11 '15 at 19:04

score 7 · Answer 2 · answered Oct 20 '15 at 10:50

Belatedly, try:

curl -v -H "Cache-Control: no-cache"

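That asks any cache along the path not to answer from a stored copy without first revalidating it with the origin server. It only helps with layers that honor the request header; a CDN or proxy configured to ignore it will keep serving its cached copy.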
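
To see whether a caching layer answered, it can help to look at the response headers rather than only the elapsed time. The following is a minimal sketch, not a definitive check: the function names are made up for illustration, `Age` is a standard HTTP header, and `X-Cache` is a common but non-standard header that a given CDN may or may not send.

```shell
# Keep only the response headers that usually say something about caching.
# "Age" is standard; "X-Cache..." is a non-standard CDN header (assumption).
filter_cache_headers() {
    grep -i -E '^(age|cache-control|expires|etag|x-cache[^:]*):'
}

# Fetch a URL with "Cache-Control: no-cache", discard the body,
# dump the response headers to stdout (-D -) and filter them.
show_cache_headers() {
    curl -s -o /dev/null -D - -H 'Cache-Control: no-cache' "$1" | filter_cache_headers
}

# Usage (hypothetical URL):
# show_cache_headers 'http://example.com/video.mp4'
```

If the second run shows a growing `Age` value or a hit marker in an `X-Cache` style header, the response most likely came from a cache in front of the origin, whatever request header was sent.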

user171959

score 2 · Answer 3 · answered Nov 24 '18 at 09:51

You can add a random query string using the $RANDOM shell variable (available in bash, ksh and zsh, but not in plain POSIX sh):

curl --location --silent "https://git.io/lsf-e2e?$RANDOM"

This worked for me on GitHub raw files.

Édouard Lopez

`$RANDOM` is a good idea, but `?$RANDOM` might result in a bad request. For a valid query string, `?foo=$RANDOM` would work. – Dario Seidl Nov 15 '21 at 12:51

score 1 · Answer 4 · answered May 26 '18 at 01:53

I've used this curl command with a cache buster parameter.

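curl "http://example.com/static/changing_file?_=$(date +%s)"

date +%s prints the seconds since the epoch. If you call the URL more than once a second, use date +%s.%N to add nanoseconds (%N is a GNU date extension). The URL is quoted so the shell does not treat the ? as a glob character.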
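
The cache-buster idea can be wrapped in a small helper so it also works for URLs that already carry a query string. This is only a sketch under assumptions: the function name `cache_bust_url` is invented here, the parameter name `_` is taken from the command above, and it assumes the server ignores unknown query parameters (which, per the question's comments, a token-protected Akamai URL does not).

```shell
# Append a throwaway "_=<timestamp>" parameter so each request
# presents a distinct URL to any cache keyed on the full URL.
cache_bust_url() {
    url=$1
    # %N (nanoseconds) is a GNU date extension; other date
    # implementations may print it literally instead.
    stamp=$(date +%s%N)
    case $url in
        *\?*) printf '%s&_=%s\n' "$url" "$stamp" ;;  # already has a query string
        *)    printf '%s?_=%s\n' "$url" "$stamp" ;;  # no query string yet
    esac
}

# Usage:
# curl -s -o /dev/null "$(cache_bust_url 'http://example.com/static/changing_file')"
```

Choosing between `?` and `&` avoids producing a URL with two question marks, which some servers reject.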

Martlark

Or you could use $RANDOM instead of appending the nanoseconds. Sure, it's not the prettiest (or most concise) thing, and it kind of merges two solutions, but it does not require nanosecond precision. – Gustavo6046 Jun 05 '20 at 22:27

score -2 · Answer 5 · edited Oct 06 '15 at 22:11

Maybe your DNS resolver is caching the name resolution, and that is the reason for the difference in response time.

It's only a theory.
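
One way to test this theory is to have curl report how long each phase took, using its documented `--write-out` timing variables. A minimal sketch follows; the function and variable names are invented for illustration and the URL in the usage line is hypothetical.

```shell
# Timing phases reported by curl's --write-out (-w) option, in seconds.
CURL_TIMING_FORMAT='dns=%{time_namelookup}s connect=%{time_connect}s first_byte=%{time_starttransfer}s total=%{time_total}s\n'

# Download the URL, discard the body, and print the timing breakdown.
timing_breakdown() {
    curl -s -o /dev/null -w "$CURL_TIMING_FORMAT" "$1"
}

# Usage: run it twice against the same URL and compare the numbers.
# timing_breakdown 'http://example.com/video.mp4'
```

If `dns=` is roughly the same in both runs while `first_byte=` and `total=` drop sharply, name resolution is unlikely to explain the difference, and a cache on the server side is the more plausible cause.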

Falcon Momot

answered Oct 05 '15 at 23:34

user315010
