32

My question is about Virtual Machines and delivering their content over the servers connection to the internet.

I have an Ec2 windows instance, and its network connection appears to be 100mbps

If I was to be delivering content from that EC2 instance, is THAT my potential bottleneck?

How does s3 differ, I am guess their is no real potential outbound bottleneck with s3?

Note : I know s3 and their CDN would be better for static content, however I need to explore this situation for now. Our HTML pages need to access a server side page via AJAX, and because there is no bombproof work around for this at the moment our content and our server needs to be on exactly the same domain, so it rules out using S3.

Bandwidth needed : I am not sure, we could have up to 100 users downloading videos at any time, probably no more. Videos can be up to 5mb each, but they would view up to 20.

Junior Mayhé
  • 185
  • 1
  • 10
Keeno
  • 611
  • 3
  • 9
  • 18
  • " we need our HTML to access a proxy via JS" ... What does that mean? – Eric Hammond Oct 26 '11 at 11:39
  • How much bandwidth do you expect to need to use at peak? – Eric Hammond Oct 26 '11 at 11:41
  • @EricHammond I have updated my question with some more info – Keeno Oct 26 '11 at 14:08
  • It doesn't sound to me like you have any particular reason to be hyper focused on any specific potential bottlenecks at this point. Build your service using common best practices for scalability and then investigate and solve performance issues as they arise. Serve videos through a CDN. – Eric Hammond Oct 26 '11 at 21:13
  • "there is no bombproof work around for this" ... I have difficulty believing that you need to use the same hostname. You might post a separate question describing the problem you are facing there. – Eric Hammond Oct 26 '11 at 21:15
  • @EricHammond http://en.wikipedia.org/wiki/Same_origin_policy – Keeno Oct 27 '11 at 08:28
  • 1
    Keeno: I'm familiar with same origin policy in JS, but still don't see why it would prevent you from serving the video content with a different domain (CDN). This is common practice. – Eric Hammond Oct 27 '11 at 17:08

2 Answers2

62

I can't speak for Windows instances, but I will presume that their base characteristics are fairly similar to Linux instances.

Your estimate for bandwidth usage is 100 simultaneous video downloads (I am not sure if you mean downloading the file or streaming the video - I will assume the latter). If we take a stream rate of 512kbps, you need about 51Mbit/s or 6.5MB/s.

EC2 instances differ in their I/O performance (which includes bandwidth). There are 3 levels of I/O performance: low, moderate, and high. Keep in mind, though, that disk I/O (i.e. from EBS volumes) also is bandwidth dependent. You can only really consider bandwidth within the EC2 network (as it will be completely variable over the Internet).

Some typical numbers to quantify 'low', 'medium', and 'high' (different sources quote different numbers for theoretical values, so they might not be completely accurate).

High: Theoretical: 1Gbps = 125MB/s; Realistic (source): 750Mbps = 95MB/s

Moderate: Theoretical: 250Mbps; Realistic (source, p57): 80Mbps = 10MB/s

Low: Theoretical: 100Mbps; Realistic (from my own tests): 10-15Mbps = 1-2MB/s

(There is actually a 'very high' level as well (10Gbps theoretical) but that applies only to cluster compute instances only).

A further point of mention is the degree of variation. On smaller instances, there is more variability in performance as the physical components are shared between more virtual machines. Regardless, you can expect around +/-20% variation in your performance (sources: 1, 2, 3). In your case (as per the assumptions/calculations at the top), you may need peak bandwidth of 13MB/s (double 6.5MBps, since disk I/O is also network limited). If you are transferring lower bandwidth content, you should be able to use an instance with 'moderate' I/O performance (see the instance types page), if your calculations result in a higher bandwidth requirement, you will need an instance with 'high' I/O performance. Simply streaming the data should not be CPU or memory bound, but sustaining 100 simultaneous connections will probably require at least a medium sized instance - and if bandwidth is a concern, based on the above, a large instance would be a safer bet).

I would recommend benchmarking the servers you launch to see if they meet your (calculated) needs. Launch two instances (of the same type) and run iperf on each using the instances' private IP addresses - you will need to open port 5001 in your security group if you run it with the default settings). Additionally, most tests outside of the EC2 network show results of between 80-130Mbps (large instances) - although such numbers are not necessarily meaningful.

A CDN would be better suited to your needs, if your setup permits it. S3 appear to have a limit around 50MB/s for bandwidth (at least from a single instance) as per this article, but that is higher than what you should require (S3 does not support streaming). Cloudfront would be better suited to your task (as it is designed as a CDN) and supports 1000Mbps=125MB/s by default (source) with higher bandwidth available on request and can stream content as well)

cyberx86
  • 20,620
  • 1
  • 60
  • 80
  • Where do you get your "theoretical" Mbps values? I cannot find any official or unofficial source of documentation for these numbers. – dtheodor Jun 22 '15 at 08:59
  • 4
    Those numbers are likely quite different by now (and I can't seem to track down sources for them at the moment). A lot has changed in 4 years. It is known that EC2 has 10Gbps interconnects within cluster groups. AWS used to (in 2007) quote a figure of 250Mbps dedicated local bandwidth. Currently, network bandwidth is shared between instances, with priority allocated based on instance type (and possibly factors such EBS optimization). At least in part, instance size determines how many other instances you share resources with. If the network is unused, your instance gets a larger share. – cyberx86 Jun 22 '15 at 18:24
  • Sorry, this is off topic, but I don't know how else to contact @cyberx86 but just letting you know that the ssl cert for your website expired a bit over a week ago. Apologies for commenting here, but can't find an email address for you. – Kevin Lyda Aug 03 '16 at 09:49
  • 1
    @KevinLyda Thanks. While I did know that the certificate had expired, that site hasn't been updated for some time - so this was a good opportunity to migrate to automatic renewals with LetsEncrypt. – cyberx86 Aug 06 '16 at 03:10
  • Yep, been using it myself. Very easy to script (particularly with the non-official clients) and get working away in the background. – Kevin Lyda Aug 07 '16 at 19:34
0

The numbers seem to change over time and as the number of different instance types proliferate. But an number of people post benchmarks. I've had some luck by googling [instance category] ec2 network benchmark.

For example, I wanted to know the bandwidth of an m4.xlarge instance, so I searched ec2 m4 network benchmark. I found this test result from the Washington Post engineering blog:

Type       Cost    Bandwidth Bandwidth Cost (1Gb/s) Bandwidth Cost EBS optimized
...
m4.xlarge  $0.252  791Mb/s   $0.320                 $0.320
...
Simon Woodside
  • 456
  • 1
  • 7
  • 15