Unlike the rigid infrastructure of on-premises clusters, EMR decouples compute and storage giving you the ability to scale each independently and take advantage of tiered storage of Amazon S3. With EMR, you can provision one, hundreds, or thousands of compute instances to process data at any scale. The number of instances can be increased or decreased automatically using Auto Scaling (which manages cluster sizes based on utilization) and you only pay for what you use.

