If your social media newsfeed is anything like mine, then it is regularly filled with videos, articles, and memes related to audio, intermixed with common viral content.

There have been several posts floating around recently that caught my attention. Some have been explicitly related to audio, while others were not. But, they got me thinking about audio things anyway.

Visual Analogies for MP3 Compression

To begin with, there is a popular meme that shows a picture of Scarlett Johansson in various stages of make-up and Photoshop editing. This picture is meant to provide a visual analogy to the several stages of music production – from mixing to mastering. At the bottom is a pixelated version of the “Mastering” photo. This degraded version represents “MP3 compression.”

Around the same time I saw the picture of Scarlett Johansson, I also saw a different post of a TED Talk on MP3 compression.

The music your mp3’s are made of | Ozark Henry | TEDxLiège If your social media newsfeed is anything like mine, then it is regularly filled with videos, articles, and memes related to audio, intermixed with common viral content. There have been several posts floating around recently that caught my attention. Some have been explicitly related to audio, while

The speaker in the video discusses the concept of audio compression by using a similar visual analogy: Pablo Picasso’s Guernica. Audio compression is demonstrated by shrinking the size of the painting to miniscule dimensions.

Besides the visual picture, the speaker provides an explanation of his perspective, implying that MP3 compression removes the energy and soul of a musical recording.

Beyond just removing intangible elements of a recording, more specifically it is implied from the visual analogy that MP3 compression uniformly removes data from a recording.

The speaker goes on to explain how we perceive the missing data, “psychoacoustics helps you restore what has been blanked out.”

A similar claim and analogy is provided by Andrew Scheps in this video.

Audio Quality in Streaming Media | Andrew Scheps | Talks at Google If your social media newsfeed is anything like mine, then it is regularly filled with videos, articles, and memes related to audio, intermixed with common viral content. There have been several posts floating around recently that caught my attention. Some have been explicitly related to audio, while

In this video, MP3 data compression is illustrated using a text example where specific letters are removed. The claim is that our brain can still interpret the meaning of the words even with the missing characters because our “brain fills in missing information.”

Are These Analogies Appropriate?

However, all of these analogies do not properly explain MP3 compression. These claims are over-simplistic and insufficient, even to the point of being misleading and incorrect.

MP3 compression does not uniformly remove information or uniformly degrade quality from a song (with the exception of the very lowest bit-rates that uniformly remove high frequencies).

MP3 compression does not remove information that our brain can obviously notice and fill in.

Rather, MP3 compression selectively removes information that our brain would never receive anyway.

ADVERTISEMENT

MP3 compression works because the cochlea in our auditory system has limitations, not because our brain fills in the gaps.

Cochlear Mechanics

When sound enters our ears, it passes through several parts of your auditory system, including the cochlea, before sound ever reaches our brain.

When sound vibrations pass through our cochlea, the vibrations travel through fluid. Lower frequencies travel further through the fluid, and therefore travel further through the cochlea. Higher frequencies are absorbed sooner in fluid, and do not travel as far through the cochlea.

Hair cell detectors located along the cochlea are associated with various frequencies, from low to high. Because lower frequencies travel further through the cochlea, there is the potential for loud low frequencies to stimulate the hair cell detectors for low frequencies, as well as the hair cell detectors for high frequencies.

Therefore, loud lower frequencies have the potential to mask quiet higher frequencies. This masking means the brain will not receive the additional information from the higher frequencies because it is lost due to the lower frequency information.

The conceptual question then becomes, “if there is information in an audio signal that will never reach the brain for analysis, then is the extra information worth sending through the auditory system to be lost?”

Your Ears are Colorblind

In my opinion, a better visual analogy for MP3 compression is to consider color blindness. Typically, people who are colorblind can see some colors, but cannot perceive a difference between certain colors.

Color blindness does not occur because a person’s brain fails to interpret different colors.

Color blindness occurs if light-sensitive cells in the retina fail to respond appropriately to variations in wavelengths of light that enable people to see separate colors.

Consider what would happen for someone who is colorblind if an image contains colors that that cannot be differentiated.

There is a viral post I recently saw about Bob Ross painting a picture for a colorblind fan.

In this example, the painter chose to limit himself to only using colors that could be perceived by the viewer. Colors are intentionally excluded that cannot be perceived.

MP3 compression is very similar. It is a process to limit the information in an audio signal to only what can be perceived by the listener.

Conclusion

If you want a visual analogy for MP3 compression, it is a beautiful painting created specifically for your perceptual system.





Eric Tarr Eric Tarr is a musician, audio engineer, and producer based in Nashville, TN. Currently, he is a Professor of Audio Engineering Technology at Belmont University.