A Simple Guide to Digital Music File Formats

by Jack Sharkey for KEF
Vinyl has made a resounding comeback for a variety of reasons, mostly having to do with the vinyl experience. CD sales may have diminished, but the CD isn’t going anywhere soon, and even the lowly cassette tape is making a mostly un-ironic comeback. But digital music is not only here to stay, it’s the king of the playback hill. With that in mind, we thought we’d offer a little guide to what all of those file extensions and terms actually mean.

Compression: Back when digital music files and playback devices were first developed, storage was limited and expensive. A standard audio file can take up a serious amount of room on your hard-drive or handheld device, so ways to eliminate some of the data in a digital music file were developed to help save on storage space. Generally speaking today, storage is not an issue except for phones and other convenient portable music players, so compressed music still has a place.


Mumbo-jumbo: A CD generally compresses at a rate of 10:1, meaning roughly 10Mb of data per minute of music.


Lossy Compression: Refers to file compression where some original data is lost during compression. Once that data is lost, it cannot be retrieved. There are two schools of thought about lossy compression: The first is that only audiophiles and hipsters will notice a difference, but unpretentious folk won’t (paraphrasing an article on the subject I read once), but maybe a better way to look at it is that there is an original work of art, and with lossy compression a machine has decided that some aspects of the original work of art are superfluous. Again, what matters here is what you expect from your music.


Compressed music (lossy) does not sound as live or real as uncompressed music sounds but if you’re listening to music exclusively on a phone or other handheld device that really doesn’t matter much. It’s when you start listening on higher quality equipment with the expectation of a higher-quality listening experience that lossy files show their limitations. 


Lossless Compression: Refers to file compression where the original data is preserved during compression and can be extracted upon decoding into an exact replication of the original.  

Definitions

MPEG is an acronym for Moving Pictures Expert Group, the organization responsible for developing standards for audio and video coding. 


Bit Rate:  The number of bits that are transmitted or processed per unit of time (e.g. 320kB/s means the device is transmitting and processing 320,000 bits per second).


PCM: Pulse Code Modulation, standard format for digital representation (reproduction) of audio. PCM is the standard for CD, computers and pro-audio applications. It is both uncompressed and lossless, and is typically used as the source file in audio and music applications.


Speaking the Language: There are two things we need to consider with digital audio that are often mistakenly interchanged which results in a ton of confusion.


Audio Coding Format: The compression format (codec) used for digital audio, such as mp3, AAC, FLAC, etc. This is the way the audio is manipulated and compressed, or decoded for playback.


Digital Container Format: This is how the data is stored and described (via metadata) to the decoding software. Specific container, or data wrappers, can only be decoded by specific formats – some formats won’t work with some containers. Examples of container formats are AIFF and WAV.


Basically, raw audio data is stored in a container and prepared for storage or playback via the coding format (codec). It’s important to differentiate the two, especially when trying to identify higher resolution audio players and files.

The following list of file formats is listed more or less in the order you will come across them in the real world.


mp3:  Developed by the MPEG organization and by far the most widely supported file type. Originally intended as an audio layer in video files and not for music.


Pros: Small file size and faster data transfer requires less storage and processing power (i.e. perfect for handheld devices).

Cons: Uses lossy compression so some sound quality is lost.

Bitrate: 8kb/s to 320kb/s

Sampling Frequency: 16kHz to 48kHz (44.1 is the sampling rate for standard CD encoding)

Most Commonly Used: Portable devices or devices with low processing power and storage capabilities.

AAC (.m4a): Used by Apple iTunes , Advanced Audio Coding is the next-generation mp3 and was intended specifically for music. 


Pros: Same audio quality as mp3 but takes up less storage space.

Cons: Uses lossy compression so some sound quality is lost.

Bitrate: 8kb/s to 320kb/s

Sampling Frequency: 8kHz to 96kHz (when using specific encoding formats)

Most Commonly Used: iOS, Android and most gaming devices. 

WAV: Waveform Audio Format File was developed by Microsoft and is the standard format for all Windows-based audio files and algorithms.    


Pros: Uncompressed and lossless. Basically an exact reproduction of the original source.

Cons: Requires far more storage space per file than compressed formats.

Bitrate: Bitrate generally refers to lossy formats, so it is not of concern with this format.

Sampling Frequency: 44.1kHz, or two times (plus) the upper limit of human hearing as per the Nyquist Theorem. This is the Redbook CD standard for audio.

Most Commonly Used: For archiving and editing audio files where any loss of data (quality) is not acceptable.

WMA: Windows Media Audio was developed by Microsoft as a competitor to the mp3.   


Pros: Newer codecs offer a lossless option.

Cons: Uses lossy compression so some sound quality is lost. Limited device availability.

Bitrate: Up to 20kb/s

Sampling Frequency: 8kHz to 48kHz

Most Commonly Used: Most portable and home players, but very few handheld devices without a separate driver download.

AIFF: Audio Interchange File Format was developed by Apple and is basically Apple’s counterpart to WAV.   


Pros: Uncompressed and lossless. Basically an exact reproduction of the original source.

Cons: Requires far more storage space per file than compressed formats.

Bitrate: N/A

Sampling Frequency: 44.1kHz.

Most Commonly Used: For archiving and editing audio files where any loss of data (quality) is not acceptable.

FLAC: Free Lossless Audio Codec was developed by Xiph.org and is royalty-free and open.   


Pros: Uncompressed and lossless but with up to a 40% smaller file size over WAV and AIFF.

Cons: Not supported by most Apple devices and formats (iOS, iTunes and Apple Music).

Bitrate: N/A

Sampling Frequency: Up to 96kHz (32-bit word).

Most Commonly Used: For archiving and replicating original audio files and sources. Provides better playback quality than lossy formats.

ALAC: Apple’s version (Apple Lossless Audio Codec) of FLAC is similar in performance to FLAC in terms of quality and file size.   


Pros: Uncompressed and lossless and compatible with all Apply devices and formats.

Cons: Requires more storage space per file than compressed formats, but less than WAV and AIFF.

Bitrate: N/A

Sampling Frequency: Up to 96kHz (32 bit word).

Most Commonly Used: For higher quality playback with Apple devices and formats.

Master Quality Authenticated (.mqa): MQA is a 48kHz/16-bit codec that can be containered with any lossless file format such as FLAC. MQA uses a hierarchal encoding algorithm that packs the differing frequency layers during compression and requires specifically designed hardware players for decompression, although MQA is supported by the Tidal desktop application. The unpacked audio is restored to the original.



.cda: Is the common file extension used by Windows Audio for each audio track on a Redbook CD. It is not actually an audio file but is a 44 byte (352 bit) pointer file that operates as a road map of sorts to the actual location of the audio data on a CD. The confusion stems from the fact that some audio editors and CD burners sometimes appear to the user as if the actual .cda file is being played.



.m4p: Is an MPA AAC file that is DRM (Digital Rights Management) copy protected. An m4p file can only be played on an authorized computer or player (typically a user’s iTunes or Apple Music authorized computer, etc.). For the dyslexic among us, don’t confuse an m4p file with an mp4 which is a video file format.



MPC: Formerly referred to as MPEG+, Musepack is an open source lossy format that has a variable bitrate based on the requirements of the musical passage being encoded.  MPC is a lossy compression codec, but because it has a variable bitrate, it produces a higher quality audio than standard mp3.  



Vorbis: A patent-free open-source codec that was developed as an alternative to proprietary encoding formats such as mp3, AAC, and VQF (Yamaha’s version of the mp3). Xiph.org oversees Vorbis and the container format OGG which Vorbis typically uses. This format is generally referred to as OGG Vorbis.



Monkey’s Audio (.ape): .ape files are compressed but are lossless so they may be decompressed into exact replications of the original file. They also generally require about half the storage space of an uncompressed raw audio file.



Real Audio (.ra): Real Audio Networks developed .ra files in the mid-90s and the format was used by radio stations for their online streaming, although use of .ra has diminished as other formats have been developed.

True Audio (.tta): True Audio’s open-source lossless audio compression coded is widely supported by Windows and Apple platforms. .tta files can compress the original up to 70% but are mostly similar to FLAC and APE files in performance.

Welcome to myKEF

Join Our myKEF Community

Become a member and unlock exclusive offers and benefits.

Create Your Account
Register Your Product Account Details Order History
You have no items in your cart
Any applicable discounts will be applied at checkout
You haven’t sign in yet. Sign in or create account to enjoy the most of your KEF experience.
Visa Mastercard American Express Apple Pay Google Pay Discover

SELECT A REGION