RegisterRegister 
 
The Ingenious MP3 Compression Algorithm

 
Post new topic   Reply to topic    Absolute Insight Forum Index -> Articles

Haggs Reply with quote
AI Team Member


Joined: 05 May 2004
Posts: 2723
Location: Minneapolis, Minnesota

PostPosted: Fri Oct 08, 2004 2:13 pm    Post subject: The Ingenious MP3 Compression Algorithm
 
Over the past 5 years MPEG Audio Layer 3 files, more commonly known as MP3s, have flourished due to a very creative and intelligent compression algorithms. While literally millions, if not billions, of people have hundreds, if not thousands, of these MP3 files on their computers, few probably understand and appreciate what a gem the MP3 really is. Prior to the development of MP3 technology, music was typically found in WAV file format which has a ten times greater file size. MP3 listeners know that there is virtually no difference in noticeably audio quality between listening to CDs directly versus listening to an MP3 encoded at 128kb/s (kilobytes per second). How could a file be compressed ten fold without hardly any loss in audio quality?

MP3 files use "lossy compression" with "psychoacoustic algorithms". Lossy compression is defined as compression that occurs from taking away bits of information. In terms of an MP3, this means that an MP3 ripper removes audio information from the original WAV file. However, this lossy compression is done with psychoacoustic algorithms. Psychoacoustic algorithms are methods that describe how to selectively remove audio information that the human ear would most likely be unable to notice anyways.

For example, the psychoacoustic algorithms usually leave the 1 KHz to 4 KHz audio frequencies untouched, because this is the frequency range that the average human ear can hear the best. The algorithms also remove quieter audio sounds that are playing at the same time as louder ones, since the quieter audio sounds would not likely be noticed present or not. The third way that the algorithms are able to compress the file is by converting stereo signals into mono signals. Deep bass is difficult for the human ear to detect direction, so it is very difficult for the average ear to tell whether the bass sounds are coming from the right or the left, or mono. Since mono audio requires fewer bits of information, the file is compressed further. Last, the file size is further reduced by optimizing the data for the most frequently occurring signals.

So next time you queue up your play list in Winamp, iTunes, or your favorite MP3 player, you can appreciate the intellectual genius behind each track.
Back to top
kompakt Reply with quote
Innovator


Joined: 17 Jun 2004
Posts: 1475
Location: Eden Prairie, MN

PostPosted: Fri Oct 08, 2004 2:36 pm    Post subject:
 
genius indeed
Back to top

Haggs Reply with quote
AI Team Member


Joined: 05 May 2004
Posts: 2723
Location: Minneapolis, Minnesota

PostPosted: Fri Oct 08, 2004 2:39 pm    Post subject:
 
What I wonder now is.. could similar theory be applied to graphics engines and image files? Of course there is a limitation to the way the eye percieves what it sees. Perhaps someone has already adapted the lossy compression with psychovisual algorithm to some sort of image technology. It's definitely worth checking into I think.
Back to top

Impact Reply with quote
Site Admin


Joined: 01 May 2004
Posts: 3258
Location: Pinecrest, FL

PostPosted: Fri Oct 08, 2004 3:50 pm    Post subject:
 
I dont think so. As far as I know from the little bit of OpenGL and Direct3D modeling, you need every bit of information (for meshes), for images (2d), I'm not sure. I don't think so, because yet again, theres no pixels overlapping, theres no colors in images we cant see... etc
Back to top

smoothgreenrhythms Reply with quote
Sub-Modder


Joined: 20 Sep 2004
Posts: 162
Location: East Lansing, MI

PostPosted: Mon Oct 11, 2004 1:59 pm    Post subject:
 
some people hate MP3s and hate people who use MP3s, they bitch about how good flac is. These people have no appreciation for convenience; flac files save all sound quality, but they are huge, and take up too much room. I guess they are cool if you want one CD.
Back to top

Clockwork Reply with quote
Sub-Modder


Joined: 06 Aug 2004
Posts: 120

PostPosted: Mon Oct 11, 2004 4:52 pm    Post subject:
 
Yeah, there's a lot of stupidity going on with the FLAC supremists. In all reality, the sound difference is barely noticeable. Also - you have to consider what audience we're talking about. People listen to the radio. The radio. The sound quality there is nowhere near a CD even.

Visual information is a lot harder to compress for so many reasons. Two big ones are usability and cross-format compatibility. You have to consider that most music being sent around might as well be read only. If you were to take mp3 compressed music and try to alter pitches etc. it chops up pretty quickly from the missing information. Also, with images, they're sent across so many different monitor resolutions and color calibrations, any change in the amount of information is readily noticeable on at least one person's PC.

I had to crank out a speech on the subject in communications (speech) last year. I think I got like a B+...which was bull, since I really knew my stuff at the time.
Back to top

Impact Reply with quote
Site Admin


Joined: 01 May 2004
Posts: 3258
Location: Pinecrest, FL

PostPosted: Tue Oct 12, 2004 10:09 am    Post subject:
 
Matt brings up a good point with the read-only arguement, didn't even think of that.
I prefer FLAC to MP3 when I can get it. At 4x the size, its not going to fill 320gb anytime soon, and I think its worth it so i can hear CD quality. I swear I can hear a difference, but it might just be my mind playing tricks on me...
Back to top

kompakt Reply with quote
Innovator


Joined: 17 Jun 2004
Posts: 1475
Location: Eden Prairie, MN

PostPosted: Thu Oct 14, 2004 10:10 pm    Post subject:
 
Quote:

What I wonder now is.. could similar theory be applied to graphics engines and image files?


i think its possible, you would just have to think of a really clever way to preserve all the things in the image, which impact mentioned.
Back to top
Post new topic   Reply to topic    Absolute Insight Forum Index -> Articles
Page 1 of 1

Choose Display Order
Display posts from previous:   
User Permissions

 
Jump to:  


Skin Created by: Sigma12
Powered by phpBB © 2001, 2002 phpBB Group