|
|
Welcome to the Exploding Garrmondo Weiner Interactive Swiss Army Penis. |
GFF is a community of gaming and music enthusiasts. We have a team of dedicated moderators, constant member-organized activities, and plenty of custom features, including our unique journal system. If this is your first visit, be sure to check out the FAQ or our GFWiki. You will have to register before you can post. Membership is completely free (and gets rid of the pesky advertisement unit underneath this message).
|
|
Thread Tools |
Chocorific |
For those interested in the (mathematical) details of audio compression:
Wikipedia has a large number of (quite good) articles about different subjects: Discrete cosine transform - Wikipedia, the free encyclopedia Modified discrete cosine transform - Wikipedia, the free encyclopedia Linear prediction - Wikipedia, the free encyclopedia Simple linear prediction is very easy math, but for transform coding you need a lot of advanced knowledge about analysis/calculus. There's nowhere I can't reach. |
So most lossy audio codecs store frequency information (as opposed to storing points of amplitude). On the other hand, most lossless audio codecs store small efficient equations that predict pieces of the waveform pretty well, trying to have as little error (and thus, sample adjustments to correct it) as possible.
Am I understanding it right? This thing is sticky, and I don't like it. I don't appreciate it. Good morning, post-apocalyptia! |
Chocorific |
Yep, that's the basic concept. Lossy audio codecs do a "transform" (fourier, fast fourier, discrete cosine, wavelet, etc.) before doing their "real" works, which consists of deciding (smart) which information can be dropped without perceived loss of quality (quantize).
These transforms also have nice properties for derivation when looking at the fourier transformation, which makes them quite interesting. It took me however nearly four semesters (analysis I+II for the basics, complex analysis and analysis III) to fully understand the concept (and be able to proof why it works). I guess it's a bit easier with prediction coding, but I didn't look into that very deep. However fitting polynoms to arbitrary functions can be tricky as well :-) You see that comparing the two compression methods is not really possible. Lossy is happening in frequency domain, lossless in time domain. What would be interesting: to see someone combine prediction and transform coding (again resulting in a lossless bitstream). Anyone aware of such a codec? I am a dolphin, do you want me on your body? |
Encoding with 320cbr (preset-insane) also changes other settings, not only the bitrate. But the extra quality is probably not perceivable and therefore wasted. Some of the guys "with golden ears" over at hydrogen audio can distinguish subtle artifacts on test samples with V2 that need V0 or even 320cbr to sound absolutly transparent. There's also this nice chart, visualizing the quality gain (notice the jump in filesize from -V0 to 320cbr): Spoiler:
I myself use -V0 for classical music, soundtracks and the like. Just to have some headroom, to make sure the sound is flawless. The filesize increase is realy minor compared to 320cbr or lossless. I was speaking idiomatically.
Last edited by sup!; May 31, 2008 at 01:51 AM.
|
And, wow, reading that HydrogenAudio wiki page you linked, it seems that VBR will use the bit reservoir too if necessary. And here I had thought that VBR encoding did not have a bit reservoir at all! The more you learn...
What kind of toxic man-thing is happening now? Good morning, post-apocalyptia!
Last edited by Moguta; Jun 10, 2008 at 05:14 PM.
|
helo.
since i cant vote (Not a poll here) i just leave my 2 cents.. From what i've understood: the sound quality of 192 CBR is roughly the same as that of LAME V2 (which has a ~190 target -- but which is seldom reached in practice, i.e. never with so-called historical recordings à la Horowitz, Glenn Gould, Schnabel, etc.). the main difference being the resulting file size. [1] a 'full-sound' 80mins. Audio-CD ripped @ 192 CBR ===> ~120MB (1A sound) [2] a 'cheap-sound' 80mins. Audio-CD ripped @ LAME V2 ===> ~60MB (1A sound) But since i am used to 100-120MB sized RAR-archives per disc (classical music) i now tend to rip to LAME V0...to reach "my targeted" 100-120MB's. Hence: [3] a 'full-sound' 80mins. Audio-CD ripped @ LAME V0 ===> ~120MB (1A+ sound) [4] a 'cheap-sound' 80mins. Audio-CD ripped @ LAME V0 ===> ~100MB (1A+ sound) [1] vs. [3] === same file size target reached. [3] has higher sound quality. [1] vs. [4] === my file size target reached. [4] has higher sound quality, and even smaller file size. With LAME V2 i never reach my file size target...and its sound quality is always less than LAME V0 (per definitionem), so why should i stick to LAME V2? Ergo, LAME V0 is now my 1st choice. Even if i cant hear the sound quality difference between V2 and V0 (because my Sansa mp3-player is §$%&! and its headphones/earphones even more so ) i like the simple MP3 concept that 1.0min = 1.0MB(128cbr), or that 1 Audio-CD(74mins.) = 100MB(mp3). The LAME V2 setting ruins that beautiful calculation principle. FELIPE NO
Last edited by Lousy; Sep 27, 2008 at 12:48 PM.
|