|
||
|
|
|||||||
| Welcome to the Exploding Garrmondo Weiner Interactive Swiss Army Penis. |
|
GFF is a community of gaming and music enthusiasts. We have a team of dedicated moderators, constant member-organized activities, and plenty of custom features, including our unique journal system. If this is your first visit, be sure to check out the FAQ or our GFWiki. You will have to register before you can post. Membership is completely free (and gets rid of the pesky advertisement unit underneath this message).
|
![]() |
|
|
Thread Tools |
Rimo, you bring up a very good question. And I agree that you would probably be well-served sticking to -V2. It is my view that "upgrading" to -V0 does little more than offer the comfort that your audio is in higher bitrate, consuming that additional space without sounding significantly better.
I want to know, honestly, how many people discussing this have tried a scientific ABX listening test to evaluate what settings work for them? (ABX test software is easily available, via a component in the Foobar2000 audio player.) Because when differences in sensory input become small, such as when comparing high-quality encoding modes, subconscious psychological biases can easily distort one's perception. Only double-blind tests eliminate this strong suggestive bias. They do this by removing the listener's ability to know which sample is which, such as knowing that a certain sample is encoded at a higher bitrate, knowledge that often distorts the listener's perception such that they falsely "hear" it as being better because they expect it to be better. To illustrate my point, recently I posted on OverClocked ReMix about how the size-restricted encode of Valse Aeris on the site had very noticeably jarring artifacts because it was encoded with an average bitrate of 103Kbps. This was while listening on my Sennheiser HD 280 Pro headcans. After people disagreed with my assessment of it sounding so horrible, I tried listening to it on my Klipsch ProMedia 2.1 speakers. The difference I heard clearly on my headphones I actually couldn't hear AT ALL on my speakers, even at a bitrate as low as 103Kbps! So while I could easily ABX the difference between Valse Aeris encodes 12 out of 12 times on my headphones, I gave up trying to ABX the difference on my speakers after a few trials when the program basically said I could toss a coin and get the same results. Before going on, I would like to note that I was ABX comparing the 103Kbps ABR encode against a -V2 --vbr-new encode of about 198Kbps. After noticing that the bitrate could actually be bumped up from 103Kbps and still fit OverClocked ReMix's size limit, I fooled around and was able to get it to an average bitrate of 114Kbps staying within the limit. And the most surprising thing for me was that with only this 11Kbps average increase, the audio artifacts that were formerly so very audible and evident on my headphones became surprisingly subtle! And yet -V2, the command line I live by, encoded it an entire 95Kbps higher. With that drastic a decrease in flaws with only an 11Kbps increase, I somehow doubt that I would need a mode as high as -V2, with its much more drastic increase, to get transparent sound. I also fooled around with -V6 while trying to fit Valse Aeris within the filesize limit. While it actually went over the limit (with a bitrate of 122Kbps), I noticed that it seemed to sound even better than the 114Kbps ABR (not surprising), but while I thought that I could still hear some subtle audio artifacts, I knew it would start to be a challenge to actually ABX test them at this point. And remember that was using just -V6, on my hi-fi headphones! And I couldn't hear the difference at all on my mid-level speakers! Looking at the Hydrogen Audio wiki for LAME encoding, notice that it specifies -V3 through -V0 as "High quality: HiFi, home or quiet listening", and additionally states that "-V4 --vbr-new should be close to perceptual transparency". After my experiences with Valse Aeris, I suspect that this wiki entry is quite close to the truth. -V2 ought to be more than fine for encoding all your music, and if one is interested in greater space efficiency, -V3 and -V4 may be worth testing out. (Note, that if you ABX test your own audio, you must know what to look for. Percussion that is high in the mix, especially cymbals and casanets, tends to artifact. And there are several testers and devs over at Hydrogen Audio that can tell you what other "problem" samples to look out for; just make sure you're talking to someone who actually knows their stuff.) Also, I'd like to correct a few statements I saw above: There is no reason to think that -V0 is the most you can get out of MP3, nor that -V0 is more tuned than any other preset. -b 320 will indeed get you the "best quality possible" because -V0 is unlikely to ever result in a 320Kbps MP3, even if there is enough audio information to be encoded to fill all those bits usefully. However, audible differences between 320Kbps and -V0 are probably about as unlikely to occur as those between -V1 and -V0, so this discussion about the "best quality of MP3" is really more academic than practical. And LAME 3.98 beta 5 is considered the "best" version of LAME, by what authority or consensus? Hydrogen Audio still lists the recommended version as LAME 3.97 stable. Be careful of being on the bleeding edge, using betas and such. While 3.98 beta 6 seems to have some obvious flaws, without prudent testing what's to say that 3.98 beta 5 doesn't have more subtle regressions? Jam it back in, in the dark. Good morning, post-apocalyptia!
Last edited by Moguta; Mar 24, 2008 at 10:48 PM.
Reason: typo
|
Along the lines of your own argument, I would propose that -V2 and perhaps even -V3 or -V4 are superior to -V0 because -V0 wastes bits with rarely any additional audible quality. However, I notice you do mention hearing artifacts in -V2 encodes. Can you provably demonstrate that you hear artifacts in -V2 that you do not hear in -V0, by way of ABX testing? I have never been able to pick up any differences, but I am not arrogant enough to deny you the opportunity to prove that you do hear something that I do not.
Also, I hope no one would be excited about downloading -V8 encoded music, considering they'd be getting 32KHz-resampled audio at bitrates around 85Kbps. I imagine it's enough to hurt anyone's ears. ![]()
For this reason, I really wish that #gamemp3s had not increased their encoding mode to -V0. Not only I am quite hesitant to transcode the files lower to save my own space, at risk of additional degradation, but I think that #gamemp3s' distribution would benefit from smaller torrents that take up less space on users' computers. If I remember correctly, the decision to increase the encoding standard was simply done by putting a poll on the website. It's no surprise that voters wanted the "upgrade", with the common mentality that higher bitrates and higher quality switches must be better than anything lower.
How ya doing, buddy? Good morning, post-apocalyptia!
Last edited by Moguta; Mar 25, 2008 at 06:15 PM.
|
I agree that CBR is certainly less than optimal. But I don't see how that means that "V0 maxes out what MP3 can give you". There may be some wasted bits, yes, but it would seem that 320Kbps CBR "maxes out" because it can still end up encoding more audio information than a -V0 encode. And not only that, but seeing that most lossless audio tends to have bitrates somewhere around 900Kbps, I can't help but doubt that there are many cases besides digital silence where 320Kbps worth of audio is not present. Of course, I could be wrong, since MP3 does store different information than lossless formats. But even if I'm wrong about that, I'm not sure that "VBR V0 is the end of MP3" or that "You can't get more quality without rewriting the standard". After all, why continue to develop LAME if no improvements can be made without a format rewrite? And additionally, one can always make a higher VBR preset. For example, a preset could encode everything at 320Kbps except if the frame did not have enough information to do so, so no bits are wasted. Of course, once we get the idea that "Best for lossy encodings is determined by perceived audio quality divided by bitstream size", it begins to seem apparent that an encoding mode might possibly be too high, even if it doesn't waste bits by the strict definition. And that's why I think that -V0 is an unnecessary ~%25 size increase, just as I'm sure my proposed 320Kbps VBR setting would be overkill.
I agree that it would be great if Vorbis had wider hardware adoption. But the ratio of effort to implement vs. consumer demand just seems too out of kilter for it to be worthwhile for manufacturers. This thing is sticky, and I don't like it. I don't appreciate it. Good morning, post-apocalyptia!
Last edited by Moguta; Mar 27, 2008 at 11:20 PM.
|
So most lossy audio codecs store frequency information (as opposed to storing points of amplitude). On the other hand, most lossless audio codecs store small efficient equations that predict pieces of the waveform pretty well, trying to have as little error (and thus, sample adjustments to correct it) as possible.
Am I understanding it right? I am a dolphin, do you want me on your body? Good morning, post-apocalyptia! |
And, wow, reading that HydrogenAudio wiki page you linked, it seems that VBR will use the bit reservoir too if necessary. And here I had thought that VBR encoding did not have a bit reservoir at all! The more you learn... ![]()
I was speaking idiomatically. Good morning, post-apocalyptia!
Last edited by Moguta; Jun 10, 2008 at 05:14 PM.
|