Research quality of coding sound by different mp3 encoders

By Aleksey Lukyanov

The format MP3 for high-quality encoding of sound becomes more and more popular. Initially developed for usage within the standards of video compression MPEG1 and MPEG2 it very fast has received distribution as the independent format. Saving excellent quality of sound at the large degrees of compression and more than active attempts of the corporation - developer IIS Fraunhofer to extract a maximal of the profit from the child became the main reasons of such turn of events.

In the basis MP3 is direct development MPEG Layer I and Layer II and psychoacoustic model for coding an original signal is also used in it. Because of it the process of coding is ambiguous and can be varied depending on the used encoder. This ambiguity means that having encoded the same signal by two different encoders we can receive after decoding two signals differing on sounding. It is obvious that that encoder which in the best way saves an original signal is preferable. The purpose of the given review is to clarify which one from modern encoders will give us the best result.

Technique of testing

Usually to compare initial and encoded signal the method of matching their Amplitude-Frequency Rates (AFR). There are two varieties of this method: matching average AFR signals and matching of change AFR in time. The first one is used more often because of its simplicity- matching is made only once. However at the average the significant part of the information about a signal is lost and as a consequence at absolutely identical AFR initial and encoded signal can rather differ in sounding if in a signal there are large on amplitude but rather short-term change of some frequencies. The second one allows to avoid such problems however it is necessary to make for implementation of testing so huge quantity of calculations with a floating point (fast Fourier transform gives rather quite good results for the analysis of the large blocks samples however for definition AFR in10-20 samples it is absolutely inapplicable) that all process of testing will take some days for a minute of signal CD qualities. To avoid the problem of shortage productivity the technique of testing based on the absolutely other principle was developed. The encoder MP3 actually is the "rough" archiver, at "unpacking" its production it is impossible to restore the initial information completely. So the quality of coding can unambiguously be estimated through percent of saving information. Thus if to subtract the encoded signal from the original signal the result will represent that part of the information which was lost during coding. Average value amplitude of this result will be the parameter defining quality of coding. It was called "average difference of signals" (ADS).

However, not all is so good in this method. For example, if to invert a signal concerning its zero level (or just to displace this level a bit) ADS will have the value of a double average level of a signal (or the value of offset zerolevel) in spite of the fact that the signal will sound identically to the initial one (and to have appropriate AFR).

Therefore another parameter called "average difference of waveforms" (ADW) became more applicable for the estimation of coding quality. For its definition the initial and coded signals are handled so that each sample instead of deviation amplitude of the speaker from the position of the balance is equal to the value of difference of these amplitudes for previous and this sample (such technique of representation a signal was used in ADPCM coding). For the already processed signals their ADS was calculated which for original signals is ADW. The main difference between ADW and ADS is in usage modules of their differentials for the estimation the difference of signals instead of difference in amplitude. Due to this the waveforms regardless of zerolevel's position are compared.

But despite of obvious disadvantages ADS it was decided to use both methods at testing. This solution was motivated by increased requirements to an encoder. Though the standard MP3 also was initially developed as "giving similar sounding, but not identical signals" (that is what words "psychoacoustic model" mean) the maximum saving all parameters of an original signal all the same is the primary task of an encoder because of their obvious correlation with similar sounding.

The disadvantages of such method of matching are obvious. ADS and ADW do not contain the information about in what exactly encoded signal loses quality in comparison with initial. Moreover, they cannot be the perfect characteristic the quality of coding at all. The situation when the signal with ADW close to zero for the man will sound the same as the signal with much greater ADW is possible.

But at the same time it is impossible for the signal with ADW close to zero to differ strongly in sounding from initial. Therefore ADS and ADW are redundant characteristics and guarantee that the difference between signals won't be more than some level. Therefore first stage of researches was definition of maximum values ADS and ADW at which human ear can not note differences in signals yet. For this purpose "white" noise of a different level was added to sine-wave signals different frequencies (from 200 Hz up to 20ÊHz) of maximum amplitude. The obtained signals were listened together with initial and those threshold values the level of "white" noise at which it is not audible behind the main sine wave signal were determined. For such noise level ADS and ADW were determined which were taken as "measurement standard". In case the coding of a signal gives ADS and ADW smaller than standard the further improvement quality of coding is represented useless. For the values ADS and ADW larger than standard there is a probability of heard difference in signals. However, because of usage psychoacoustic model this probability is rather small even for values ADS and ADW exceeding standard in some times.

Among the advantages of this method of matching it is necessary to mark its applicability to any methods of coding a signal. Already now there are some formats of coding a sound that are not yielding on degree compressions MP3 based not on saving the defined part of a frequency spectrum of a signal but on, for example, dynamic change of frequency encoding parameters. For restoring a signal in such methods interpolation techniques are used and the signal obtained in result saves average AFR initial despite of significant differences in local AFR. For such formats of coding the method of matching average AFR is inapplicable because it will give too overestimated results but ADS and ADW will describe quality of coding without problems. Therefore research was not limited by only MP3 format. For example, the rather young standard from Microsoft - WMA also has taken part in it. In process development of technologies and improvement of encoders the results of researches will be enlarged and updated.

Toolkit

For realization of researches two programs for operation with a sound and one programming language were used. Winamp v.62 with the decoder MP3 from IIS Fraunhofer (v.2.23) was used for decoding MP3 of files in WAV. Such choice was made because the majority of other decoders/players MP3 use special posts - signal processing for giving them "softer" sounding, while it is necessary for researches of quality coding of a signal in the pure state without any posts - processing. Then saved WAV the file with an encoded signal was exposed to processing in the sound editor GoldWave v4.16. By virtue features of the standard MP3 the encoded signal contains some (defined by an encoder) silence or little noticeable noise before the beginning of a signal. Therefore the real beginning of the sound data was found by the method of visual matching initial and encoded signal and the garbage prior to it was deleted. In several cases some first hundreds of samples from an original signal were not encoded at all. However, for saving exactness it was decided to consider such errors of encoders just as "inaccuracies" in saving a signal and therefore to leave them without notice for all the same they will affect on ADS and ADW.

For finding ADW and ADS using Delphi 3 two small programs were written. The first one created two files of the identical format, one of which contained ÐC and the second ÐÔC. The second program found average value given for files of the format results the first one.

The used encoders

	128 Kbit/s	256 Kbit/s	160 Kbit/s	VBR
Audioactive Production Studio v.1.0.5	+	+	-	N/A
Audioactive Production Studio v.1.5.4	+	+	-	N/A
Blade Encoder DLL v.0.91	+	+	-	N/A
GOGO DLL v.2.3	+	+	-	+
L3Enc v.2.61	+	+	-	N/A
Lame DLL v.3.7	-	+	-	N/A
Lame DLL v.3.83	+	+	+	N/A
Lame v.3.61	+	+	-	+
Microsoft WMA	+	N/A	+	N/A
MP3 Producer v.2.1 Professional	+	+	-	N/A
MP3Enc v.3.0	+	+	-	N/A
SCMPX v.1.5.1	+	+	-	+
Streambox Ripper v.2.009	+	+	-	N/A
XingMPEG Encoder v.2.20	+	+	-	N/A
(+) - the research was carried out spent; (-) - the research was not carried out; (N/A) - the format is not supported

Audioactive Production Studio v.1.0.5

http://www.telos-systems.com/

It is based on the code from IIS Fraunhofer and differs from the "native" code only by optimization on speed. It has the simple interface that however is rather convenient for coding a plenty of files. From parameters of coding it is possible to change only bitrate and sort of optimization: on speed and on quality. In case of an incorrect frequency rate for selected citrate resampling is made automatically. The program is able to make not only MP3 file and WAV-MP3 - MP3 encapsulated into RIFF format. One of features is the support of special equipment for the hardware coding of a sound made by the corporation Audioactive. "Saving of perfect quality at speed of coding in real time "is named the main advantage of this equipment. However because of increasing capacity of processors this advantage doesn't play a big role for already on Pentium II 266 we can encode a sound in real time using only "program" encoder that makes purchase of additional equipment deprived of sense.

Audioactive Production Studio v.1.5.4

The differences of this encoder from the predecessor are minimum. The code was more modified for reaching the top speed however the quality of coding has suffered. The interface of the program has remained without changes and that is reasonable as it is rather difficult to create something more convenient for coding a plenty of files.

Blade Encoder DLL v.0.91

http://bladeenc.mp3.no/

At researches Blade Encoder was not used itself but its DLL version was. The code of this encoder is based on demonstration variant ISO of the code differing only by significant optimization on speed of coding. As the program operating this library the sound editor GoldWave was taken. There are both possibility of coding of one file and special mode of mass coding in it. As well as for Production Studio there is a possibility to save the encoded signal both as MP3 file and as WAV. However this possibility should be related more to GoldWave than to Blade Encoder. Among changed parameters of coding it is possible to point channel (STEREO or MONO), bit rate and sampling rate of result - it is quite enough for fast obtaining required result. However such quantity of customizations is not enough for the really most experienced user.

GOGO DLL v.2.3 both Lame DLL v.3.7 and Lame DLL v.3.83

These libraries of coding MP3 were used together with the very convenient program Easy a CD - DA Extractor intended for extract of sound tracks from musical compact discs. A basis for all three libraries is the code first GPL of an encoder LAME v.3.5 (Lame Ain't an MP3 Encoder). In turn this code initially is based on the demonstration code ISO that made distribution LAME under GPL impossible. However in May 2000 the last pieces of another code were completely copied and in the code LAME nothing common with ISO remained. It has allowed to begin distribution of the code under GPL however as the source texts on C. This fact is explained rather simply: the creator MP3 IIS Fraunhofer takes the definite sum from all corporations distributing encoders for usage of the standard irrespective from a sort of the license distribution. Thus at distribution of an encoder under GPL the corporation does it to its loss. At the same time initial codes are not an encoder and consequently are not taxed from IIS Fraunhofer. Without violation of the law, the compiled encoder can be used only by person, compiled it. However in the Internet all the same it is possible to find the compiled versions of an encoder. Besides many corporations use the initial codes LAME for inclusion of support MP3 in the products. Thus used libraries have appeared. First of them, under the statement of the writers, differs from LAME v.3.5 only by speed of operation. The rest of them are only the direct usage newer versions of the code LAME as plug-in to Easy CDDA Extractor. The main feature of the LAME code is absolutely new (created by the writers of the project) LAME psychoaucoustic model named G-psycho. It seems to be closer to real features of human hearing than model of competing commercial products.

The library GOGO as parameters of coding allows to point the channel of a signal and the choice is carried out between STEREO, JOINT STEREO and MONO. For every channel it is possible to install sampling rate 44.1 and 22.05. For 44.1 minimum bitrate is 112 Kbps, for 22.05 - 56 Kbps. Besides for every bitrate there is a possibility to include usage VBR. In such case indicated bitrate becomes minimum admitted at coding. Maximum bitrate for VBR is always 320 Kbps.

Libraries LAME have another possibilities. They also allow pointing channel of a signal with similar choice between STEREO, JOINT STEREO and MONO. However irrespective of sampling rate bitrate can receive all spectrums of values from 8 Kbps up to 320 Kbps. Moreover due to support of the extension standard MP3 MPEG Layer 2.5 there are no problems with choice of sampling rate because the sampling rate for coded and original signal will coincide. For any combination channel and bitrate there is a possibility to select a mode of operation of an encoder - maximum qualitative, standard and fast. The features of distinction quality of these modes are given below in the brief description Lame v.3.61.

It is necessary to note one unpleasant feature of LAME DLL v.3.7 - there is bug because of which extra information is added in the end of the encoded signal. It occurs because of buffering writing coded data on disk when instead of the last incomplete information block smaller size of the buffer all buffer is written wholly. However it does not influence quality of coding a signal and therefore it doesn't create problems at realization of researches. Nevertheless after realization of the tests for 256 Kbps it was decided to eliminate this encoder from researches, as the users will hardly begin to use an encoder with such obvious error irrespective from the quality of result given out by it.

Lame v.3.61

This encoder also belongs to the set Lame however as against three previous libraries it is the quite independent cantilever program Win32. For its compilation Microsoft Development Studio v.5.0 was used. The process of compilation has passed perfectly and the successful result was achieved from the first attempt.
More floppy possibilities of customization and support of the mode VBR are the significant differences of an encoder from the same libraries of new versions. Some bewilderment was caused by a special mode of operation "-f". In documentation this mode is indicated as "giving the worst quality of coding at substantial growth of speed of operation". In a reality the speed of coding increased approximately in 3 times however quality often exceeded (!!!) obtained in a mode of operation "-h" according to documentation intended for obtaining the best quality of coding at small speed of operation. The similar legitimacy was saved and for the subsequent versions of the code used at creation of libraries LAME DLL v.3.7 and LAME DLL v.3.83.

L3Enc v.2.61 and MP3 Producer v.2.1 Professional and MP3Enc v.3.0

This bar of encoders represents commercial products of the creator MP3 - IIS Fraunhofer and accordingly is based on the authoring code. The only encoder having any graphic interface is MP3 Producer. However interface is rather inconvenient as for coding a plenty of files there is no possibility to install parameters of coding. There is a possibility to install bitrate and channel for coding one file however installation of a concrete type channel for stereo is carried out automatically depending on bitrate. The two-cantilener encoders have the similar disadvantage. Maximum bitrate of an output file is 256 Kbps, minimum - 18 Kbps for stereo and 8 Kbps for mono.

While the possibilities of customizations parameters of coding for L3Enc and MP3 Producer coincide in main, MP3Enc differs by the greater flexibility. For example it allows to install 10 different degrees of saving of quality (or speed of operation) or to point what frequency filters should previously be applied to an original signal. In the whole set of its customizations coincides with other not less power cantilever encoder Lame v.3.61.

Microsoft WMA

This format of coding audio information became the second attempt of Microsoft to storm the problem of compression a sound. In 1995 the corporation loved by everybody was engaged in active progress and propagation the format of compression Microsoft ADPCM in which only 4 bits was on sample instead of 16. Besides sense of the stored information was considerably changed: if for usual PCM format the value of sample is equally to signal level at the given moment, for ADPCM it became equal to offset of a signal level concerning the previous instant. In the advertisement brochures it was said that the distortions which are brought in such a way of coding are absolutely imperceptible on hearing, however in a reality the frequency higher than 15 kHz were killed on the radical that made this way of compression inapplicable for storage of the sound data CD qualities.

Windows Media Audio was created rather recently and is based on the method of coding similar to MP3. On hearing this format produces practically indistinguishable from MP3 result that has made WMA a rather strong competitor. Therefore it was decided to carry out researches with involvement WMA to estimate which format gives better quality of coding. The absence of support bitrate higher than 160 Kbps and complete uncertainty about the type of channel became the essential disadvantage of WMA. However in case of success of this format Microsoft will certainly improve it that will call instant response from competing IIS Fraunhofer. Thus it is possible to name appearance WMA as the beginning the era of hard competition formats of sound compression that will lead to raise quality of coding.

The encoder WMA differs from encoder's MP3 by its "uniqueness". It is installed with other system codec's Windows 9x as one more format of compression of sounds. After that all based on MCI players start to recognize the new format of the sound data. Besides in similar applications there is a possibility of converting of the sound data in WMA. One of such programs is special plug-in to Winamp "out_wma.dll". With its help WMA files that have taken part in research were created. Their decoding was made with the help GoldWave which can work with the MCI-interface. The signal obtained after decoding passed the processing similar to decoded MP3 signals. As maximum bitrate for WMA is equal 160 Kbps it was decided to carry out small research for quality of coding MP3 with this bitrate. The best encoder by the results of the tests for 128 and 256 Kbps was chosen as the encoder MP3 to which WMA was compared.

SCMPX v.1.5.1

This program as a matter of fact is not an encoder MP3 but a whole miniature studio for playing, creation and processing MP3 and WAV files. The interface is very similar to Winamp however the principle of operation is absolutely another. If Winamp is only "shell" for various plug-in of players, SCMPX is the "monolithic" program, which code of operation with MP3 "is hardwired" rigidly in main executable file.

The program is freeware and the encoder built - in it and decoder MP3 is written by the writer completely independently with usage of documentation ISO on standards MPEG. Thus MP3 received as a result of operation is not connected with IIS Fraunhofer in any way and is compatible only with audio encoders MPEG1 and MPEG2. However luckily MP3 from IIS Fraunhofer also is compatible with these audio decoders. It turns out that SCMPX MP3 and IIS Fraunhofer MP3 are compatible among themselves! It is surprising, isn't it?

However we shall return to possibilities SCMPX. So there is a support of bitrates from 32 Kbps that 320 Kbps, possibility of inclusion VBR, choice the type of channel, method of compression MPEG Layer I, II or III. There are two modes of coding - fast and standard. The speed of coding at usage of the fast mode is approximately twice more than standard. For completion it is necessary to mention the phrase present in the window of choice parameters: "NOTE: For completion of coding a lot of time is necessary!". Nevertheless this speed is two times more GOGO DLL that is one of the fastest researched encoders. For the judges Intel it is necessary to mention support MMX but it is difficult to surprise anybody by this.

Streambox Ripper v.2.009

http://www.streambox.com/Products/Ripper/index.asp

This encoder excels by the very beautiful and rather convenient interface. Except support MP3 there is also support WMA. The quantity of formats files read by it is amazing. Except standard WAV and MP3 it can read out RM, RA, RMM and tracks musical CD. The encoder MP3 is based on the code IIS Fraunhofer L3Enc therefore possibilities at the choice of parameters of coding completely coincide with L3Enc. As well as for MP3Enc and for Lame v.3.61 there is a possibility previously to skip the data through a frequency filter realized as an equalizer. There is a possibility of processing a plenty of files at once and the encoder will independently select all sound files supported by it from the indicated directory and will honesty recode them in the selected format making (where it is necessary) resampling, downmixing and similar to them operation.

XingMPEG Encoder v.2.20

This encoder developed by the corporation XingTech differs from all others by record high speed of coding. The code of compression was developed completely by XingTech similar to how it was made in SCMPX. The interface of an encoder is directed obviously to the side of maximum simplification of life to the user. For this purpose the special concept of the template of compression representing just the defined combination parameters of compression designated by the own name is taken. For coding it is necessary to create new "operation", to select the template for this operation and to enjoy the fast obtained result. Complexity in creation of new templates became the reverse side of simplicity- at first encoder will take the user through long and senseless wizard and only after that will enable of simultaneous editing of all parameters. Complexity is also consists in the fact that except audio this encoder can encode and video and because of that it is necessary to fill in zero in all records concerning to MPEG video-stream. Bitrates from 8 Kbps up to 384 Kbps, three sorts of channel (STEREO, DUAL CHANEL, JOINT STEREO), three sampling rate 32000, 44100 and 48000 and also two sorts MPEG1 Layer II and III are supported.

Results of testing

System configuration on which all tests were carried out:

AMD K6-II 400 (100MHz Bus)
64MB RAM
HDD WD Caviar AC281000
Creative Labs AWE64 (+ TECHNICS Hi-Fi Audio Amp based on Vacuum Triodes)
CD Creative 48x&CDRW Mitsumi 4804TE

As main operating system Windows98SE was used.

The testing was carried out in three "categories": 256 Kbps, 128 Kbps and 160 Kbps and VBR (Near 160). There are no questions with first two categories but it is necessary to explain why the results for 160 Kbps and VBR are joined.

So what does VBR MP3 file represent? It is the usual MP3 file in which each information block can be encoded with different bitrate depending on sort of the sound data in each moment. Thus a little bit better coding of a signal is carried out at a smaller size of the resulting file. As at coding a sound with usage VBR bitrate 112 Kbps was selected as minimum and bitrate 256 Kbps as maximum, average value of bitrate must be about 180 Kbps. However according to the sizes of the obtained files this value was hardly more than 160 Kbps. From the definition VBR it is possible to expect that the quality of coding can be better than at coding with constant bitrate equal to average bitrate VBR as in the case VBR the quality of coding dynamically varies depending on a signal. Therefore it was made a decision to compare VBR MP3 with static MP3 with bitrate 160 Kbps hardly smaller than average bitrate obtained VBR MP3.

All encoders were compared to "ideal" parameters - values ADS and ADW for standard difference. It allows not only to estimate quality of coding (its minimum value) but also to compare it for various bitrates and encoders.

As an encoded signal approximately 55 seconds from the beginning of the song "Master of Puppets" group "Metallica" from an album "S&M" - fulfilment together with the New York symphonic orchestra was chosen. At first the signal was read out from CD with usage Easy a CDDA Extractor then it was cut off till about 55 seconds and it is saved in WAV the file with parameters "PCM 44100 ÊHz16 bit Stereo".

The standard value ADS for such file format is equal 210, ADW - 270. However for each concrete man these values can a little vary, and with the purpose of applicability of results for any man these values were reduced by 20 units. (Unit in this case is equal to a minimum possible difference of a level of signals. For 16 bits of a sound all there are 65536 different levels.)

The quality of coding was estimated not only on ADS and ADW, but also method of direct matching on hearing. Sharp discrepancies of results on ADS and ADW with a subjective estimation of sounding quality it was revealed not.

128 Kbps

The test on average value of a difference initial and encoded signal
0.5 % from a IC	The name of an encoder	Mode of operation	ADS	Max. DS
8,387458855	Audioactive Production Studio v.1.0.5	Fast mode	1132,64341	11810
8,431893047	Audioactive Production Studio v.1.0.5	High Quality	1126,674633	11810
8,387458855	Audioactive Production Studio v.1.5.4	Fast mode	1132,64341	11810
8,431801853	Audioactive Production Studio v.1.5.4	High Quality	1126,686818	11810
5,952169153	Blade Encoder DLL v.0.91		1596,056791	41794
7,914958894	GOGO DLL v.2.3		1200,258918	18515
7,832799174	L3Enc v.2.61	Fast mode	1212,848662	12322
8,304373414	L3Enc v.2.61	High Quality	1143,975533	15099
10,76754231	Lame DLL v.3.83	Fast mode	882,2811863	12686
8,286922549	Lame DLL v.3.83	High Quality	1146,384553	13870
10,74184197	Lame v.3.61	Fast mode	884,3920837	13706
8,507245596	Lame v.3.61	High Quality	1116,695162	13343
8,439034489	MP3 Producer v.2.1 Professional		1125,721196	11810
8,052126934	MP3Enc v.3.0	Fastest encoding (qual=0)	1179,812499	16034
9,418877487	MP3Enc v.3.0	Higest quality (qual=9)	1008,612758	16038
3,85164398	SCMPX v.1.5.1	Fast mode	2466,479262	43422
6,828521907	SCMPX v.1.5.1	Standard mode	1391,223478	27588
7,884195505	Streambox Ripper v.2.009		1204,942216	13884
3,201008759	XingMPEG Encoder v.2.20		2967,814434	46834
8,234275459	Microsoft WMA 128 Kbit/s		1153,714136	30339

The test on average value of a difference of the form initial and encoded signal
0.5 % from an IC	The name of an encoder	Mode of operation	ADW	Max. DW
14,83302029	Audioactive Production Studio v.1.0.5	Fast mode	842,7144137	12246
14,70365711	Audioactive Production Studio v.1.0.5	High Quality	850,1286386	14712
14,83302029	Audioactive Production Studio v.1.5.4	Fast mode	842,7144137	12246
14,70364969	Audioactive Production Studio v.1.5.4	High Quality	850,1290673	14712
9,966820769	Blade Encoder DLL v.0.91		1254,16121	33146
13,85790818	GOGO DLL v.2.3		902,0120382	13906
14,37163281	L3Enc v.2.61	Fast mode	869,7689514	13322
14,47373372	L3Enc v.2.61	High Quality	863,6334093	15433
15,85743072	Lame DLL v.3.83	Fast mode	788,2739783	19324
14,31173521	Lame DLL v.3.83	High Quality	873,4091161	15068
15,85460268	Lame v.3.61	Fast mode	788,4145855	20801
14,47357785	Lame v.3.61	High Quality	863,64271	12747
14,71441298	MP3 Producer v.2.1 Professional		849,5072159	14712
15,35612454	MP3Enc v.3.0	Fastest encoding (qual=0)	814,0074644	20783
15,25961131	MP3Enc v.3.0	Higest quality (qual=9)	819,1558583	22686
5,419446852	SCMPX v.1.5.1	Fast mode	2306,508458	46818
9,662618294	SCMPX v.1.5.1	Standard mode	1293,645223	28792
14,76722933	Streambox Ripper v.2.009		846,4688753	14258
5,172680672	XingMPEG Encoder v.2.20		2416,541981	43411
15,75982322	Microsoft WMA 128 Kbit/s		793,1561052	13227

% From a IC	The name of an encoder	Mode of operation
26,6249730	Lame DLL v.3.83	Fast mode
26,59644465	Lame v.3.61	Fast mode
24,6784888	MP3Enc v.3.0	Higest quality (qual=9)
23,99409867	Microsoft WMA 128 Kbit/s
23,40825147	MP3Enc v.3.0	Fastest encoding (qual=0)
23,22047915	Audioactive Production Studio v.1.0.5	Fast mode
23,22047915	Audioactive Production Studio v.1.5.4	Fast mode
23,15344747	MP3 Producer v.2.1 Professional
23,13555016	Audioactive Production Studio v.1.0.5	High Quality
23,13545155	Audioactive Production Studio v.1.5.4	High Quality
22,98082345	Lame v.3.61	High Quality
22,77810714	L3Enc v.2.61	High Quality
22,65142483	Streambox Ripper v.2.009
22,59865776	Lame DLL v.3.83	High Quality
22,20443198	L3Enc v.2.61	Fast mode
21,77286707	GOGO DLL v.2.3
16,4911402	SCMPX v.1.5.1	Standard mode
15,91898992	Blade Encoder DLL v.0.91
9,271090832	SCMPX v.1.5.1	Fast mode
8,373689431	XingMPEG Encoder v.2.20

So, the results of researches for most popular in Internet bitrate are rather unfavorable. The quality of coding even for the leader Lame DLL v.3.83 Fast Mode in 4 times is worse than the measurement standard. It does not mean that to distinguish on hearing an encoded signal from initial it is possible without problems - the study of psychoacoustic model plays a huge role there. Nevertheless probability of various sounding signals is very high and saving "CD qualities" is impossible.

Having a look at the results from the point of view of the used codes it is possible to note such legitimacy: Lame in the fast mode gives about 26,5 %, newer code from IIS Fraunhofer MP3Enc gives 2 % of backlog (93 % Lame) with a quality factor 9 and 3 % (88 % Lame) with a quality factor 0. Numerous clones of the first variant of the code IIS Fraunhofer and Lame in the mode of excellence follows further - backlog 3-5 % (80-88 % Lame). Based on the specification ISO encoders lag behind on 10 % (62 % Lame) and close the table as much as possible optimized on speed SCMPX in the fast mode and code from XingTech - backlog on 18 % (32 % Lame) that is more than three times. The code WMA from Microsoft on a level is approximately equal to best of encoders IIS Fraunhofer - 2,5 % (90 % Lame) backlog from Lame. For first "feeler" in the field of high quality formats compression of sounds it more than well.

256 Kbps

The test on average value of the difference between initial and encoded signal
0.5 % from a IC	The name of an encoder	Mode of operation	ADS	Max. DS
21,32965101	Audioactive Production Studio v.1.0.5	Fast mode	445,3893781	7901
21,02348122	Audioactive Production Studio v.1.0.5	High Quality	451,875686	6992
20,81670181	Audioactive Production Studio v.1.5.4	Fast mode	456,3643215	7529
21,02348122	Audioactive Production Studio v.1.5.4	High Quality	451,875686	6992
27,7670984	Blade Encoder DLL v.0.91		342,1315351	7884
30,7532969	GOGO DLL v.2.3		308,9099693	6218
19,86302313	L3Enc v.2.61	Fast mode	478,275635	7450
19,87954391	L3Enc v.2.61	High Quality	477,8781667	7614
31,43990903	Lame DLL v.3.7	Fast mode	302,1637242	5443
31,11695484	Lame DLL v.3.7	High Quality	305,2997971	5845
31,66898003	Lame DLL v.3.83	Fast mode	299,9780855	4079
31,44892226	Lame DLL v.3.83	High Quality	302,0771243	5095
31,43638043	Lame v.3.61	Fast mode	302,1976407	5443
31,17300786	Lame v.3.61	High Quality	304,7508294	5845
21,01846213	MP3 Producer v.2.1 Professional		451,9835913	6992
28,61576953	MP3Enc v.3.0	Fastest encoding (qual=0)	331,9847817	7904
28,95767814	MP3Enc v.3.0	Higest quality (qual=9)	328,0649765	15952
27,67907328	SCMPX v.1.5.1	Fast mode	343,2195833	8029
27,30435172	SCMPX v.1.5.1	Standard mode	347,9298867	6745
23,49827965	Streambox Ripper v.2.009		404,2849154	7050
13,44016639	XingMPEG Encoder v.2.20		706,8364874	7389

The test on average value the difference of the form between initial and encoded signal
0.5 % from a IC	The name of an encoder	Mode of operation	ADW	Max. DW
39,47244082	Audioactive Production Studio v.1.0.5	Fast mode	316,6766417	10866
39,46238636	Audioactive Production Studio v.1.0.5	High Quality	316,7573265	10204
38,78497082	Audioactive Production Studio v.1.5.4	Fast mode	322,2897874	10866
39,46238636	Audioactive Production Studio v.1.5.4	High Quality	316,7573265	10204
31,59284232	Blade Encoder DLL v.0.91		395,6592405	12022
41,61197952	GOGO DLL v.2.3		300,3942649	7278
37,96645774	L3Enc v.2.61	Fast mode	329,2379839	10994
38,15367232	L3Enc v.2.61	High Quality	327,6224604	10170
42,1547439	Lame DLL v.3.7	Fast mode	296,5265316	5055
42,40444562	Lame DLL v.3.7	High Quality	294,7804132	6328
42,46338768	Lame DLL v.3.83	Fast mode	294,371238	5162
42,91559297	Lame DLL v.3.83	High Quality	291,2694229	6679
42,15038792	Lame v.3.61	Fast mode	296,5571758	5055
42,46722934	Lame v.3.61	High Quality	294,3446087	6328
39,45317535	MP3 Producer v.2.1 Professional		316,8312788	10204
39,62166166	MP3Enc v.3.0	Fastest encoding (qual=0)	315,4839923	11886
35,17706485	MP3Enc v.3.0	Higest quality (qual=9)	355,3451675	14338
30,28484091	SCMPX v.1.5.1	Fast mode	412,7477518	14292
30,51430423	SCMPX v.1.5.1	Standard mode	409,6439461	8580
38,58096304	Streambox Ripper v.2.009		323,993986	10979
34,69107951	XingMPEG Encoder v.2.20		360,3231775	11778

% From a IC	The name of an encoder	Mode of operation
74,36451524	Lame DLL v.3.83	High Quality
74,13236771	Lame DLL v.3.83	Fast mode
73,6402372	Lame v.3.61	High Quality
73,59465293	Lame DLL v.3.7	Fast mode
73,58676835	Lame v.3.61	Fast mode
73,52140046	Lame DLL v.3.7	High Quality
72,36527642	GOGO DLL v.2.3
68,23743119	MP3Enc v.3.0	Fastest encoding (qual=0)
64,13474299	MP3Enc v.3.0	Higest quality (qual=9)
62,07924269	Streambox Ripper v.2.009
60,80209183	Audioactive Production Studio v.1.0.5	Fast mode
60,48586758	Audioactive Production Studio v.1.0.5	High Quality
60,48586758	Audioactive Production Studio v.1.5.4	High Quality
60,47163748	MP3 Producer v.2.1 Professional
59,60167264	Audioactive Production Studio v.1.5.4	Fast mode
59,35994072	Blade Encoder DLL v.0.91
58,03321624	L3Enc v.2.61	High Quality
57,9639142	SCMPX v.1.5.1	Fast mode
57,82948087	L3Enc v.2.61	Fast mode
57,81865595	SCMPX v.1.5.1	Standard mode
48,1312459	XingMPEG Encoder v.2.20

First seen is more than triple exceeding of a lower rod quality 128 Kbps while at a rate the difference is doubled. Besides practical impossibility to tell on hearing an encoded signal with bitrate 256 Kbps from initial gets the explanation. At a level of change in 1,4 times more than standards psychoacoustic model brings to nothing all distinction of signals. Thus it is possible to maintain that 99 % of all signals encoded with bitrate 256 Kbps on hearing ñan't be told from initial. However as the coding gives result a little bit worse standard, it is possible to find such signal, which can be told from initial on hearing but it will be improbable to meet such signal among usual music.

Continuing to examine results from the point of view the used codes for this bitrate it is possible to state a complete victory Lame - 74,4 %, and in this case mode of excellence really gives a little bit better result than fast (except LAME DLL v.3.7, where bug appeared especially strongly in the high-quality mode - not only the end of the file suffered but also its beginning). However this difference is minimum - 0,2 % and in this case it is reasonable to prefer the greater speed of coding. As well as for 128 Kbps the newer code from IIS Fraunhofer follows Lame, however percent of backlog this time is greater - for a quality factor 0 it is 6 % (92 % Lame) and for a quality factor 9 - 10 % (86 % Lame). Older code from IIS Fraunhofer - backlog about 12.5 % (82 % Lame) follows further. Then there are ISO encoders and the first variant equal to them of the code IIS Fraunhofer L3Enc - 15 % of backlogs (78 % Lame). The list is closed by the champion on speed from XingTech - 25 % of backlog (66 % Lame).

In results the defined legitimacy is visible. For bitrate 256 Kbps the code from IIS Fraunhofer gives better quality in faster mode of coding and Lame - in slower one. For bitrate 128 Kbps the situation is fulfilled just what isn't needed. From here it is possible to make output: the code IIS Fraunhofer is optimized for operation on low bitrates and Lame - on high ones. Nevertheless psychacoustic the model Lame in the fast mode (and therefore with a plenty of the disconnected filters) all the same has become more suitable for coding with bitrate 128 Kbps despite of obvious non-optimized under this bitrate. On the other hand code from IIS Fraunhofer it is not optimized under bitrate 256 Kbps and consequently the disconnecting a part of psychoacoustic model lead to improvement quality of coding - model is too strict for this bitrate.

160 Kbps and VBR (Near 160)

The test on average value of the difference between initial and encoded signal
0.5 % from a IC	The name of an encoder	Mode of operation	ADS	Max. DS
14,65488947	Lame DLL v.3.83	Fast mode	648,2478098	9864
10,6639622	Lame DLL v.3.83	High Quality	890,8508699	14324
10,33345958	Microsoft WMA 160 KBit/s		919,3436068	30115
11,49889133	GOGO DLL v.2.3		826,1666042	10077
9,604511118	Lame v.3.61		989,1185385	13705
11,93326542	SCMPX v.1.5.1	Fast mode	796,0939162	18732
12,36357401	SCMPX v.1.5.1	Standard mode	768,3862279	12723

The test on average value of a difference of the form initial and encoded signal
0.5 % from a IC	The name of an encoder	Mode of operation	ADW	Max. DW
20,87581611	Lame DLL v.3.83	Fast mode	598,7789858	13874
17,96869635	Lame DLL v.3.83	High Quality	695,6542511	11807
19,26450415	Microsoft WMA 160 KBit/s		648,8617566	12232
20,62740562	GOGO DLL v.2.3		605,9899259	13249
17,13079205	Lame v.3.61		729,6802136	12902
15,74840548	SCMPX v.1.5.1	Fast mode	793,7311505	22850
16,83297038	SCMPX v.1.5.1	Standard mode	742,5902688	14683

% From a IC	The name of an encoder	Mode of operation
35,53070558	Lame DLL v.3.83	Fast mode
32,12629694	GOGO DLL v.2.3
29,59796372	Microsoft WMA 160 KBit/s
29,1965444	SCMPX v.1.5.1	Standard mode
28,63265854	Lame DLL v.3.83	High Quality
27,6816709	SCMPX v.1.5.1	Fast mode
26,73530317	Lame v.3.61
The note: the results of testing with VBR are marked red

Recently coding MP3 with VBR becomes very popular. It is considered that the resulting quality for the file will be better than for similar on a size but with hard bitrate as one set parts of a signal hard bitrate is more than enough for coding but for another set hard bitrate is not enough. Arranging bitrate on the file depending on a level of the requirements it is theoretically possible to achieve better quality of coding. Is it really so? The results show that it is not. Though all VBR MP3 were of a little bit greater size than their static colleagues, the least maximum level of distîrtions has remained for Lame - in 2.8 times worse than measurement standard. Comparing results from 128 Kbps we receive improvement a lower rod of quality in 1,45 times at increase of a file size in 1,25 times.

Relative insignificance of this bitrate proves the fact of the significant superiority quality of coding the fast mode above high quality. Microsoft WMA has lagged behind from Lame on 6 % (83 % Lame) that again confirms gravity of intentions Microsoft concerning high-quality audio compression.

320 Kbps (that "ideal")

The test on average value of the difference between initial and encoded signal
0.5 % from a IC	The name of an encoder	Mode of operation	ADS	Max. DS
49,97117978	Lame DLL v.3.83	Fast mode	190,10958	2818
52,31987278	Lame DLL v.3.83	High Quality	181,5753651	3458

The test on average value the difference of the form initial and encoded signal
0.5 % from IC	The name of an encoder	Mode of operation	ADW	Max. DW
65,76225263	Lame DLL v.3.83	Fast mode	190,0786469	3261
68,36263959	Lame DLL v.3.83	High Quality	182,8484107	3976

% From a IC	The name of an encoder	Mode of operation
120,6825124	Lame DLL v.3.83	High Quality
115,7334324	Lame DLL v.3.83	Fast mode

These two tests were carried out only out of curiosity with the purpose to determine whether it is possible with using of coding MP3 to receive ADS and ADW equal or smaller standard. As it is found out it is possible however compression in such case is carried out in 4.5 times from initial WAV file. The difference between the mode of coding excellence and fast coding became even more significant than for bitrate 256 Kbps - 5 %, however for values quality of coding more than 100 % any differences are absent or infinitesimal as human hearing can't be able to note the difference between two such signals irrespective of character distortions.

Summary

So, we shall summarize testing. As the best encoder it is possible without any doubts to name Lame DLL v.3.83 - that is newest version of the code Lame (dated back by June 2000). For low bitrates (lower than 192 Kbps) it is necessary to use the fast mode of coding as psychoacoustic model G-psycho differs by complexity (as well as human brain, it is the model of its sensitivity to sounds) absolutely unnecessary on low bitrates when a bar of the data hardly suffices on transmission of low frequencies, let alone high and "thin" differences between them. At usage of the fast mode the most part of complex logic G-psycho is disabled and though the encoder starts mercilessly to cut frequencies the result all the same turns out on sounding a little bit better than for the competitors. For maximum bitrates G-psycho even in the fast mode is rather exact, as it is not necessary for it to try to be laid in close frameworks bitrate. However complete G-psycho will produce a little bit better result but this improvement does not cost decreases three times the speed of coding.

The encoder from XingTech has become the worst.. At optimizing the code on speed the writers too simplified psychoacoustic model for what the encoder has paid by bad quality. So MP3 file obtained by this encoder with bitrate 256 Kbps on quality is approximately equal MP3 to the file obtained with the help Lame DLL v.3.83 with bitrate192 Kbps.

Best of VBR encoders was GOGO DLL v.2.3 however usage VBR is set under the large question as smaller on a size MP3 with fixed bitrate was more qualitatively VBR MP3. However between GOGO and LAME there is a large difference and for static bitrate component about 1.25 times for bitrate 128 Kbps. Therefore bad result for VBR can be related simply to absence of its support by a good encoder.

The separate mention is deserved by the format Microsoft WMA. As a whole it has become even better than its direct competitor- MP3 from IIS Fraunhofer having lost only to advanced Lame.

By the obtained results it is also possible to outline frameworks of applicability each of bitrates. Very first of "CD qualitative" (128 Kbps) unfortunately is good only by small size of files - compression in 11 times. Therefore it can be recommended to pass a sound through networks with small capacity, that Internet for example is. Bitrate160 Kbps sounds a little better than 128 Kbps and consequently it is possible to recommend it for storage of "ephemeral" music - the quality is quite good for organization of discos however insufficient for the really requiring judge of a digital sound. Bitrate 256 Kbps is necessary for saving that "CD qualities" - in overwhelming majority of cases to differ the encoded information from the original it will be impossible. Bitrate 320 Kbps was used only when coding with usage 256 Kbps give heard distortions of a signal. However such situation is rather improbable.

Write a comment below. No registration needed!