I can't say that the whole world was waiting with trepidation for the new processor core from the well-known and diversifying manufacturer of various peripheral controllers, mainboards, wall and table radiators and even chipsets - Taiwanese company VIA Technologies. Nevertheless, there was a certain interest toward this event, especially from reviewers of computer hardware who were tired to describe and measure in scores, fps and seconds their delight about another several hundreds of Megahertz of an ordinary top processor which however hadn't suffered much without them. And they were interested just for the sport of it. You can even stake on it: will all x86-applications finally work on the originally x86-compatible processor? And other questions alike. We have satisfied our interest and now suggest that you do the same. VIA C3 «Nehemiah»It's impossible to recognize the new core according to the top cover mark (the suffix «A» implies only the presence of L2 cache - like the first Celeron processors which were coupled with the L2 cache did)To brush up the details of the architecture of the VIA C3 processors look through the comparison of the low-end solutions which, in particular, demonstrates the 1GHz VIA C3 processor based on the Ezra-T core (which is participating in the today's tests as well). The new-comer has some fundamental differences from its predecessors: it's the first time since Samuel2/Ezra/Ezra-T when the core has really significant differences. The basic characteristics of the Nehemiah are the following:
The core has some more architectural peculiarities but in our case too detailed description will be hardly interesting. Now let's look at the differences between the Nehemiah core and what it can be compared to. It looks perfect when compared with the Ezra-T: extended and improved caches, full-speed FPU unit, and SSE support promise a good performance gain. (Note that possibility of the processor to execute SSE instructions does not guarantee a certain speed of their execution, but, still, we can count on a certain performance increase, otherwise such support makes no sense.) The extended pipeline implies a higher frequency potential (also meaning overclocking). However, there are some unpleasant moments. First of all, the heat dissipation of the processor has increased markedly: although the 1GHz model dissipates only 15 W (comparing with 27.8 W of the Celeron Tualatin 1A GHz and 46.1 W of the Duron Morgan 1 GHz), it requires active cooling, thus killing the first key point of the C3 processors. Secondly, the Nehemiah requires Tualatin-compatible mainboards, i.e. it doesn't suit, for instance, for upgrade of the i440BX based system. Theoretically, it's possible to construct adapters for attaching Nehemiah processors to old boards, but I think no one will mess up with it. OverclockingNow, to make up for the sad news from the architectural front let's turn to the favorite entertainment of computer fans: overclocking. The Celerons were twiceclocked so many times already, that kind bridges on the Athlons had really entered our lives, and now we have the VIA's products to mock at! Forget about overclocking effectiveness - we are going to work, not to calculate the percentage gain. :) Apart from FSB overclocking which mainly depends on capabilities of the mainboard (if other components, for example, memory, do not put obstacles), there is also a CPU multiplier at our disposal. It's locked in the C3 processors on the MSR level (special internal CPU registers), not on the hardware level, and, actually, it's possible to use software to unlock. The changes will remain until the next reboot, that is why it's rational to make them through the BIOS (the edited one) or through the program that starts with the system. But there is a better way! On the CPU's back, right above the marked die, there are some peculiar bridges. Linking up the bridges (with a lead pencil, special conductive adhesive, varnish, solder etc.) will set a respective bit in the MSR register to «1» when the power is supplied, while disconnecting them (not recommended for pins linked up at the factory because of possible mechanical damages) will set it to «0». After some experiments with the bridges we managed to lift up the multiplier: when all the bridges were joined together the internal multiplier turned into «x16», but since it's out of use (at least this moment) our tested model started at 133x9=1200 MHz. Increasing the bus frequency, we reached on the ASUS TUSL2-C mainboard a maximum of 148x7.5=1100 MHz, and 144x9=~1300 MHz for the processor unlocked to x9 multiplier. So, the overclocking potential of the new core is higher than of the old one, but there's nothing to compare with Tualatins that start with the 100 MHz bus, but end with the 150+ MHz, thus giving the 50+% effect. Performance estimationTestbed:
Software:
Test applications:
Note that we didn't test the overclocked version of the new C3, because it really interesting to compare Nehemiah with its predecessor, which was tested of 1 GHz frequency. As for comparison with the Celeron (Tualatin)... well, you'll see it with you own eyes on the diagrams. We will return to this point later to make a conclusion. Also, it was quite interesting to use in tests a new board from VIA (VPSD) based on the CLE266 chipset. Our regular readers are familiar with the performance of this chipset already, and now VIA demonstrates the general-purpose model of the mATX format. Due to the peculiarities of the CLE266 (weak integrated video (for games) and lacking support of an external AGP port), this board is doomed to budget office or niched home computers, though only tests will clear all the things up. Test resultsLet's start with CPU RightMark and see what talents the new C3 core can boast of. Differential equations solving which use only ordinary x87 and MMX instructions shows the effect of the twice speeded Nehemiah FPU perfectly clear (almost 1.5 times), but you can also notice that even the new C3 has still a long way to go to catch up with the Tualatin. At rendering the Nehemiah's FPU has a 1.5 times better performance again; the SSE instructions are executed indeed, with the speed comparable to the Celeron. Sorry, but even with SSE, even considering a higher frequency of the Intel's processor, Tualatin is still twice faster, no doubt. Note that the C3 Ezra-T failed this test as the Lame codec encountered some errors (according to the programmers of the codec - on the standard MMX code). Here C3 demonstrates a weak calculation speed comparing with the Celeron Tualatin (as we already estimated in CPU RightMark) and the CLE266 chipset has a negative effect on the speed, even though video is not used in this test and the DDR bandwidth on this chipset is theoretically twice higher. The scores obtained in video stream encoding into the MPEG4 format are not dull at all. Two hours on the Ezra-T, an hour on the Nehemiah and half an hour on the Tualatin give a clear picture. There is one more factor that affects performance of the tested systems: either core of the C3 has a very low memory exchange rate. Writing memory keeps all models look equal, and even Celeron is a looser (~200 MB/s against ~230), while for reading - Intel's processor reads data at 750 MB/s which is almost 2.5 times faster: C3 doesn't get a real benefit even of the theoretical CLE266+DDR266 bandwidth. Moreover, continuous videomemory calls under Windows GUI, though they take less than a percent of the overall time, contribute to the low speed of the system. It's obvious that the chipset would work much better with Intel's processors (of course, when there's no need in gaming or other 3D performance). We'll comment results of these two very different applications together because there is one odd fact that brings them together. The Celeron is, as always, far ahead, though the gap is not so big in WinAce, because the test highly depends on the memory speed. The CLE266 slows down the Nehemiah, very usual, though the slowdown never exceeds 5%. But the last couple performs in quite a strange manner: Ezra-T is almost on the same level as Nehemiah. Hmm, the new core must be superior, at least, due to the FPU speed - final rendering is craving for resources of the coprocessor. Equal results mean that the processor has nothing to do with it. Well, I can assume that productivity of these two applications is limited exactly by the low memory read speed on the C3 processor. As we pointed in our previous reviews of Tualatin core, there was very close situation with rendering, though the scores were higher. OK, now we see Tualatin is ~2.5 times (~750/~300 ?) faster in 3ds max (that require quick reads from memory mostly) and a bit lower in WinAce (where memory writes take considerable time). In the synthetic benchmark SYSmark the Nehemiah raised the standard of its ally who fell down in the fight against the test packet's script. Results look pretty good, but they don't have much common sense, because the final scores are just scores, not the real seconds of application response on the user's actions. Comparison of the C3 and Celeron in games will hardly give us more material to discuss. Now it's time to test and discuss the integrated video of CLE266 (CastleRock). The lack of Hardware T&L is well noticeable in the general 3DMark scores and it allows us to estimate a performance level in modern games developed for HTCL. However, in certain games with the software emulation it's possible to reach the level of a certain, quite low, degree of playability. As there is no standard OpenGL driver for CastleRock, Return to Castle Wolfenstein and DroneZ refused to start running on this system. Besides, I would attract your attention to the fact that high resolution and hard graphic settings in DroneZ slightly drops performance of the Nehemiah thanks to increased computing power of the new core comparing with the old one. VIA C3M266-L coupled with the Nehemiah hit only 11.5 fps in Serious Sam: The Second Encounter (Speed@800x600x16), while we were able to see any movie either DVD or MPEG4 at highest quality without any delays. So, taking VIA's solutions as a single whole, the C3 «Nehemiah» + VIA C3M266-L can make a good budget system or a TV console. ConclusionWe didn't aim to maintain purity and academic precision of the tests comparing processors of the different architecture running at different frequencies. The matter is that the VIA C3 processor comparing with the Intel Celeron, remains «faceless» (it's noticeable even without frequency adjustment), though it was significantly improved. Moreover, I don't think it really makes sense to compare the new C3 with the old one, because considering Celeron scores I don't think that it makes any sense at all to pay attention to performance of VIA's processor. This is the only, unique, modern x86 processor which de facto doesn't have this attribute - performance. :) The price of the C3 Nehemiah tends to be about $30, that is too close to $40 for the Celeron Tualatin already available at 1.4 GHz. At the same time, the new C3 has much lower... OK, has NO performance, and only gives a chance to save electric power. But we were really happy that VIA finally released a processor which executes the x86 code flawlessly. The SSE instructions work and do this perfectly! Had this processor entered market a year ago, it could have become a noticeable player. But today we can only discuss technical advantages and disadvantages of
the core, not speak about market share and production news... OK, let's
take the added SSE instruction execution unit and the removed (according
to flags of the MSRs) 3DNow! instruction execution unit. We still don't have
complete version of the VIA's documentation at the moment, but we are
aware of the modern tradition to use internal processor's units which translate
external instructions to the quite different internal ones to execute, so
linking this tradition to the fact that one execution unit appears while another
disappears, as well as considering VIA's tending to REuse everything over
and over again, we finally want to ask: whether anything appeared or disappeared
at all? Maybe, C3 has one special universal unit that executes an internal
RISC-like microcode and Nehemiah use new translator for it that differs from
the Ezra's one? This is just an assumption, but, still... Think of it, not of market prospects.
Write a comment below. No registration needed!
|
Platform · Video · Multimedia · Mobile · Other || About us & Privacy policy · Twitter · Facebook Copyright © Byrds Research & Publishing, Ltd., 1997–2011. All rights reserved. |