In the previous article, devoted to benchmarking an engineering sample of the quad-core AMD Phenom, we failed to dot all the i's and cross all the t's, because that processor has a bug officially acknowledged by AMD. We could benchmark Phenom either in a reduced-performance mode (with a patched BIOS that disables some processor units) or without the patch, which left those units open to potential errors in operation. Our testbed did not freeze even once, and the unpatched mode in theory shows the real performance of future "debugged" processors, but the article still left an impression of being incomplete. Now we have an AMD Phenom X4 9850 of the B3 stepping, in which this bug is fixed. The processor operates at 2.5 GHz, has an improved integrated Northbridge (2 GHz), and even supports DDR2-1066. To all appearances, it is the fastest solution AMD can offer at the moment, so it will be interesting to compare the Phenom X4 9850 with Intel processors. We've selected four dual- and quad-core Intel processors based on the Core 2 architecture, operating at about 2.5 GHz, with both new and old cores. Phenom will thus be surrounded by its likely direct competitors, which should help us analyze its performance properly.
If you are interested in technical details about the architecture of the new AMD processor and its notorious bug, you can read our two previous articles: "Detailed Platform Analysis in RightMark Memory Analyzer. Part 15: AMD Phenom X4" and "Bug in AMD Phenom X4 Processors. Effect of AMD's Patch on Low-Level Characteristics of Processor and Platform." Here we'll just give a brief list of what distinguishes the Phenom (K10) architecture from AMD's previous architecture (Athlon 64 X2 / K8).
- 128-bit (versus 64-bit in AMD K8) floating point (FP) execution units
- L1-LSU (Load-Store Unit) bus is expanded to 2x128 bit (read) and 2x64 bit (write)
- L1-L2 cache bus in the processor core is expanded to 128 bit
- Data prefetch into L1 cache
- Shared exclusive (non-inclusive) L3 instruction/data cache, located in the integrated memory controller
- Integrated dual-channel memory controller (2x64 bit, support for ganged or unganged modes) for DDR2 and DDR3 memory (only DDR2 in the first processors)
- Improved branch prediction unit, which can now predict indirect branches
- Sideband Stack Optimizer, a part of the decoder (similar to Stack Pointer Tracker in the Intel Core architecture)
- Significantly improved (faster) execution of SSE instructions
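As a rough illustration of what the bus widths in this list amount to, the sketch below converts them into theoretical peak bandwidths at the 2.5 GHz core clock of the Phenom X4 9850. It rests on a simplifying assumption of ours, not a figure from AMD: each bus moves its full width once per core clock cycle. The helper name `peak_gb_per_s` is hypothetical, and real measured throughput will be lower than these upper bounds.

```python
# Theoretical peaks derived from the bus widths listed above.
# Assumption (ours, not the article's): each on-chip bus transfers its full
# width once per core clock cycle, so these are upper bounds only.

CORE_CLOCK_HZ = 2.5e9  # Phenom X4 9850 core clock


def peak_gb_per_s(bits_per_transfer: int, transfers_per_s: float) -> float:
    """Convert bus width and transfer rate to GB/s (1 GB = 1e9 bytes)."""
    return bits_per_transfer / 8 * transfers_per_s / 1e9


l1_read = peak_gb_per_s(2 * 128, CORE_CLOCK_HZ)  # two 128-bit loads per cycle
l1_write = peak_gb_per_s(2 * 64, CORE_CLOCK_HZ)  # two 64-bit stores per cycle
l1_l2 = peak_gb_per_s(128, CORE_CLOCK_HZ)        # 128-bit L1-L2 bus

# Dual-channel DDR2-1066: 2 channels x 64 bits, 1066 million transfers/s.
memory = peak_gb_per_s(2 * 64, 1066e6)

print(f"L1 read, per core:  {l1_read:.1f} GB/s")
print(f"L1 write, per core: {l1_write:.1f} GB/s")
print(f"L1-L2, per core:    {l1_l2:.1f} GB/s")
print(f"DDR2-1066, dual ch: {memory:.1f} GB/s")
```

The gap between the per-core cache figures and the memory figure, which all four cores share, is one reason the shared L3 cache and the L1 prefetch matter in this design.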
Well, it all sounds great. And now let's see how well it works in practice...
Hardware and Software
Testbed configurations
CPU | Motherboard | Memory | Video
Intel Core 2 Duo E6600 | ASUS Maximus Extreme | Corsair CM3X1024-1800C7DIN | GeForce 8800 GTX
Intel Core 2 Duo E7200 | ASUS Maximus Extreme | Corsair CM3X1024-1800C7DIN | GeForce 8800 GTX
Intel Core 2 Quad Q6600 | ASUS Maximus Extreme | Corsair CM3X1024-1800C7DIN | GeForce 8800 GTX
Intel Core 2 Quad Q9300 | ASUS Maximus Extreme | Corsair CM3X1024-1800C7DIN | GeForce 8800 GTX
AMD Phenom X4 9850 | ASUS M3A32-MVP Deluxe | Corsair TWIN2X4096-9136C5DF | GeForce 8800 GTX
- Memory: 4 GB (4 x 1 GB modules)
- HDD: Samsung HD401LJ (SATA-2)
- Coolers: Thermaltake TMG i1, Thermaltake TMG A1
- Power supply unit: Cooler Master RS-A00-EMBA
Processor | Phenom X4 9850 | Core 2 Duo E6600 | Core 2 Duo E7200 | Core 2 Quad Q6600 | Core 2 Quad Q9300
Core | Agena | Conroe | Wolfdale | Kentsfield | Yorkfield
Process technology, nm | 65 | 65 | 45 | 65 | 45
Core clock, GHz | 2.5 | 2.4 | 2.53 | 2.4 | 2.5
# of cores | 4 | 2 | 2 | 4 | 4
L1 cache, I/D, KB* | 64/64 | 32/32 | 32/32 | 32/32 | 32/32
L2 cache, KB** | 4x512 | 4096 | 3072 | 8192 | 6144
L3 cache, KB | 2048 | - | - | - | -
FSB clock***, MHz | 533 (1066) | 266 (1066) | 266 (1066) | 266 (1066) | 333 (1333)
Multiplier | 12.5 | 9 | 9.5 | 9 | 7.5
Socket | AM2+ | LGA775 | LGA775 | LGA775 | LGA775
Heat dissipation**** | 125 W | 65 W | 65 W | 95 W | 95 W
* Per single core in multi-core processors
** "X x Y" means "Y KB for each of X cores"
*** For AMD processors, this is the memory controller bus frequency
**** Specified differently in Intel and AMD processors, so a direct comparison would be incorrect
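The core clocks in the table follow from the multiplier and a reference clock, which can be checked with a short sketch. Two values here are assumptions not taken from the table: the Intel FSB clocks are treated as the nominal 800/3 and 1000/3 MHz (listed above as 266 and 333), and the Phenom multiplier is applied to a 200 MHz reference clock rather than to the 533 MHz memory controller figure shown in the FSB row. The names `CPUS` and `core_clock_mhz` are purely illustrative.

```python
from fractions import Fraction

# name -> (reference clock in MHz, multiplier)
# Assumptions: nominal Intel FSB clocks of 800/3 and 1000/3 MHz, and a
# 200 MHz reference clock for the Phenom (not the 533 MHz memory bus value).
CPUS = {
    "Phenom X4 9850": (Fraction(200), Fraction("12.5")),
    "Core 2 Duo E6600": (Fraction(800, 3), Fraction(9)),
    "Core 2 Duo E7200": (Fraction(800, 3), Fraction("9.5")),
    "Core 2 Quad Q6600": (Fraction(800, 3), Fraction(9)),
    "Core 2 Quad Q9300": (Fraction(1000, 3), Fraction("7.5")),
}


def core_clock_mhz(name: str) -> Fraction:
    """Core clock = reference clock x multiplier, kept exact as a Fraction."""
    reference_mhz, multiplier = CPUS[name]
    return reference_mhz * multiplier


for cpu in CPUS:
    print(f"{cpu}: {float(core_clock_mhz(cpu)) / 1000:.2f} GHz")
```

Exact fractions avoid the rounding error that would appear if the truncated 266 and 333 MHz values were multiplied directly.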
Software
Software | 64-bit application | Multi-threaded application*
Microsoft Windows XP Professional SP2 | + | +
Microsoft Windows Vista Ultimate SP1 | + | +
Autodesk 3ds max 9 SP2 | + | +
V-Ray 1.5 SP1 | + | +
Autodesk Maya 2008 Ultimate | + | +
NewTek Lightwave 3D 9.2 | + | +
SolidWorks 2007 SP0.0 | + | +
PTC Pro/ENGINEER Wildfire 3.0 M120 | + | -
UGS NX5 5.0.0.25 | + | +
Wolfram Research Mathematica 6 | + | +
MapleSoft Maple 11 | - | +
MathWorks MATLAB 2007 | + | +
Adobe Photoshop CS3 10.0 | - | +
Microsoft Visual Studio 2008 | + | +
Apache HTTP Server 2.2.8 | - | +
PHP 5.2.5 | - | +
MySQL Community Server 5.0.51a | - | +
ACDSee 10 Photo Manager | - | +
xat.com Image Optimizer 5.10 | - | -
IrfanView 4.10 | - | -
XnView 1.93.4 | - | -
Paint.NET 3.30 | + | +
7-Zip 4.57 | + | +
WinRAR 3.71 | - | +
UltimateZip 3.2 | - | -
FLAC 1.2.1 | - | -
LAME-MT 3.97 | + | +
Musepack MPC Encoder 1.16 | - | -
Nero Digital Audio Encoder 1.1.34.2 | - | +
Ogg Encoder 2.83 (Lancer) | - | +
Canopus ProCoder 3.0 | - | +
DivX Codec 6.8.2 | - | +
XviD Codec 1.1.3 Final | - | -
x264 Codec rev 807 | - | +
VirtualDub 1.8.0 | - | +
Call of Duty 4: Modern Warfare (Patch 1.5) | - | +
Call of Juarez (Patch 1.1.0.0) + DX10 Enhancements Pack | - | -
Crysis (Patch 1.2) | + | +
S.T.A.L.K.E.R. (Patch 1.006) | - | +
Unreal Tournament 3 (Patch 1.2) | - | +
Company of Heroes (Patch 1.71) | - | +
World in Conflict (Patch 1.007) | - | +
* Means that two or more threads were actually active simultaneously during the tests, not merely that the process spawns several threads.