iXBT Labs - Computer Hardware in Detail

Platform

Video

Multimedia

Mobile

Other

KYRO from STMicroelectronics/Imagination Technologies



On the 31 of August, 1999 the world started to change when the NVIDIA company has announced GeForce 256. From that time all the developments have been aimed at 3D graphics to be more realistic. The developers do not think whether to provide with some particular hardware support, they just do it. So, GeForce 256 has performed its mission. But today it's not only NVIDIA that contributes to the progress, it's also ATI. Soon we'll witness new products with HW T&L support from Matrox and 3dfx. The latest 3D accelerators support a lot of interesting useful functions, which allow to increase an image quality. And there is one more fact to stress. The Voodoo5 5500 card from 3dfx with 2 graphics processors shows the performance on the level of a card with only one graphics core - GeForce 256 from NVIDIA with DDR SGRAM. Today, multichip solutions do not seem so profitable and winning in the speed aspect. Besides, almost all modern 3D graphic solutions are based on polygon or so-called traditional rendering architecture. But the company that has been developing for a long time graphics processors based on tile (or non-traditional) rendering architecture is Imagination Technologies Ltd (earlier known as VideoLogic). Its PowerVR Technologies department deals with graphics processors. Last year the STMicroelectronics became a partner of Imagination in silicon engineering and chip production (earlier it was known as SGS Thomson). STMicroelectronics is a giant in semiconductors' production with its own plants and powerful engineering labs. Some time ago STM was a partner of NVIDIA at creation and production of chips of NV1 an Riva128 series.

So, the cooperative labour of PowerVR Technologies and STMicroelectronics has led to a new product KYRO which belongs to the new series of graphics processors with tile architecture - PowerVR Series3 (PVRS3).

As you might know, the first version of accelerators of Power VR series was quite successful in the market (i.e. m3D accelerator from Matrox). Then after a while it released NEON-250 (a new accelerator of Power VR Series 2), but it wasn't the best one, as at that time there were inexpensive accelerators with the higher performance. In result, the Neon 250 based cards were only on the British market.

Well, now let's pass over to the features of KYRO architecture.

Tile architecture

As you know, in the traditional 3D rendering architecture all operations are carried out upon polygons, which build the scene. The polygons' assortment is implemented on the base of the information from a Z-buffer. These data, depending on the reading way, can form either Z-buffer or W-buffer. The difference is in the sorting way. The performance might increase, if the polygons are executed beginning from the near to the far. The Z-buffer requires some memory. But it causes extra expenditures. There are 2 solutions: you can apply different modified algorithms or use a non-traditional rendering architecture (i.e. a tile one). One of the advantages of the tile architecture is that it's unnecessary to use a Z-buffer. The Z-buffer in tile accelerators is integrated in a chip. This helps to economize the memory; then the performance increases at the expense of absence of additional reading/writing cycles in the memory. The tile architecture works the following way: the whole scene is divided in small parts called tiles (i.e. 16x32 pixels). Each tile is operated separately. For it, there are the following basic tools:

  • Frame Buffer Tile;
  • Deferred Texturing;
  • high accuracy and image quality at color blending;
  • scalability.

Frame Buffer Tile is a separate space in the chip (alike an integrated memory), which is intended for sorting of polygons, pixels processing etc. All these processes are executed in the chip at 32-bit color mode and at the full core frequency. For example, at the clock frequency of 125 MHz the tile processor can handle about 20 million triangles per second.

Deferred Texturing deals with final pixels (those which will be displayed). And there is no need in unnecessary texture memory access.

Next. Since the blending operations is made in the chip with 32-bit accuracy, the color rendition is very high.

The scaling is very easy for the tile processors: the performance increases proportionally the number of the tiles.

Now let's turn to the chip itself.

KYRO basic characteristics

3D part includes the following hardware possibilities:

  • Versatile DirectX7 vertex format. DirectX8 support will be included in the drivers after an official release of DirectX8 API;
  • Full support of OpenGL ICD;
  • 8-layer multitexturing with some unique possibilities;
  • A mechanism of full preparation of triangles and parameters of texturing/blanking: a hidden surface removal, shading and texturing, where the all parameters of the scene are kept in the local memory (it unloads the processor and the interface bus). All these operations are carried out in the chip;
  • DLR mechanism (Display List Renderer) allows to divide the whole scene in separate parts (tiles or regions, TILES), establishing tile priorities;
  • RGB-shading and mirror reflection effect;
  • Texture filtering (bilinear, trilinear and anisotropic);
  • Color and texture alpha-blending;
  • Table and vertex mist;
  • Texture compression;
  • Relief texturing: Environmental Bump Mapping (EMBM), Embossed Bump Mapping and Perturbed UV Bump Mapping (aka Dot Product);
  • Full-screen spline (FSAA by OGSS method) according to the x2 and x4 scheme

128 bit GUI acceleration (2D-part) includes:

  • 3-operand ROPs;
  • Clipping;
  • Color extension;
  • Bit block operations (BitBLT): transparency and expansion;
  • Pipeline optimization, that ensures protection from "fallouts" of the operations with 24bit screen depth;
  • All 2D operations are carried out hardwarily, including monochrome extension, dots and lines drawing.

The interface part supports:

  • AGP 1x, 2x and 4x modes with usage of SBA for operations of AGP-texturing. AGP 4x provides 1.056 GBps peak channel with a possibility to keep up to 32 questions in the storage buffer.
  • DMA Bus Mastering;
  • Digital output interface: 12 bit multiplex digital interface (12 bit DVA_D digital port), an interface for standard LCD-devices and digital TV-coders (Conexant Bt868/869, Chrontel CH7003/7004 or digital transmitter with Silicon Image PanelLink Sil154 interface), 24 bit LCD modes are available up to 1280x1024 at 60 Hz of frame synchronization;
  • Digital port modes - Master/Slave. In the "Master" mode the chip generate DAC clock signals of HSYNC and VSYNC. In the "Slave" mode a data stream is synchronized, if the mentioned synchrosignals are generate by external circuits;
  • A videoport for a video capture, a TV-tuner and videoconferences(H.324 protocol). A compatibility with a VBI port supports for data capture for Intercast, Closed Caption and TeleText;
  • All subsettings of PAL/NTSC formats. If a TV-out is switched on, the digital panel is off, but CRT-display may be used as a "mirror" with the same resolution.
  • PC'99 specification requirements;
  • BIOS version is completely compatible with IBM VGA and corresponds to VBE 3.0, DPMS and DDC2Bi requirements.

"Stream" video and MPEG2 decoding support:

  • Full hardware support for DVD;
  • Mechanism of motion compensation;
  • Support of 4:2:0 overlays;
  • Color conversion;
  • Hardware interpolation on X and Y axes.

128 bit memory interface supports:

  • SDRAM and SGRAM interface;
  • 125 MHz core/memory clock (2 GBps bandwidth);
  • Total size of supported local memory from 16 to 64 MBytes;

The integrated 270 MHz palette DAC contains:

  • Clock frequency synthesizer for internal circuits synchronization and realization of phase-locked-loop frequency control;
  • 32x32 hardware cursor;
  • Possibility of 1920x1280 in TrueColor at 75 Hz frame synchronization.

The board

Now let's look on the card:

The card has AGP2x/4x interface, 64 MBytes SDRAM located in 8 chips on the right side of the PCB:

The memory chips are produced be IBM, they are characterized by the 6.8 ns access time, 143 MHz working frequency. As for KYRO working frequency, we might suppose, that the chip works at 115 or 125 MHz.

The board has an active cooler, which you might see only on some Canopus Spectra cards. This small quality fan moves the air volume along the videocard.

The board is green with yellow writings. The PCB has a jack for digital displays and the relative control chip.

Overclocking

At the present moment there are no any utilities for an overclocking on the KYRO, that's why the card was working at the regular frequencies.

Installation and drivers

Now we'll dive into the practical part. The testbed is following:

  • Intel Pentium III 1000 MHz:
  • Chaintech 6OJV (i815) mainboard;
  • 256 MBytes PC133 ;
  • IBM DPTA 20GB HDD;
  • Windows 98 SE;
  • ViewSonic P810 (21").

For testing we used 4.44 drivers' version.

Setup the drivers and you will see the following:

2D-parameters adjustment:

image output on the LCD (doubling):

Direct3D-parameters setting:

OpenGL-parameters setting:

As you can see, the settings allow to adjust main 3D-functions: filtering, switching-out of Vsync, Z and W-buffers control, EMBM relief texturing etc. The drivers include ICD OpenGL, but, unfortunately, they don't support API PowerSGL (that's why the game Unreal failed to set up through this interface).

No means for overclocking are available here.

The test results

I would notice with pleasure, that the KYRO based card from STMicroelectronics has all 2D marks on a quite high level.

Let's check the speed of KYRO in 3D-graphics with 3 games:

  • Rage Expendable (Direct3D, multitexturing);
  • id Software Quake3 v.1.17 (OpenGL, multitexturing);
  • Epic Games Unreal (Direct3D, multitexturing).

These tools are quite enough to show the videocard performance when working through 3 main API. For comparison we give you the diagrams of 2 more videocards: ELSA ErazorX (based on NVIDIA GeForce 256 SDR) and reference card NVIDIA GeForce2 MX.

As you see, the KYRO based card falls behind here in the all tests but for Expendable in 32-bit color mode. The conclusion is that with such poor drivers KYRO based cards couldn't compete with those based on NVIDIA GeForce2 MX.

EMBM support doesn't help the performance increase, because the most of the games with EMBM support work only with G400 based cards from Matrox.

First it may seem that the tile architecture hasn't proved to be the best. But it has! Look: KYRO has only 2 rendering pipelines with 1 texturing module on each, and GeForce 256, for example, has 4 pipelines with one textured block on each. And the performance of the both cards doesn't differ.

Note, that "High Quality" was switched on when in Quake3 in 32-bit color mode (which ensure the trilinear filtration). In 16-bit color I also specially set the trilinear filtration to show you that the performance difference between 16-bit color and 32-bit color is nearly zero.

Although, the general speed decrease when using the trilinear filtration on the KYRO based card is horrible. And this is not surprising, since KYRO has only 2 pixel pipelines and 2 texturing modules. This means that multitexturing and trilinear filtration can't work simultaneously at the high speed. It's similar to the performance decrease with anisotropic filtration in use. Then, the speed when in the FSAA mode (Anti-aliasing) is awfully low.

In general, today NVIDIA GeForce2 MX can easily beat KYRO. Though, we are not sure what the price is, and whether the drivers will be improved. If they are, this chip might be promising. Besides, there are complains about graphic image deformations in some games.

As for the anisotropic filtration, you can see yourself how high the quality is:

trilinear filtration

anisotropic filtration

The image quality of the second screenshot is much higher. But the performance in this case is 3 times lower!

Note, that KYRO has a real trilinear filtration as compared with an approximated one of NVIDIA Riva TNT/TNT2 and 3dfx Voodoo3/4/5 (in OpenGL/Glide).

When playing DVD-Video with WinDVD 2.1 programmed player the CPU load is not more than 25%, though there are some small artifacts.

Conclusion

Well, the KYRO based card (PowerVRS3) from STMicroelectronics/Imagination has crashed nearly in all speed tests when competing with the NVIDIA GeForce2 MX and GeForce 256 SDR based cards.

Undoubtedly, KYRO has shown all advantages of a tile architecture (if not take in not account the performance in the games). But in general, users would hesitate whether it's worth to buy. Only if the price is not more than $100 and the drivers are improved, it will have a chance of success in the market.

Some manufacturers (VideoLogic, Innovision) have announced KYRO based card. Besides, there is some probability that some cards will be based on a new version of KYRO, what might increase the performance.

Highs:

  • the tile architecture (it allows the 2 pipeline chip to compete against a 4 pipeline processor with a polygon architecture);
  • perfect DVD playback;
  • elaborate card design;

Lows:

  • low performance;
  • too high initial price;
  • high performance decrease with a trilinear and anisotropic filtration in use;
  • no overclocking means.

Write a comment below. No registration needed!


Article navigation:



blog comments powered by Disqus

  Most Popular Reviews More    RSS  

AMD Phenom II X4 955, Phenom II X4 960T, Phenom II X6 1075T, and Intel Pentium G2120, Core i3-3220, Core i5-3330 Processors

Comparing old, cheap solutions from AMD with new, budget offerings from Intel.
February 1, 2013 · Processor Roundups

Inno3D GeForce GTX 670 iChill, Inno3D GeForce GTX 660 Ti Graphics Cards

A couple of mid-range adapters with original cooling systems.
January 30, 2013 · Video cards: NVIDIA GPUs

Creative Sound Blaster X-Fi Surround 5.1

An external X-Fi solution in tests.
September 9, 2008 · Sound Cards

AMD FX-8350 Processor

The first worthwhile Piledriver CPU.
September 11, 2012 · Processors: AMD

Consumed Power, Energy Consumption: Ivy Bridge vs. Sandy Bridge

Trying out the new method.
September 18, 2012 · Processors: Intel
  Latest Reviews More    RSS  

i3DSpeed, September 2013

Retested all graphics cards with the new drivers.
Oct 18, 2013 · 3Digests

i3DSpeed, August 2013

Added new benchmarks: BioShock Infinite and Metro: Last Light.
Sep 06, 2013 · 3Digests

i3DSpeed, July 2013

Added the test results of NVIDIA GeForce GTX 760 and AMD Radeon HD 7730.
Aug 05, 2013 · 3Digests

Gainward GeForce GTX 650 Ti BOOST 2GB Golden Sample Graphics Card

An excellent hybrid of GeForce GTX 650 Ti and GeForce GTX 660.
Jun 24, 2013 · Video cards: NVIDIA GPUs

i3DSpeed, May 2013

Added the test results of NVIDIA GeForce GTX 770/780.
Jun 03, 2013 · 3Digests
  Latest News More    RSS  

Platform  ·  Video  ·  Multimedia  ·  Mobile  ·  Other  ||  About us & Privacy policy  ·  Twitter  ·  Facebook


Copyright © Byrds Research & Publishing, Ltd., 1997–2011. All rights reserved.