Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Graphics Open Source Games Hardware

Nvidia's Fermi Architecture Debuts; Nouveau Driver Already Working 70

crookedvulture writes Nvidia has lifted the curtain on reviews of its latest GPU architecture, which will be available first in the high-end GeForce GTX 680 graphics card. The underlying GK104 processor is much smaller than the equivalent AMD GPU, with fewer transistors, a narrower path to memory, and greatly simplified control logic that relies more heavily on Nvidia's compiler software. Despite the modest chip, Nvidia's new architecture is efficient enough that The Tech Report, PC Perspective, and AnandTech all found the GeForce GTX 680's gaming performance to be largely comparable to AMD's fastest Radeon, which costs $50 more. The GTX 680 also offers other notable perks, like a PCI Express 3.0 interface, dynamic clock scaling, new video encoding tech, and a smarter vsync mechanism. It's rather power-efficient, too, but the decision to focus on graphics workloads means the chip won't be as good a fit for Nvidia's compute-centric Tesla products. A bigger GPU based on the Kepler architecture is expected to serve that market." Read on below for good news (at least if you prefer Free software) from an anonymous reader. Update: 03/22 19:35 GMT by T : Mea culpa -- that headline should say "Kepler," rather than Fermi; HT to Dave from Hot Hardware (here's HH's take on the new GPU).
Our anonymous friend writes "The open-source Nouveau driver project that reverse-engineers the official NVIDIA driver to provide a free software alternative has made some big accomplishments. Nouveau announced today they have same-day Kepler support and are now de-staging on Linux. The GeForce GTX 680 'Kepler' launch just happened hours prior to Nouveau, somehow managing initial mode-setting support with early hardware, from a project that NVIDIA 'officially' does not support. The de-staging in the Linux kernel now means that the driver is at version 1.0 with a stable ABI."
This discussion has been archived. No new comments can be posted.

Nvidia's Fermi Architecture Debuts; Nouveau Driver Already Working

Comments Filter:
  • Wrong architecture! (Score:5, Informative)

    by kz26 ( 1017248 ) on Thursday March 22, 2012 @01:20PM (#39442545) Homepage
    I believe you mean Kepler, not Fermi, in the story title.
  • Re:Fermi ? (Score:2, Informative)

    by Lunix Nutcase ( 1092239 ) on Thursday March 22, 2012 @01:58PM (#39442955)

    The saddest part is the summary correctly mentions that it's Kepler. Timothy once again shows off either his piss poor editing skills or the fact that he's illiterate.

  • by Anonymous Coward on Thursday March 22, 2012 @02:28PM (#39443283)
    Firstly, this new architecture (GK104) has a great number of cores (192 versus 32 of the Fermi architecture) sharing a single control logic within a stream multiprocessor (SM). Internally, each SM is SIMD, so this move is bad for divergent kernels, i.e., algorithms containing if-then-else constructs. Secondly, as usual from Nvidia, the GeForce brand has poor double-precision performance, only 1/8 of the SP's. On the other hand, the AMD Radeon HD7000 family doubles this fraction, being much faster at DP operations, which is a must for scientific computing.
  • Re: Nouveau (Score:5, Informative)

    by Anonymous Coward on Thursday March 22, 2012 @02:54PM (#39443569)

    Troll spotted

    As a Nouveau dev, I can tell what's wrong with Nouveau and it is not the lack of acceleration!

    First of all, we have 2D and 3D acceleration (up to OpenGL 3 and a toy directx 10/11 support that runs Unigine Heaven) for all cards, back to the TNT 2 (of course, no hw opengl 3 there). OpenGL has been good-enough for me to play many games at decent framerates and have a composited desktop running on all my cards minus one. This one is the half-Fermi/half-Kepler nvd9 that still needs some love.

    Up until the G50, there mostly was no real power management. Clocks were set at boot time and that was enough for us.
    G50 introduced reclocking support on mobile GPUs. The boot clocks were no longer set to the stock values but only to lower clocks (let's say half the normal frequency). Most desktop GPUs lacked power management.
    GT215 extended the laptop power management scheme to desktops.
    Fermi, of course, kept that scheme but pushed it a little further. Now, boot clocks are terribly low (core = 50MHz, memory = 100MHz) at boot time.

    On my GTX460, Nouveau is perfectly usable on kde 4.8 (I have 100fps with KWin and the OpenGL backend) but games are obviously really slow, about 30fps for xonotic.

    At the same clocks, Nouveau's performance is about 80% of the proprietary driver and thus, not bad. Our real problem, is that we need reclocking support to get more performance out of the cards. We have been working on it for about 1.5 years and trust me, it isn't the easiest part of the hardware to reverse engineer.

    So, what's the current state of reclocking support?
    - G50->GT200: Clocks can be set to the desired frequency and the operation should be stable. Some cards don't work but we are ironing the corner cases. In some cases, the screen turns black for a few ms while reclocking. It's a bug I'm working on.
    - G215 -> GF100: Clocks can be set for all engines and memory but the end-result isn't usually working because of some black voodoo we aren't doing right now. It is being addressed.
    - GF100 series: Only the engines can be reclocked. Nothing but very experimental memory reclocking. It is being worked on.
    - Kepler: Hey, it was released today, most of us haven't put my hands on anything yet.

    If reclocking is supported on your card, dynamic reclocking is a piece of cake (compared to reclocking) and the support for it has already been written.

    To sum up, we have hw acceleration on all cards but nvd9 (unless you use some microcode from the blob) and Kepler. The only problem with 3D is the lack of proper power management but it is being worked on and we have made great progress. As cards are all different but in fact doing the same thing for it (even across generations), I have good hopes that Kepler will be fully functional 3D-wise before a new series come.

    Remember that, contrarily to the blob, we do support cards older than geforce 7 AND we provide out of the box/open source hw acceleration that is already way sufficient for desktop usage. Also, remember that this work is mostly done by a core team of less than 10 people, most of us being students and only one being paid by Red Hat.

    Martin Peres, PhD student working on power management on Nouveau

  • by jensend ( 71114 ) on Thursday March 22, 2012 @04:19PM (#39444447)

    The double precision situation is a lot worse than that. For GK104, fp64 performance is only 1/24 fp32. Previous to this, NV's consumer cards did fp64 at 1/12 (midrange) or 1/8 (high-end) fp32; I guess that wasn't enough handicapping to protect their Tesla line so they bumped it up.

    If you need more precision than fp32 and want to use nV consumer GPUs you should consider software emulation. A very simple software double emulation scheme can give you 1/6 - 1/4 of fp32 performance. Of course it's less precise than fp64- it has 48 significand bits (double fp32's 24, less than fp64's 53) and 8 exp. bits (same as fp32, 3 less than fp64), and to get ~1/4 of fp32 performance you have to skip a lot of error/NaN/inf handling type stuff. But it's probably sufficient for a lot of applications where people use fp64. Even software "quad-single" (96 significand bits using 4 32-bit floats) would likely be faster than nV's native fp64.

    OTOH, AMD doesn't have much reason to handicap its cards, as you mention, its cards do fp64 at 1/4 fp32-- and that's with full IEEE 754 compliance. They used to be at a big disadvantage for GPGPU, but with their new compute-oriented GCN architecture and their now-huge fp64 lead for $2000 cards, I think a lot of GPGPU folks will switch.

The one day you'd sell your soul for something, souls are a glut.

Working...