Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
×
Graphics Power Hardware

Nvidia's RTX 5090 Power Connectors Are Melting (arstechnica.com) 75

An anonymous reader quotes a report from Ars Technica: Two owners of Nvidia's new RTX 5090 Founders Edition GPUs have reported melted power connectors and damage to their PSUs. The images look identical to reports of RTX 4090 power cables burning or melting from two years ago. Nvidia blamed the issue on people not properly plugging the 12VHPWR power connection in fully and the PCI standards body blamed Nvidia.

A Reddit poster upgraded from an RTX 4090 to an RTX 5090 and noticed "a burning smell playing Battlefield 5," before turning off their PC and finding the damage. The images show burnt plastic at both the PSU end of the power connector and the part that connects directly to the GPU. The cable is one from MODDIY, a popular manufacturer of custom cables, and the poster claims it was "securely fastened and clicked on both sides (GPU and PSU)." While it's tempting to blame the MODDIY cable, Spanish YouTuber Toro Tocho has experienced the same burnt cable (both at the GPU and PSU ends) with an RTX 5090 Founders Edition while using a cable supplied by PSU manufacturer FSP. Plastic has also melted into the PCIe 5.0 power connector on the power supply.

Nvidia's RTX 5090 Power Connectors Are Melting

Comments Filter:
  • ...AI errors.

  • by bill_mcgonigle ( 4333 ) * on Monday February 10, 2025 @06:37PM (#65157345) Homepage Journal

    How much would it cost nVidia to put a temperature sensor on the PCB at the power connector, seeing as how this is a recurring problem?

    • Re: Thermocouple? (Score:4, Interesting)

      by drinkypoo ( 153816 ) <drink@hyperlogos.org> on Monday February 10, 2025 @06:48PM (#65157357) Homepage Journal

      Putting it exactly where the problem is would be expensive because there is no directly suitable connector.

      It's not really necessary anyway. What IS necessary is moving up to a connector that is less crap, i.e. not based on molex mini-fit jr. Molex has connectors which will suit just fine, like the ones used for powering electric jack legs on RVs. The PC industry is just still dicking around with this cheap crap instead of moving to a real connector.

      • by arglebargle_xiv ( 2212710 ) on Monday February 10, 2025 @08:11PM (#65157513)
        Yup, all they need to do is add Anderson connectors to the cards and they'll have plenty of headroom for the future. Or, to be really safe, CEE 17's.
        • Nah we already learned a good solution from Tesla owners. Just wrap the connector with a wet towel (be careful not to get your GPU wet) and it'll support more power.

      • You know what? I thought about this some more and I kind of changed my mind. Not about what they should do, but what it would cost to add a temp sensor while still using the same inadequate connectors. I think you could do it reasonably cheaply by designing thermistors (or whatever kind of sensor) which fit into the spaces where a Mini-Fit Jr. pin goes. However you would still need a new, wider connector on the GPU so that you could intersperse the connector pins with the temperature sensors.

        The right thing

      • by gweihir ( 88907 )

        Indeed. There are enough high-current low voltage converters around that work reliably. For example, solar routinely needs them.

        The real problem they have here is that they run the same voltage over multiple pins without balancing and go close to the limits. A sane design might use multiple connectors for redundancy, but it would make sure each pin can take the full current, exactly to not run into this problem.

      • Solid take.
        12VHPWR is a pile of shit, and PCI-SIG are specifying connectors with the all-too-infamous Mixture Of Morons specification model.
      • There's nothing "cheap crap" about the Minifit Jr, in fact it's used extensively in industrial systems as well. The fundamental issue is that they are simply pushing this specific connector to close to it's design limit with inadequate margin for failure.

        Why would the PC industry move away from Minifit Jr? It literally hasn't had a problem with them for over 2 decades except for this one singular iteration. Don't throw the baby out with the bathwater.

    • Re:Thermocouple? (Score:5, Interesting)

      by thegarbz ( 1787294 ) on Monday February 10, 2025 @06:49PM (#65157359)

      Not ideal. The problem is that NVIDIA doesn't decide how board partners lay out their device. They followed a standard. Now the standards body may have some soul-searching to do, they've already tried to address this problem once by mandating proper current available signalling and adjusting of the pins so that the connector has to be seated all the way but ultimately temperature sensor may not help much. Fundamentally there's still not enough margin for error so simply straining the connector can cause the metal contacts to not seat perfectly. And they need perfect contact for their max power rating.

      The fundamental problem is they made the connector too damn compact. Bigger pins, better contact would have resolved this. If they had the space for a temp sensor in the middle they could have just opted for a larger connector in the first place. Cars don't have this problem because their connectors are *looks up technical term* fucking massive.

      Really this whole 12VHWPR thing needs to just be relegated to the dustbin of history and something better developed.

      • Re:Thermocouple? (Score:5, Interesting)

        by JaredOfEuropa ( 526365 ) on Monday February 10, 2025 @07:25PM (#65157429) Journal
        Seems like that's not the only issue with the 40 series. My 4090 died randomly, power connector was fine. A repair guy had a look at it and reflowed the GPU core, which seems to have fixed it. According to him, problems with the core's connection to the board, or even with the core itself developing faults, are rather more common than with other graphics cards, and it seems it's an issue across all brands that offer this card. I hope the 50 series doesn't suffer from the same problem.
        • Not an issue with the core but rather the connection to the board as you said. This isn't general for all 40series but varies between vendor. One of the primary issues there is the sheer mounting pressure that a modern heatsink applies. Depending on the design of the cooler it can apply insane stresses to the PCB causing it to bow significantly under the pressure. Many designs these days all but mandate a support clip on the back of the GPU core to apply pressure without bending the PCB. Those without it, e

      • The problem is that NVIDIA doesn't decide how board partners lay out their device.

        The report is about Founder Edition cards, which are manufactured by NVIDIA itself, not by partners.

        • Not sure what your point is. Do you think partner manufacturers have magically less power demands on their 4090 cores? The issue is the design of the power connector which is *NOT* NVIDIA designed.

          There's literally countless cases of cards with 12VHPWR connectors suffering from burnouts across all manufacturers. The fact that this story happens to be about a founders edition card is not relevant, and again to the parent's point: NVIDIA has a technical impossibility in placing a temperature sensor in the con

    • Probably easier to just design it with a connector rated for the wattage that could possibly be flowing through it.

      Well, we would think so at least. And then we get two generations of insanely priced GPUs that melt their power connections.

      • Probably easier to just design it with a connector rated for the wattage that could possibly be flowing through it.

        They do.

        Problem, is the connector only conducts sufficiently at specified wattage with 16 asterisks after the claim.
        There is no other PC standardized connector for this problem.

        AMD-branded cards that stick with the 8-pin molex will be using 2-3 of them on RDNA4, depending on power level.
        PCI-SIG needs to fire whatever dumpsterfire committee came up with 12VHPWR, and its bullshit "backwards compatible" fix 12V-2x6, and give us a connector that isn't a fucking fire hazard.

      • They already do this. The problem is fundamentally in the rating. The rating has no margin for error included in it, so you can pull this kind of wattage through it just fine as long as everything in your system is *perfect*. And by perfect I mean zero stress or strain on the connector. Plugged in once - virgins being tight isn't just a high school fratboy comment, and making complete contact. As soon as you have a connector that you reuse the spring of the socket weakens. Tilt it slightly when installing c

    • Too much. They need to dump the fucking connector design, asafp.
  • GPU bug? (Score:4, Interesting)

    by WaffleMonster ( 969671 ) on Monday February 10, 2025 @06:50PM (#65157363)

    Wild guessing GPU is pulling way too much energy from one set of pins. Otherwise if there was a weak connection or poor crimps it would likely be localized to one side not both.. pin 6 is toasty on both ends.

    • by gweihir ( 88907 )

      No need to guess. This will be the same thing as last time: Running an unbalanced multi-wire-per-voltage connector up to the pin-limits is a _very_ bad idea. It just takes one pin with a not so good connection and the whole thing melts. Sane people will restrict themselves to something like 50% per pin or even less.

      • Most multi-pun connector manufacturers publish derating figures for their product. Molex microfit and minifit design docs have this for certain. They should have used a 6x1 minifit jr
  • by sinij ( 911942 ) on Monday February 10, 2025 @07:10PM (#65157399)
    I guess asking price of $3K didn't leave enough room for proper QA testing.
    • by gweihir ( 88907 )

      Or more likely that $3000 price tag did not leave $30 for a connector that can reliable take the power.

      • You're suggesting they spend $30 for a quality connector when the CEO can buy a bigger yacht? What are you? A communist?!

        • by gweihir ( 88907 )

          Hahaha, no. I suggest seeing these $3000 Nvidia cards as what they are: badly made consumer-crap.

          Not that I care. I found a few years ago that I am perfectly happy with FullHD. No issues with cards for that.

    • QA testing doesn't address user cases. Of note is that not all the cards are having issues. The connector design is incredibly finnicky when it gets to its limits. Simply trying to keep your cables neat in your PC often puts enough strain on the connectors for them to make imperfect contact. Not an issue when you're not pushing the connector to its limits.

  • by Pezbian ( 1641885 ) on Monday February 10, 2025 @07:20PM (#65157417)

    Why hasn't a 24V rail been added as standard? Or even 48V?

    • by ddtmm ( 549094 )
      That would be worse. Delivering a higher voltage means they would have to regulate the voltage down even more to the required component voltages, which would generate more heat on the card. That would ultimately mean more power consumption and more required cooling. You're just trading one problem for another. Add to that, there are no 24V or 48V rails on a ATX supply, so now you're buying a custom power supply and trying to find a place to put it.
      • We already had to get a custom power supply for the fucking connector that melts
        • No we didn't. Literally every manufacturer offers either adapters or in some cases whole replacement cables if you have a modular PSU. My relatively ancient power supply has a perfectly modern 12v-6x2 (H++) cable coming from it.

      • Is stepping 24V down to the voltages required that much less efficient than stepping down 12V

        I did a quick google search, and i don't immediately find much on the topic, but you seem to know more?

        • Yes it is a more difficult engineering challenge. Efficiency is achieved by either keeping a stable load or keeping the stepping small. GPUs are not a stable load drawing anywhere from a couple of watts to over 500watts. That is not an easy case to design for in a switching system. It's also why there are already so many regulators on the PCB. In many cases several operate in parallel under varying loads.

    • A100 GPUs use 48vdc in datacenters, apparently:

      https://l4rz.net/running-nvidi... [l4rz.net]

      And the GaN based buck converters are apparently much smaller in dimension than what we've been used to:

      https://www.eenewseurope.com/e... [eenewseurope.com]

      https://www.eetimes.com/how-ga... [eetimes.com]

      So... yeah, I would like to see them update the standard to allow for 48v PSUs in consumer PCs.

      • A100 GPUs run under relatively constant load. That is a very different design case to design for than a consumer GPU. They also don't have a bargain basement design or compete on cost.

        GPUs are already expensive enough without having to spend hundreds more on more advanced regulators like in the A100.

    • Because you're solving one problem while creating another. Specifically you've now added extra design constraints to efficiently stepping down the voltage to the 0.8-1.2V needed for the core generating more heat and requiring more power through the connector.

      The issue isn't the voltage. The issue is 12VHPW is too small of a connector. They should have just stuck with the original connectors which had significantly larger pins. The MiniFit Jr had a 13.5A current limit per pin. For the purposes of PCs they dr

  • Maybe somebody should first explain why we can safely push 1800 watts down a 2-conductor wire from a standard outlet, but we need a 12-pin connector for video cards and can't manage a measly 600 watts without starting a fire.
    • by PPH ( 736903 ) on Monday February 10, 2025 @07:35PM (#65157449)

      Maybe somebody should first explain why we can safely push 1800 watts

      Maybe because 1800 Watts at 120 Volts is 15 Amps. 600 Watts at 12 Volts* is 50 Amps.

      *Just guessing about the RTX power requirements based on what a typical PS provides.

      • by rta ( 559125 )

        And why "amps" is a problem is because the power dissipated in a wire (or connector) increases w/ the square of current, whereas it's only linear in voltage.
        power = I^2 * R

        So if you have a connector with say 0.0001 ohms (which is a random number and pretty high for a connector based on the math below) ... it is dissipating:
        in the 50A@12V -> 50 * 50 *12 * 0.0001 = 3W
        vs 15A@120V -> ---> = 2.7W

        Huh. that's closer than i would've thought

        • Your standard outlet and power plug for AC is still a lot bigger, with a lot more reliable contact surface.
        • by PPH ( 736903 )

          That's not how loss in a section of conductor works. First, the supply voltage doesn't contribute to the loss, the voltage drop across the conductor section does. So:

          50A * 0.0001 = 0.005 V

          15A * 0.0001 = 0.0015 V

          And the power dissipation for each case is V*I:

          50 A * 0.005 V = 0.25 Watts

          15 A * 0.0015 V = 0.0225 Watts

          Quite a difference. It may seem like a small amount of power in each case, but it is dissipated in a very small volume of metal. So it gets hot.

          • by rta ( 559125 )

            yow. sorry about that. I have NO idea why i multiplied by voltage when plugging the numbers in my previous post.. :facepalm: it's been way too long. Thanks for pointing it out.

              i^2 * r just combines the two steps into one to give the power dissipation. (it comes from plugging v=IR into P = IV )

            50A * 50A * 0.0001 ohm = 0.25W

            15A * 15A * 0.0001 ohm = 0.0225W

            as you pointed out.

        • 0.0001 ohms seems pretty low actually.
          The actual connected used here is rated at a maximum of 0.005ohms per pin, with 6 pins for power and 6 for ground, that is 0.005/6 + 0.005/6 = 0.0017 ohms.
          Pushing 50A through that is 1.4W of heat in the connector alone.
          They're also only rated at 50 mating cycles before the resistance goes up by another 0.005R per pin. That'll double the heat
          Another 5mR for thermal aging: 105C for 240 hours
          Humidity rating adds another 5mR

          The connector is rated for 600W,, but at that powe

    • In a word, amps. 600W @ 12V is 50A. 1800W@120V is 15A, and the standard 120V connector is a nice fat wide contact.
    • Ohms law. You can’t escape it.

    • by Lehk228 ( 705449 )
      ohms law (lower voltage means higher amps and it's amps that burn out a connector), plus the connector standards for inside a PC are total shit
    • by gweihir ( 88907 )

      Simple: That standard outlet and plug has to conform to requirements that mandate generous reserves. Connectors that can safely do 1000W at 12V exist. They just cost money.

      • To further that it's worth noting that the original ATX12v connector is a derated minifit-Jr connector which then has an ADDITIONAL 1.9x safety factor applied. On the other hand the 12VHPWR connector is an originally rated microfit-JR connector with only an abysmal 1.1x safety factor. The standards committee really fucked up. There's just not enough room for error here on a connector design where the pins aren't encapsulated (i.e. strain on the cable causes torque on the mating surfaces)

        • by gweihir ( 88907 )

          Indeed. A 1.1x "safety" factor with no derating is a joke and an accident waiting to happen. If even only the most tiny problem manifests, things will blow up. If the connector just exactly meets specs, things will already very likely blow up. They probably had no experienced EEs on that standards committee or they kept silent. Sounds like they looked at the absolute maximum ratings on the data-sheet and decided to go with those numbers, and nobody knew what derating even means.

          Somebody did cheaper-than-pos

  • I'm over here paying extra to have thermal sim, and real world load testing on my ESP-32 hat PCB for LEDs. A 14$ board... ;-D
  • Time to modernize the standard and bump 12 volts up to 48. It doesn’t look like GPU power usage is going to decrease any time soon and going to 48 volts will drop the current to 1/4 of what it was. The cards already have built in power supplies to drop the 12 volts down.

    • by gweihir ( 88907 )

      But then people would have to throw away their old PSUs and these loads are just temporary insanity for a small customer segment anyways. They simply should have used good connectors and put in a reasonable safety margin.

    • Why throw the baby out with the bath water simply because someone designed a poor connector? 48V places a very expensive burden on regulator design. GPUs are expensive enough as they are. We don't have the $3-5k to fork out for a data-centre grade card where that kind of voltage regulation is common.

      And you want the people who brought you this shitty connector to redesign the entire power system? No thanks. Let's deal with the actual problem - fire a bunch of people at the standards committee and get a norm

  • And has been running 24/7 for over a year with no burnt cables. It is a Seasonic 1000w unit purchased when I bought the video card.

    How can they fuck up a standard connector? We all know how much current you can pump through a certain gauge of wire.

    I am so glad I went with AMD this last time around on my build.

    • We all know how much current you can pump through a certain gauge of wire.

      It's that kind of knowledge that caused the problem. People think of the wire. They think of the connector pins. What they don't think of is safety factor (room for error), loading factor (more hot pins in close proximity requires additional derating), they don't think of an unencapsulated connector design (where strain on the cable causes torque on connector mating surfaces) or that people will re-plug cables (which causes problems over time).

      We all know how much current these connectors are capable of. Bu

  • They appear to be using connectors with little or no safety margin, where everything must be perfect or they overheat. It seems so simple to use more pins, bigger pins or some other strategy to make them more reliable

    • by gweihir ( 88907 )

      Last time that was the case. Running a "max 8A" connector at 8A. With the additional problem that this is a multi-pin design and no balancing on the power lines, hence anybody smart would never go up to the max in the first place. And then you have people that are not very competent plug these in. I would not feel comfortable with more than half the rated current per pin under these conditions.

      As to why do they not use better connectors? Simple: greed. I mean, they would make less in profits if they put in

  • For the price they are asking for these things, AND the ridiculous power draw that is borderline within ATX(or whatever it is now) power supply designs. Just move those connectors to an external port on the card and include a power adapter that accepts a standard computer power cable with a pass through to the main power supply's AC input.

    No personal computer should need a 1000w supply. Or even a 500w supply.

    Especially when just playing a game, for which modern GPUs shouldn't be at max power dissipati
  • Just give it an external power source and plug it directly into the wall. Power requirements for GPUs are getting so high, it's going to happen soon enough anyways.
  • Using a connector again that already caused bad issues is not smart. Sure, most or all of these problems will be because some people have reached the level of incapability needed to be to incompetent to plug-in a connector. You still have to idiot-proof that connection, unless you start only selling to EEs with proven credentials.

    • Using a connector again that already caused bad issues is not smart.

      It's not the same connector. The connector design was changed in late 2023 in hopes to address the original melting problem - and it largely did. The problem is the same morons at the standard committee involved in the first one seemingly were involved in this one too.

  • Well at least they can RMA it with full warranty because they didn't buy it secondhand from a scalper or with a BS bot account...ohhh shit.
  • Heard this before, didn't the 40xx do this too?
    • Yes but the connector has changed since then. They redesigned the pins. What colloquially is called 12VHPWR can refer to either the original connector or the one with a 12V-6x2 (H++) connector on the cable. The issue here appears to be the newer one is also melting.

  • It's designer error. 12HVPWR is the stupidest connector choice ever. It could have been a fucking 1/4" quick disconnect jammed over a plated tab on the pcb, for fuck's sake
  • from the hottest ticket gpu currently none available?

Riches cover a multitude of woes. -- Menander

Working...