![Graphics Graphics](http://a.fsdn.com/sd/topics/graphics_64.png)
![Power Power](http://a.fsdn.com/sd/topics/power_64.png)
![Hardware Hardware](http://a.fsdn.com/sd/topics/hardware_64.png)
Nvidia's RTX 5090 Power Connectors Are Melting (arstechnica.com) 77
An anonymous reader quotes a report from Ars Technica: Two owners of Nvidia's new RTX 5090 Founders Edition GPUs have reported melted power connectors and damage to their PSUs. The images look identical to reports of RTX 4090 power cables burning or melting from two years ago. Nvidia blamed the issue on people not properly plugging the 12VHPWR power connection in fully and the PCI standards body blamed Nvidia.
A Reddit poster upgraded from an RTX 4090 to an RTX 5090 and noticed "a burning smell playing Battlefield 5," before turning off their PC and finding the damage. The images show burnt plastic at both the PSU end of the power connector and the part that connects directly to the GPU. The cable is one from MODDIY, a popular manufacturer of custom cables, and the poster claims it was "securely fastened and clicked on both sides (GPU and PSU)." While it's tempting to blame the MODDIY cable, Spanish YouTuber Toro Tocho has experienced the same burnt cable (both at the GPU and PSU ends) with an RTX 5090 Founders Edition while using a cable supplied by PSU manufacturer FSP. Plastic has also melted into the PCIe 5.0 power connector on the power supply.
A Reddit poster upgraded from an RTX 4090 to an RTX 5090 and noticed "a burning smell playing Battlefield 5," before turning off their PC and finding the damage. The images show burnt plastic at both the PSU end of the power connector and the part that connects directly to the GPU. The cable is one from MODDIY, a popular manufacturer of custom cables, and the poster claims it was "securely fastened and clicked on both sides (GPU and PSU)." While it's tempting to blame the MODDIY cable, Spanish YouTuber Toro Tocho has experienced the same burnt cable (both at the GPU and PSU ends) with an RTX 5090 Founders Edition while using a cable supplied by PSU manufacturer FSP. Plastic has also melted into the PCIe 5.0 power connector on the power supply.
It's like real-life (Score:1)
...AI errors.
Re: (Score:2)
"a burning smell playing Battlefield 5,"
A whole new level of realism, smell-o-vision
Thermocouple? (Score:3)
How much would it cost nVidia to put a temperature sensor on the PCB at the power connector, seeing as how this is a recurring problem?
Re: Thermocouple? (Score:4, Interesting)
Putting it exactly where the problem is would be expensive because there is no directly suitable connector.
It's not really necessary anyway. What IS necessary is moving up to a connector that is less crap, i.e. not based on molex mini-fit jr. Molex has connectors which will suit just fine, like the ones used for powering electric jack legs on RVs. The PC industry is just still dicking around with this cheap crap instead of moving to a real connector.
Re: Thermocouple? (Score:4, Funny)
Re: (Score:2)
Nah we already learned a good solution from Tesla owners. Just wrap the connector with a wet towel (be careful not to get your GPU wet) and it'll support more power.
Re: (Score:2)
You know what? I thought about this some more and I kind of changed my mind. Not about what they should do, but what it would cost to add a temp sensor while still using the same inadequate connectors. I think you could do it reasonably cheaply by designing thermistors (or whatever kind of sensor) which fit into the spaces where a Mini-Fit Jr. pin goes. However you would still need a new, wider connector on the GPU so that you could intersperse the connector pins with the temperature sensors.
The right thing
Re: (Score:2)
Indeed. There are enough high-current low voltage converters around that work reliably. For example, solar routinely needs them.
The real problem they have here is that they run the same voltage over multiple pins without balancing and go close to the limits. A sane design might use multiple connectors for redundancy, but it would make sure each pin can take the full current, exactly to not run into this problem.
Re: (Score:2)
12VHPWR is a pile of shit, and PCI-SIG are specifying connectors with the all-too-infamous Mixture Of Morons specification model.
Re: (Score:2)
There's nothing "cheap crap" about the Minifit Jr, in fact it's used extensively in industrial systems as well. The fundamental issue is that they are simply pushing this specific connector to close to it's design limit with inadequate margin for failure.
Why would the PC industry move away from Minifit Jr? It literally hasn't had a problem with them for over 2 decades except for this one singular iteration. Don't throw the baby out with the bathwater.
Re: (Score:2)
There's nothing "cheap crap" about the Minifit Jr
Everything about it is cheap crap. The pins are particularly shitty. I've worked with many, many kinds of connectors, repinned them and so on, and Mini-Fit Jr is absolutely trash. It has poor contact design, period.
Why would the PC industry move away from Minifit Jr? It literally hasn't had a problem with them for over 2 decades
People have had problems with them being flaky for ages. I've had to replace pins on them several times. And when you do, what a fucking nightmare it is getting them out of the connector, even the official tool isn't good at it.
Re:Thermocouple? (Score:5, Interesting)
Not ideal. The problem is that NVIDIA doesn't decide how board partners lay out their device. They followed a standard. Now the standards body may have some soul-searching to do, they've already tried to address this problem once by mandating proper current available signalling and adjusting of the pins so that the connector has to be seated all the way but ultimately temperature sensor may not help much. Fundamentally there's still not enough margin for error so simply straining the connector can cause the metal contacts to not seat perfectly. And they need perfect contact for their max power rating.
The fundamental problem is they made the connector too damn compact. Bigger pins, better contact would have resolved this. If they had the space for a temp sensor in the middle they could have just opted for a larger connector in the first place. Cars don't have this problem because their connectors are *looks up technical term* fucking massive.
Really this whole 12VHWPR thing needs to just be relegated to the dustbin of history and something better developed.
Re:Thermocouple? (Score:5, Interesting)
Re: (Score:2)
Not an issue with the core but rather the connection to the board as you said. This isn't general for all 40series but varies between vendor. One of the primary issues there is the sheer mounting pressure that a modern heatsink applies. Depending on the design of the cooler it can apply insane stresses to the PCB causing it to bow significantly under the pressure. Many designs these days all but mandate a support clip on the back of the GPU core to apply pressure without bending the PCB. Those without it, e
Re: (Score:2)
The problem is that NVIDIA doesn't decide how board partners lay out their device.
The report is about Founder Edition cards, which are manufactured by NVIDIA itself, not by partners.
Re: (Score:2)
Not sure what your point is. Do you think partner manufacturers have magically less power demands on their 4090 cores? The issue is the design of the power connector which is *NOT* NVIDIA designed.
There's literally countless cases of cards with 12VHPWR connectors suffering from burnouts across all manufacturers. The fact that this story happens to be about a founders edition card is not relevant, and again to the parent's point: NVIDIA has a technical impossibility in placing a temperature sensor in the con
Re: (Score:3)
Probably easier to just design it with a connector rated for the wattage that could possibly be flowing through it.
Well, we would think so at least. And then we get two generations of insanely priced GPUs that melt their power connections.
Re: (Score:2)
Probably easier to just design it with a connector rated for the wattage that could possibly be flowing through it.
They do.
Problem, is the connector only conducts sufficiently at specified wattage with 16 asterisks after the claim.
There is no other PC standardized connector for this problem.
AMD-branded cards that stick with the 8-pin molex will be using 2-3 of them on RDNA4, depending on power level.
PCI-SIG needs to fire whatever dumpsterfire committee came up with 12VHPWR, and its bullshit "backwards compatible" fix 12V-2x6, and give us a connector that isn't a fucking fire hazard.
Re: (Score:2)
They already do this. The problem is fundamentally in the rating. The rating has no margin for error included in it, so you can pull this kind of wattage through it just fine as long as everything in your system is *perfect*. And by perfect I mean zero stress or strain on the connector. Plugged in once - virgins being tight isn't just a high school fratboy comment, and making complete contact. As soon as you have a connector that you reuse the spring of the socket weakens. Tilt it slightly when installing c
Re: Thermocouple? (Score:2)
GPU bug? (Score:4, Interesting)
Wild guessing GPU is pulling way too much energy from one set of pins. Otherwise if there was a weak connection or poor crimps it would likely be localized to one side not both.. pin 6 is toasty on both ends.
Re: (Score:2)
No need to guess. This will be the same thing as last time: Running an unbalanced multi-wire-per-voltage connector up to the pin-limits is a _very_ bad idea. It just takes one pin with a not so good connection and the whole thing melts. Sane people will restrict themselves to something like 50% per pin or even less.
Re: GPU bug? (Score:2)
First time? (Score:3)
Re: (Score:2)
Or more likely that $3000 price tag did not leave $30 for a connector that can reliable take the power.
Re: (Score:2)
You're suggesting they spend $30 for a quality connector when the CEO can buy a bigger yacht? What are you? A communist?!
Re: (Score:2)
Hahaha, no. I suggest seeing these $3000 Nvidia cards as what they are: badly made consumer-crap.
Not that I care. I found a few years ago that I am perfectly happy with FullHD. No issues with cards for that.
Re: (Score:2)
QA testing doesn't address user cases. Of note is that not all the cards are having issues. The connector design is incredibly finnicky when it gets to its limits. Simply trying to keep your cables neat in your PC often puts enough strain on the connectors for them to make imperfect contact. Not an issue when you're not pushing the connector to its limits.
Maybe 12V just isn't enough? (Score:3)
Why hasn't a 24V rail been added as standard? Or even 48V?
Re: (Score:3)
Re: Maybe 12V just isn't enough? (Score:2)
Re: (Score:2)
No we didn't. Literally every manufacturer offers either adapters or in some cases whole replacement cables if you have a modular PSU. My relatively ancient power supply has a perfectly modern 12v-6x2 (H++) cable coming from it.
Re: (Score:2)
Is stepping 24V down to the voltages required that much less efficient than stepping down 12V
I did a quick google search, and i don't immediately find much on the topic, but you seem to know more?
Re: (Score:2)
Yes it is a more difficult engineering challenge. Efficiency is achieved by either keeping a stable load or keeping the stepping small. GPUs are not a stable load drawing anywhere from a couple of watts to over 500watts. That is not an easy case to design for in a switching system. It's also why there are already so many regulators on the PCB. In many cases several operate in parallel under varying loads.
Re: (Score:2)
A100 GPUs use 48vdc in datacenters, apparently:
https://l4rz.net/running-nvidi... [l4rz.net]
And the GaN based buck converters are apparently much smaller in dimension than what we've been used to:
https://www.eenewseurope.com/e... [eenewseurope.com]
https://www.eetimes.com/how-ga... [eetimes.com]
So... yeah, I would like to see them update the standard to allow for 48v PSUs in consumer PCs.
Re: (Score:2)
A100 GPUs run under relatively constant load. That is a very different design case to design for than a consumer GPU. They also don't have a bargain basement design or compete on cost.
GPUs are already expensive enough without having to spend hundreds more on more advanced regulators like in the A100.
Re: (Score:2)
Because you're solving one problem while creating another. Specifically you've now added extra design constraints to efficiently stepping down the voltage to the 0.8-1.2V needed for the core generating more heat and requiring more power through the connector.
The issue isn't the voltage. The issue is 12VHPW is too small of a connector. They should have just stuck with the original connectors which had significantly larger pins. The MiniFit Jr had a 13.5A current limit per pin. For the purposes of PCs they dr
Maybe somebody should first explain why... (Score:2)
Re:Maybe somebody should first explain why... (Score:4, Informative)
Maybe somebody should first explain why we can safely push 1800 watts
Maybe because 1800 Watts at 120 Volts is 15 Amps. 600 Watts at 12 Volts* is 50 Amps.
*Just guessing about the RTX power requirements based on what a typical PS provides.
Re: (Score:2)
And why "amps" is a problem is because the power dissipated in a wire (or connector) increases w/ the square of current, whereas it's only linear in voltage.
power = I^2 * R
So if you have a connector with say 0.0001 ohms (which is a random number and pretty high for a connector based on the math below) ... it is dissipating:
in the 50A@12V -> 50 * 50 *12 * 0.0001 = 3W
vs 15A@120V -> ---> = 2.7W
Huh. that's closer than i would've thought
Re: (Score:2)
Re: (Score:2)
That's not how loss in a section of conductor works. First, the supply voltage doesn't contribute to the loss, the voltage drop across the conductor section does. So:
50A * 0.0001 = 0.005 V
15A * 0.0001 = 0.0015 V
And the power dissipation for each case is V*I:
50 A * 0.005 V = 0.25 Watts
15 A * 0.0015 V = 0.0225 Watts
Quite a difference. It may seem like a small amount of power in each case, but it is dissipated in a very small volume of metal. So it gets hot.
Re: (Score:2)
yow. sorry about that. I have NO idea why i multiplied by voltage when plugging the numbers in my previous post.. :facepalm: it's been way too long. Thanks for pointing it out.
i^2 * r just combines the two steps into one to give the power dissipation. (it comes from plugging v=IR into P = IV )
50A * 50A * 0.0001 ohm = 0.25W
15A * 15A * 0.0001 ohm = 0.0225W
as you pointed out.
Re: (Score:2)
0.0001 ohms seems pretty low actually.
The actual connected used here is rated at a maximum of 0.005ohms per pin, with 6 pins for power and 6 for ground, that is 0.005/6 + 0.005/6 = 0.0017 ohms.
Pushing 50A through that is 1.4W of heat in the connector alone.
They're also only rated at 50 mating cycles before the resistance goes up by another 0.005R per pin. That'll double the heat
Another 5mR for thermal aging: 105C for 240 hours
Humidity rating adds another 5mR
The connector is rated for 600W,, but at that powe
Re: (Score:2)
Re: (Score:2)
time for a hardware manufacturer to put an aux power plug on a card!
Re: (Score:1)
Re: (Score:2)
Ohms law. You can’t escape it.
Re: (Score:2)
"I fought the law and the law won"
Re: (Score:2)
Re: (Score:2)
Simple: That standard outlet and plug has to conform to requirements that mandate generous reserves. Connectors that can safely do 1000W at 12V exist. They just cost money.
Re: (Score:2)
To further that it's worth noting that the original ATX12v connector is a derated minifit-Jr connector which then has an ADDITIONAL 1.9x safety factor applied. On the other hand the 12VHPWR connector is an originally rated microfit-JR connector with only an abysmal 1.1x safety factor. The standards committee really fucked up. There's just not enough room for error here on a connector design where the pins aren't encapsulated (i.e. strain on the cable causes torque on the mating surfaces)
Re: (Score:2)
Indeed. A 1.1x "safety" factor with no derating is a joke and an accident waiting to happen. If even only the most tiny problem manifests, things will blow up. If the connector just exactly meets specs, things will already very likely blow up. They probably had no experienced EEs on that standards committee or they kept silent. Sounds like they looked at the absolute maximum ratings on the data-sheet and decided to go with those numbers, and nobody knew what derating even means.
Somebody did cheaper-than-pos
Meanwhile... (Score:1)
Move to 48 volts already (Score:2)
Time to modernize the standard and bump 12 volts up to 48. It doesn’t look like GPU power usage is going to decrease any time soon and going to 48 volts will drop the current to 1/4 of what it was. The cards already have built in power supplies to drop the 12 volts down.
Re: (Score:2)
But then people would have to throw away their old PSUs and these loads are just temporary insanity for a small customer segment anyways. They simply should have used good connectors and put in a reasonable safety margin.
Re: (Score:2)
Why throw the baby out with the bath water simply because someone designed a poor connector? 48V places a very expensive burden on regulator design. GPUs are expensive enough as they are. We don't have the $3-5k to fork out for a data-centre grade card where that kind of voltage regulation is common.
And you want the people who brought you this shitty connector to redesign the entire power system? No thanks. Let's deal with the actual problem - fire a bunch of people at the standards committee and get a norm
My 7900XTX Has 3 PCIe From Power Supply (Score:2)
And has been running 24/7 for over a year with no burnt cables. It is a Seasonic 1000w unit purchased when I bought the video card.
How can they fuck up a standard connector? We all know how much current you can pump through a certain gauge of wire.
I am so glad I went with AMD this last time around on my build.
Re: (Score:2)
We all know how much current you can pump through a certain gauge of wire.
It's that kind of knowledge that caused the problem. People think of the wire. They think of the connector pins. What they don't think of is safety factor (room for error), loading factor (more hot pins in close proximity requires additional derating), they don't think of an unencapsulated connector design (where strain on the cable causes torque on connector mating surfaces) or that people will re-plug cables (which causes problems over time).
We all know how much current these connectors are capable of. Bu
I don't understand the design (Score:2)
They appear to be using connectors with little or no safety margin, where everything must be perfect or they overheat. It seems so simple to use more pins, bigger pins or some other strategy to make them more reliable
Re: (Score:2)
Last time that was the case. Running a "max 8A" connector at 8A. With the additional problem that this is a multi-pin design and no balancing on the power lines, hence anybody smart would never go up to the max in the first place. And then you have people that are not very competent plug these in. I would not feel comfortable with more than half the rated current per pin under these conditions.
As to why do they not use better connectors? Simple: greed. I mean, they would make less in profits if they put in
Start including an external AC connection. (Score:2)
No personal computer should need a 1000w supply. Or even a 500w supply.
Especially when just playing a game, for which modern GPUs shouldn't be at max power dissipati
Time for external power (Score:2)
Somebody has learned nothing, it seems (Score:2)
Using a connector again that already caused bad issues is not smart. Sure, most or all of these problems will be because some people have reached the level of incapability needed to be to incompetent to plug-in a connector. You still have to idiot-proof that connection, unless you start only selling to EEs with proven credentials.
Re: (Score:2)
Using a connector again that already caused bad issues is not smart.
It's not the same connector. The connector design was changed in late 2023 in hopes to address the original melting problem - and it largely did. The problem is the same morons at the standard committee involved in the first one seemingly were involved in this one too.
Uh oh (Score:2)
Heard this before (Score:1)
Re: (Score:2)
Yes but the connector has changed since then. They redesigned the pins. What colloquially is called 12VHPWR can refer to either the original connector or the one with a 12V-6x2 (H++) connector on the cable. The issue here appears to be the newer one is also melting.
It's not user error (Score:2)
Well what did you expect (Score:2)