Cray Unveils Its First GPU Supercomputer 76
An anonymous reader writes "Supercomputer giant Cray has lifted the lid on its first GPU offering, bringing it into the realm of top supers like the Chinese Tianhe-1A"
The machine consists of racks of blades, each with eight GPU and CPU pairs (that can even be installed into older machines). It looks like Cray delayed the release of hardware using GPUs to work on a higher level programming environment than is available from other vendors.
Which following the pattern of other articles... (Score:5, Funny)
...will promptly be used for mining BitCoins.
Re: (Score:2)
But is it compatible with Duke Nukem Forever?
Sadly, the machine I casemodded in full Duke regalia in anticipation of DNF back in 1997 is wholly incapable of running the game, and since it's AT form factor it ain't gettin' upgraded...
Re: (Score:3)
Now consider that I'd prefer to have at least a thousand cells per side in all 3 dimensions, which makes the problem ten thousand times larger, preferably several thousa
Imagine (Score:2)
Re: (Score:3)
Re: (Score:2)
A Beowulf cluster of Beowulf clusters is not a Beowulf cluster, it's a multidimensional Beowulf cluster.
Likewise, a BOINC of Beowulf clusters, or a "jagged Beowulf cluster", is not just a Beowulf cluster.
Re: (Score:2)
You'd want to make a MOSIX cluster of Beowulf clusters, so as to allow for each cluster to appear as a node without any conflicts. To make it 3D, you'd use a Kerrighed cluster of MOSIX clusters of Beowulf clusters.
Re: (Score:2)
All your cluster grits are belong to us
Re: (Score:2)
All your cluster grits are belong to Cowboy Neal, you insensitive clod!
Re: (Score:2)
Damn!
Re: (Score:3)
...can it run Metro 2033 on High?
My single GTX580 can.
Re: (Score:2)
Kraken Cray XT5 (Score:1, Interesting)
Re:Kraken Cray XT5 (Score:5, Informative)
Uh, no you couldn't. The rate of bitcoin creation is fixed (it's about 50 BTCs / 10 mins, for now). If you add more computational time the system will adjust and it'd become proportionally harder to generate them, so the global rate would keep stable.
So despite the 100 thousand-fold increase in mining difficulty in the past 15 months, the network continuously self-adjusts itself to issue one block of Bitcoins about every 10 minutes. The difficulty increase is entirely caused by users competing between themselves to acquire these blocks.
http://blog.zorinaq.com/?e=49 [zorinaq.com]
Re: (Score:2)
1511.61 is ~21% of the 7200 daily BTCs, so you'd need around that in computing power.
According to the NICS page [tennessee.edu], the Kraken has a peak performance of 1.17 PetaFLOP.
According to bitcoin watch [bitcoinwatch.com], the network has now a performance of 46.4 PetaFLOP.
Now, I'm no mathematician, but it seems to me that 1.17 is far from 21% of 46.4; according to my calculator, it's in fact little over 2.5%.
Re: (Score:1)
Re: (Score:2)
Last I checked, the entire bitcoin network had a mining strength of 1,747 Ghash/s
Then you're out of date, it's 3653 Ghash/s [bitcoinwatch.com] now.
Re: (Score:1)
Total network hashing: 1,747 Ghash/sec
Either the network strength has significantly increased in the past week, or one of those two sites shouldn't be trusted. Your source looks more reliable.
Re: (Score:2)
Re: (Score:2)
You are ignoring that it costs more to run a supercomputer than it could generate in bitcoins.
Re: (Score:2)
Wow. So that's why I left Bitcoin on for four days straight and didn't mine a single coin.
Explain to me again why anyone is going to be running background Bitcoin processes in 2015?
Re: (Score:1)
Re: (Score:2)
You need to join a pool. Acting alone, and at the current rate processing power is being added with all the pump and dump slashspam, you likely won't win a 50 coin fabulous prize if you left your computer running for four years.
Re: (Score:1)
If the apparent value of bitcoins is far higher than the cost of electricity needed to generate them, people will still run clients.
Re: (Score:1)
The Kraken reportedly consumes about 2.8 megawatts of power, so assuming your figures are accurate, the power alone would cost about $6,720/day (at $0.10/kWh) for a "profit" of $1730/day. Factor in the fact that it's a $30 million machine with a very short usable lifespan (i.e. massive depreciation), and they'd be losing a ridiculous amount of money.
Re: (Score:2)
Re: (Score:2)
Occam is higher-level than C or Fortran, and it should be possible to adapt Erlang to parallelize across a cluster.
Re: (Score:1)
You're forgetting things like PGAS and other higher-level parallel programming models. MPI is the dominant technology in use so these machines have to support it well. But they also support more future-looking tools.
Re: (Score:3, Interesting)
Really, you've tried it and it made you want to jump out of a window? OpenMP is an extremely simple, easy to use add-on to the C language. It is one of the two current standards used for parallelized scientific computing, and although it will eventually be succeeded by a language with more features, it will be difficult for its successor to match its ease and workmanlike grace.
I honestly have trouble believing someone could have much difficulty with it. If you want to have the work in a "for" loop paralleli
Re: (Score:2)
MPI != OpenMP
HTH.
Re: (Score:2)
Maybe so, but the comment to which you replied, and with which you disagreed, was specifically about "using a message passing library." That's MPI, not OpenMP. It's like responding to someone saying, "I don't like spam!" with "But grilled cheese sandwiches are so much tastier when you put ham on them, so clearly you're wrong!" Your statement may be technically correct, but as a response to the topic at hand, it is in error. :-)
Re: (Score:2)
What did you find awful about UPC? I've foudn it very pleasant to work with.
Re: (Score:2)
Re: (Score:1)
Yep. Works great!
Re: (Score:2)
Have you tried it off-node?
Yes, and it works just fine; the only issues would be if I got my placement wrong via poor layout/blocking or I neglected to upc_memget something that I needed intense access to but for some reason couldn't make local in the initial layout. Neither of those require much forethought at all to avoid.
Re:"High level" programming environment? Sigh. (Score:4, Insightful)
The point is not for the job to be easy for your lazy ass, the point is for the code to execute as quickly as possible.
Re: (Score:2)
Re: (Score:1)
There's more information on the GPU programming model in the HPCWire article. It is OpenMP directive-based, making it quite a bit easier to use than low-level CUDA and other such things.
Re:Chinese computer dick waving (Score:5, Insightful)
You would be right except that their are applications that do require the performance.
You can never have too much computing power for some applications like climate modeling. So what is your point?
Re: (Score:2)
Nothing but a complete moneypit if we have no actual experiments to run on them that require that kind of scale
And we have plenty, the big ones off the top of my head being nuclear weapons work (as we've replaced live tests with computer simulations entirely), protein folding, climate modelling, and signals intelligence processing. I'm sure other ./ers without your childishly narrow experience of the world can think of others.
Will it support Fortran? (Score:4, Interesting)
There is still a lot of HPC applications written in Fortran with this run them?
Also how hard if any of a porting will be needed to get good results from this.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
So I guess the second part of the question is. Have the HPC libraries been ported yet. I have heard one of the big reasons that Fortran is still so popular is the large library of highly optimized HPC libraries. The other reason is that Fortran is supposed to be really easy to optimize which I can believe.
Into the Realm? (Score:4, Informative)
bringing it into the realm of top supers like the Chinese Tianhe-1A
Uh, Cray already has machines in service that blow Tianhe-1A out of the water on real science. Tianhe-1A doesn't even exist anymore. It was a publicity stunt. Cray is already making the top supers. It's others that have to catch up.
Re: (Score:2)
ANL's Mira is going to be roughly half as fast as LLNL's Sequoia.
Re: (Score:1)
Jaguar.
As others have noted, IBM also makes machines that far surpass Tianhe-1A on real work. So does SGI.
Tianhe-1A was interesting for its intended purpose (Top-500) but it's a long way from being a productive tool.
My God... (Score:2)
It's made of cores!