Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Intel Hardware

45 Teraflops: Intel Unveils Details of Its 100-Billion Transistor AI Chip (siliconangle.com) 16

At its annual Architecture Day semiconductor event Thursday, Intel revealed new details about its powerful Ponte Vecchio chip for data centers, reports SiliconANGLE: Intel is looking to take on Nvidia Corp. in the AI silicon market with Ponte Vecchio, which the company describes as its most complex system-on-chip or SOC to date. Ponte Vecchio features some 100 billion transistors, nearly twice as many as Nvidia's flagship A100 data center graphics processing unit. The chip's 100 billion transistors are divided among no fewer than 47 individual processing modules made using five different manufacturing processes. Normally, an SOC's processing modules are arranged side by side in a flat two-dimensional design. Ponte Vecchio, however, stacks the modules on one another in a vertical, three-dimensional structure created using Intel's Foveros technology.

The bulk of Ponte Vecchio's processing power comes from a set of modules aptly called the Compute Tiles. Each Compute Tile has eight Xe cores, GPU cores specifically optimized to run AI workloads. Every Xe core, in turn, consists of eight vector engines and eight matrix engines, processing modules specifically built to run the narrow set of mathematical operations that AI models use to turn data into insights... Intel shared early performance data about the chip in conjunction with the release of the technical details. According to the company, early Ponte Vecchio silicon has demonstrated performance of more than 45 teraflops, or about 45 trillion operations per second.

The article adds that it achieved those speeds while processing 32-bit single-precision floating-point values floating point values — and that at least one customer has already signed up to use Ponte Vecchio. The Argonne National Laboratory will include Ponte Vecchio chips in its upcoming $500 million Aurora supercomputer. Aurora will provide one exaflop of performance when it becomes fully operational, the equivalent of a quintillion calculations per second.
This discussion has been archived. No new comments can be posted.

45 Teraflops: Intel Unveils Details of Its 100-Billion Transistor AI Chip

Comments Filter:
  • by etash ( 1907284 ) on Monday August 23, 2021 @05:16AM (#61719811)
    is it 45 normal teraflops? or "AI" operations per second? According to this: https://www.nvidia.com/content... [nvidia.com] the A100 can do 19.5 32bit teraflops, but 156 tera TENSOR flops in 32bit. So not sure what does Intel mean.
    • by Entrope ( 68843 ) on Monday August 23, 2021 @05:49AM (#61719847) Homepage

      Apparently the 45 teraflops is comparable to the 19.5 teraflops number, counting a generic multiply-accumulate as two operations. https://www.igamesnews.com/pc/... [igamesnews.com] quotes Intel as claiming a petaflop for one Pointe Vecchio module when using matrix/tensor units.

  • As all sentient being are forced to work to survive in this society.
  • by pablo_max ( 626328 ) on Monday August 23, 2021 @06:59AM (#61719947)

    :)

  • This is the really interesting question, i.e. have Tesla as a total newcomer in the area managed to upstage Intel? Each Tesla D1 chip is supposed to deliver 22.6 TF of FP32 operations, which would be pretty much 50% of what Intel is claiming here, but the real question is how much power does it take to do so, and how many can you pack together and keep cooled. There is of course also the question of when it will become available in volume.

    Terje

    • Comparing an Intel announcement to an announcement by Tesla is like comparing the Ford electric truck to the imaginary Tesla Cybertruck
      • Comparing an Intel announcement to an announcement by Tesla is like comparing the Ford electric truck to the imaginary Tesla Cybertruck

        You realize Ford's electric truck is a paper launch, right? It doesn't exist as anything other than a prototype either, same as Tesla's Cybertruck. Those two are exactly comparable.

        It remains to be seen how well Aurora stacks up against Dojo. There are differences in both chips and the networking. Both use proprietary transports for off-board communication. Tesla uses something they didn't even bother to name, but its throughput is outrageous. Aurora uses Slingshot [anl.gov], a proprietary networking transport

        • You realize Ford's electric truck is a paper launch, right? It doesn't exist as anything other than a prototype either, same as Tesla's Cybertruck. Those two are exactly comparable.

          True, I was referrring to the track record of Tesla making announcements which are never realized, from unattended cross country to solar tiles to Semi Trailers which defy the laws of physics etc. I don't think Ford is in the same class.

      • No, really it's like comparing one notorious liar that actually delivered stuff, ever if not as great as claimed,
        to its younger, more extreme brother, who's always exaggerating so much, that it's become a running joke.

  • by Papaspud ( 2562773 ) on Monday August 23, 2021 @08:08AM (#61720049)
    I remember doing a report in high school about the new ICs... that had 1000's of transistors on a chip the size of a postage stamp. 100 BILLION!!!
  • by Z80a ( 971949 )

    Can it be trained to run Crysis?

Don't tell me how hard you work. Tell me how much you get done. -- James J. Ling

Working...