Become a fan of Slashdot on Facebook

Intel Shows 48-Core x86 Processor 366

Posted by timothy on Wednesday December 02, 2009 @04:26PM from the soon-will-be-in-calculators dept.

Vigile writes "Intel unveiled a completely new processor design today the company is dubbing the 'Single-chip Cloud Computer' (but was previously codenamed Bangalore). Justin Rattner, the company's CTO, discussed the new product at a press event in Santa Clara and revealed some interesting information about the goals and design of the new CPU. While terascale processing has been discussed for some time, this new CPU is the first to integrate full IA x86 cores rather than simple floating point units. The 48 cores are set 2 to a 'tile' and each tile communicates with others via a 2D mesh network capable of 256 GB/s rather than a large cache structure. "

This discussion has been archived. No new comments can be posted.

Intel Shows 48-Core x86 Processor

Load All Comments

Search 366 Comments Log In/Create an Account

Comments Filter:

Meh. I'm holding out for a kilocore. (Score:2)

by jeffb (2.718) ( 1189693 ) writes:

...or perhaps a megacore?
- 48 is sufficient for most Ph.D. dissertations. (Score:5, Interesting)
  
  by reporter ( 666905 ) writes: on Wednesday December 02, 2009 @04:53PM (#30303436) Homepage
  
  A big market for this chip is the computer-science department of 2nd-tier universities like the University of California-Santa Barbara (UCSB).
  Unlike Stanford University, UCSB lacks the money to build a full-blown multiprocessor system. If UCSB had such a system back in the 1990s, then UCSB would likely have produced as much multiprocessor research as Stanford University.
  This 48-core processor chip, due to the fact that it will eventually be a commercial product mass-produced by the millions of units, will be economically cheap. This chip will enable UCSB to build or buy a cheap multiprocessor system.
  A bunch of graduate students is already salivating at the prospect. They are drooling.
  
  Parent Share
  twitter facebook
  - Rather than Larrabee, Intel should've focus on CPU (Score:2)
    
    by Taco Cowboy ( 5327 ) writes:
    
    Intel ought to focus. They need to focus more on CPU rather than Larrabee, which is an obvious mistake.
  - Re:48 is sufficient for most Ph.D. dissertations. (Score:4, Informative)
    
    by kharchenko ( 303729 ) writes: on Wednesday December 02, 2009 @08:19PM (#30306242)
    
    >If UCSB had such a system back in the 1990s, then UCSB would likely have produced as much multiprocessor research as Stanford University
    Actually, UCSB had exactly such a system in the 90's, called Meiko: "The Department of Computer Science at UCSB purchased a 64-processor CS-2 in June 1994." [ucsb.edu]
    
    Parent Share
    twitter facebook
  - Re:48 is sufficient for most Ph.D. dissertations. (Score:5, Funny)
    
    by ceoyoyo ( 59147 ) writes: on Wednesday December 02, 2009 @10:22PM (#30307080)
    
    Word gets pretty slow when you hit a hundred pages with figures on a Core Duo, but you could always just use LaTeX or a file per chapter. I managed to get my dissertation done with just two cores and my parents managed with a typewriter (although those were masters, not PhDs).
    
    Parent Share
    twitter facebook
    - Re: (Score:3, Interesting)
      
      by cerberusss ( 660701 ) writes:
      
      That's pretty funny.
      Made me think about how I created beautiful reports, using LaTeX, on a simple 100 MHz Pentium machine running Slackware Linux. Now there's Office 2010 coming up, and I'm not sure what the system requirements are, but I'm pretty sure it doesn't do ligatures [wikipedia.org].
      (Ligatures: when you write "finally", the dot on the i looks funny next to the top of the f, thus LaTeX creates one specially designed character, a ligature, just to make it look good.)
- Re:Meh. I'm holding out for a kilocore. (Score:5, Funny)
  
  by Curate ( 783077 ) writes: <craigbarkhouse@outlook.com> on Wednesday December 02, 2009 @07:08PM (#30305572)
  
  I think it's more likely we'll see kibicores and mebicores.
  
  Parent Share
  twitter facebook
- - Re: (Score:2, Funny)
    
    by stakovahflow ( 1660677 ) writes:
    
    Manticore. Mmm. Manticore... Jessica Alba? I'll cast my vote for manticore any day of the week with Jessica Alba in there... [Dark Angel (Comic/Show) references? Yes, I went there... I'll do it again, too!] Personally, though, I think 48 cores in one proc are enough to float my boat... Then, too, so could Ms. Alba... --Stak
Advantages over just adding more FPUs? (Score:2)

by Hadlock ( 143607 ) writes:

Can someone elaborate on why you'd want 48 full processors, rather than a processor with two (dual) or four (quad) "cores" (I'm presuming core in this case == FPU in the article). Supposedly Win7's SMP support becomes much more effective at the 12-16 core thresehold.
- Re: (Score:2, Funny)
  
  by Anonymous Coward writes:
  
  To enable system administrators to say "Fuck it, we'll go to one blade!"
- Re: (Score:2)
  
  by h4rr4r ( 612664 ) writes:
  
  For a server.
  Probably not running windows, as linux and other *n.x type OSes support monstrous amounts of CPUs already.
- Idle benchmarks (Score:5, Insightful)
  
  by Colin Smith ( 2679 ) writes: on Wednesday December 02, 2009 @04:40PM (#30303154)
  
  With 48 processors you can have your system 98% idle running your typical application at full speed rather than just 50% or 75% idle as is the norm now.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by h4rr4r ( 612664 ) writes:
    
    Please tell me where I can find boxes that would run 50% idle for my use. My company would pay handsomely for such CPUs. Current Quad Xeons fail to do this.
  - Re: (Score:3)
    
    by olsmeister ( 1488789 ) writes:
    
    So would this have saved that guy's ass who spent $1M in electricity running SETI@Home on the school's computers?
- Re: (Score:3, Insightful)
  
  by Locke2005 ( 849178 ) writes:
  
  Current memory architecture has trouble keeping data fed to just 2 CPUs; unless each of the 48 cores has it's own dedicated cache and memory bus, this is a pretty useless design.
  - Re: (Score:2)
    
    by V!NCENT ( 1105021 ) writes:
    
    Yes. Serial RAM acces. Damn. When are people going to realise that RAM, which has a lot of, what are they called, banks?, hysically seperated from each other, could be made paralell?
    - - Re: (Score:2)
        
        by V!NCENT ( 1105021 ) writes:
        
        So why not multiple busses to the CPU?
        
        Re:Advantages over just adding more FPUs? (Score:4, Informative)
        
        by eabrek ( 880144 ) writes: <eabrek@bigfoot.com> on Wednesday December 02, 2009 @05:17PM (#30303864)
        
        That's what each channel is. I forget exactly, but each DDR channel is almost 200+ pins (RDRAM was considered a big win because it is about 80). And pins == money (mainly in die area).
        
        Parent Share
        twitter facebook
        
        Re: (Score:3, Informative)
        
        by eabrek ( 880144 ) writes:
        
        Multiple channels and overlapped memory access? The hardware does it automatically. No need to program anything different (well, I guess there is BIOS code somewhere that configures all the channels and bank information - but most people shouldn't see that).
        Now, programming a 48 core FPU monster? That is a much harder problem!
        
        Re:Advantages over just adding more FPUs? (Score:4, Interesting)
        
        by Locke2005 ( 849178 ) writes: on Wednesday December 02, 2009 @05:48PM (#30304396)
        
        No need to program anything different Actually, I believe performance can be improved by pre-fetching data into cache ahead of time, so you know it will be there when you need it. I was doing this in software on MIPS to improve Linpack performance; I suspect you can do much the same with Intel processors as well.
        
        GPUs are using 256 bit wide data paths now to improve data throughput; I think it is only a matter of time until the memory bus is a whole cache line (256 bits?) in width, enabling read/writing of entire cache lines in a single operation. Seems simple to me, but your pin count and power usage go up, as well as the number of separate DRAM chips you need for a wider memory bus.
        
        Parent Share
        twitter facebook
        
        Re:Advantages over just adding more FPUs? (Score:4, Interesting)
        
        by Avtuunaaja ( 1249076 ) writes: on Wednesday December 02, 2009 @09:59PM (#30306962)
        
        A cache line on a modern Intel/AMD processor is actually 512 bits, or 64 bytes.
        A memory bus 512 bits wide wouldn't really help much, though -- right now when dealing with memory, most of the time is spent in the various latencies. When you are fetching a lot of memory sequentially, you can get insane speeds even today. But that's not how you usually read memory -- instead, you read a few words from different locations, and the memory controller needs to activate the correct bank, row and column before you get what you need. On typical PC-10600 DDR3, that means at least 15 bus cycles just waiting around for the memory to adjust. Making the bus 512 bits wide would speed up the actual transfer to one bus cycle from the 4 what it takes currently, but that would only mean an improvement of about 15% -- at a huge cost for having to accommodate those 384 extra data lines on the chip, socket, motherboard and ram. It's better just to try to speed up the memory so burst transfers happen "fast enough".
        I don't know about nvidia cards, but at least for ati the card doesn't actually have a 256 bit memory interface -- instead, it has 4 completely separate 64-bit memory channels connected to a fast ring bus. The interleaving of data on those separate memory channels is done very coarsely -- basically, entire textures and such are allocated on a single channel. This means that when that texture is being fetched, the 3 other channels can serve other requests.
        This is the way I see cpu's evolve too -- even on current hardware, namely phenom 2, you get better performance when you ungang the memory channels, and wait 8 cycles for a single memory transfer instead of 4, because that way you get to wait on separate latencies on the separate channels at the same time. Of course, in the perverse case all the data you want to access resides on one of the channels, but the chance of that happening by accident is pretty much nil.
        
        Parent Share
        twitter facebook
  - Re:Advantages over just adding more FPUs? (Score:4, Informative)
    
    by TheRaven64 ( 641858 ) writes: on Wednesday December 02, 2009 @06:32PM (#30305074) Journal
    
    Processors access memory via a cache. When you load a word from memory to a register, it is loaded from cache. If it is not already in cache, then you get a cache miss, the pipeline stalls (and runs another context on SMT chips), and the memory controller fetches a cache line of data from memory. Cache lines are typically around 128 bytes. Modern memory is typically connected via channel that is 64 bits wide. That means that it takes 16 reads to fill a cache line. If you have your memory arranged in matched pairs of modules then it can fill it in 8 pairs of reads instead, which takes half as long.
    On any vaguely recent non-Intel chip (including workstation and server chips for most architectures), you have a memory controller on die for each chip (sometimes for each core). Each chip is connected to a separate set of memory. A simple example of this is a two-way Opteron. Each will have its own, private, memory. If you need to access memory attached to the other processor then it has to be forwarded over the HyperTransport link (a point-to-point message passing channel that AMD uses to run a cache coherency protocol). If your OS did a good job of scheduling, then all of the RAM allocated to a process will be on the RAM chips close to where the process is running.
    The reason Intel and Sun are pushing fully buffered DIMMs for their new chips is that FBDIMMs use a serial channel, rather than a parallel one, for connecting the memory to the memory controller. This means that you need fewer pins on the memory controller for connecting up a DIMM and so you can have several memory controllers on a single die without your chip turning into a porcupine. You probably wouldn't have 48 memory controllers on a 48-core chip, but you might have six, with every 8 cores sharing a level-3 cache and a memory controller.
    
    Parent Share
    twitter facebook
    - - Re:Advantages over just adding more FPUs? (Score:5, Informative)
        
        by afidel ( 530433 ) writes: on Wednesday December 02, 2009 @09:28PM (#30306752)
        
        The reason the i7 gains nothing going from double to triple channel memory is that the memory controller is power limited and so can only run at reduced clocking on triple channel configurations 800Mhz down from 1333Mhz. Of course for most workloads having 50% more data in RAM instead of glacially slow storage is a win =)
        
        Parent Share
        twitter facebook
  - Re:Advantages over just adding more FPUs? (Score:5, Informative)
    
    by maraist ( 68387 ) * writes: <michael.maraistN ... gmail.n0spam.com> on Wednesday December 02, 2009 @08:21PM (#30306254) Homepage
    
    What is worse is that theyve done away with cache coherence. So I dont think you can take a 48 thread mysql / java process and just scale it. You COULD use forked processes that don't share much. (ie postgres/apache/php).
    
    Parent Share
    twitter facebook
    - - Re:Advantages over just adding more FPUs? (Score:4, Informative)
        
        by Bengie ( 1121981 ) writes: on Thursday December 03, 2009 @12:03AM (#30307628)
        
        Cache coherency should be handled by the programmer, not by the hardware. Cache coherency protocols consume more bandwidth the more cores you get. The more cores you get, the more important that bandwidth becomes. At some point Cache coherency will become a bottleneck. We've been holding quite well to doubling transistor count every 18 months. If we suddenly go from strong single cores to somewhat weaker multi cores, not only will they pack more cores in for the same transistor count, but more transistors.
        Imagine, our 4 core cpus will be 8 core in ~18months, then 16 ~18 more month. Intel has hyper-threading and AMD has a similar thing, so now it's like 32 cores. So, in ~ 3 years, at our current rate, we could have 32 logical CPUs reporting for low-mid sub $1.5k computers
        
        Parent Share
        twitter facebook
- Re: (Score:2)
  
  by V!NCENT ( 1105021 ) writes:
  
  Yes. YES! Raytracing! And emulating a D3Dn card in software (Google: pixomatic) and run the latest game with acceptable framerates.
- Re:Advantages over just adding more FPUs? (Score:5, Insightful)
  
  by Yaztromo ( 655250 ) writes: on Wednesday December 02, 2009 @04:59PM (#30303528) Homepage Journal
  
  Can someone elaborate on why you'd want 48 full processors, rather than a processor with two (dual) or four (quad) "cores" (I'm presuming core in this case == FPU in the article).
  Bad assumption. In this case, we're talking about (what you would consider) a 48 core CPU. Previous designs would have apparently contained only a small number of full processing cores, and a large number of parallel units suitable only for floating point calculations (which can be great for various types of scientific calculations and simulations). This new design contains 48 discrete IA x86 cores.
  Seems like the type of processor Grand Central Dispatch [wikipedia.org] was designed for.
  Yaz.
  
  Parent Share
  twitter facebook
  - - Re: (Score:3, Interesting)
      
      by maraist ( 68387 ) * writes:
      
      Im not sure "separate computer" is accurate. you get copy on write performance benefits. So fork based processes would work like gang busters. Multi threading COULD work if they never modified common data pages. I'll just assume they have some kind of support for high performance mutexes.
  - - Re: (Score:3, Insightful)
      
      by smallfries ( 601545 ) writes:
      
      What's so bad about it?
      The worst thing about his assumption is that it is wrong. But that is sufficient to make it bad.
      Contemporary processors have many functional units, but they are only segregated into a small number of cores to minimize issues with inter-core communication, communication with memory, et cetera
      This is simplistic and wrong. It is true that fewer cores implies less inter-core communication, but this is not a design criteria for putting fewer cores in a system. While it is true that having
- Re: (Score:2)
  
  by RyuuzakiTetsuya ( 195424 ) writes:
  
  webserver on a high traffic site. Either serving up lots of db connections or a lot of http connections, either way, I can imagine this having specific uses.
- Re:Advantages over just adding more FPUs? (Score:5, Interesting)
  
  by vertinox ( 846076 ) writes: on Wednesday December 02, 2009 @05:34PM (#30304148)
  
  Can someone elaborate on why you'd want 48 full processors, rather than a processor with two (dual) or four (quad) "cores" (I'm presuming core in this case == FPU in the article). Supposedly Win7's SMP support becomes much more effective at the 12-16 core thresehold.
  The first thought comes to mind if video processing and CGI animations because those applications are embarrassingly parallel [wikipedia.org].
  And those companies usually have the money to spend on top of the line hardware.
  Eventually this will trickle down to consumer level as always and people at home can now do real time movie quality CGI on their home computers in 10 years.
  
  Parent Share
  twitter facebook
  - Re: (Score:3, Interesting)
    
    by Jeremy Erwin ( 2054 ) writes:
    
    Embarrassingly parallel is right. Cache coherency was sacrificed in order to up the number of cores, though I suppose a Beowulf on a chip is still useful for some things.
    - Re: (Score:3, Interesting)
      
      by Bengie ( 1121981 ) writes:
      
      I was recently reading an article about multi core designs and they said they'll have to drop cache coherency at some point soon and redesign locking a bit. Some other architectures don't use cache coherency to help with scaling, but that's not x86.
- NUMA vs SMP (Score:3, Interesting)
  
  by mario_grgic ( 515333 ) writes:
  
  In my experience Windows 7 64 bit is noticeably faster with NUMA configuration (Windows experience index is significantly higher because of improved memory throughput) and majority of application also run up to 10 % faster.
  I don't know if this is because of Nehalem Xeon CPUs having faster access to CPU local memory in NUMA configuration or if windows is also optimized for this?
Yet another cloud? (Score:5, Insightful)

by Mortiss ( 812218 ) writes: on Wednesday December 02, 2009 @04:36PM (#30303062)

Why is everything called cloud these days? Yet another du jour buzzword. Is this really justified here?

Share
twitter facebook
- Re:Yet another cloud? (Score:5, Insightful)
  
  by hibiki_r ( 649814 ) writes: on Wednesday December 02, 2009 @04:38PM (#30303114)
  
  When it comes to marketing cliches, when it rains, it pours.
  
  Parent Share
  twitter facebook
  - Re:Yet another cloud? (Score:5, Funny)
    
    by ArsonSmith ( 13997 ) writes: on Wednesday December 02, 2009 @04:51PM (#30303390) Journal
    
    Why can't it just be cloudy?
    sorry.
    
    Parent Share
    twitter facebook
    - Re:Yet another cloud? (Score:5, Funny)
      
      by RelliK ( 4466 ) writes: on Wednesday December 02, 2009 @05:31PM (#30304108)
      
      I don't have the foggiest idea.
      
      Parent Share
      twitter facebook
      - Re: (Score:3, Funny)
        
        by lewiscr ( 3314 ) writes:
        
        I can't wait to run Drizzle on it.
        
        Re: (Score:3, Funny)
        
        by TheLink ( 130905 ) writes:
        
        Cirrusly?
- Re: (Score:3, Interesting)
  
  by Lord Ender ( 156273 ) writes:
  
  The term "cloud" is over-used, but a 48-core chip is certainly a good match for anyone who uses virtualization, and cloud-style data services are absolutely big users of virtualization.
  Cloud computing is certainly a big deal. I recently explained to my boss that instead of spending weeks going through tickets, bureaucracy, approvals, and procurement to get a server in our own datacenter, we could go to Amazon, type the credit card number, and be up-and-running with a few clicks!
  I don't know if he understood
- Re: (Score:2)
  
  by MobileTatsu-NJG ( 946591 ) writes:
  
  Why is everything called cloud these days? Yet another du jour buzzword. Is this really justified here?
  Given that making effective use of these cores would call for engineering code to work with any number of cores, as opposed to just 2, 4, or 8, then yes it is semi-justified, especially if aimed at the server market. I do say 'semi', though, because I partially agree with you about its silliness.
- Re: (Score:2)
  
  by V!NCENT ( 1105021 ) writes:
  
  http://en.wikipedia.org/wiki/File:Cloud_computing_types.svg [wikipedia.org]
  Now imagine you'd have this 'cloud CPU' as your server at home that runs apps that you could acces with Google Chrome OS... Great family server... Or remote X and play Doom3 at work from your netbook.
  Sounds interesting now? ;)
- Re: (Score:2)
  
  by hazydave ( 96747 ) writes:
  
  They're Intel... they have this buzzword department, and those kiddies have to make a living, too. Remember the Intel Pentium 4 "Netburst" architecture. Nothing whatsoever to do with nets, networking, the internet, etc.... other than the fact Intel Marketroids were trying to convince all the Mundanes (Muggles, to you kiddies) that this CPU would magically make their internet go faster. Yup, that's it.. not the fact you're on a frickin' POTS modem.
- Re: (Score:2)
  
  by zullnero ( 833754 ) writes:
  
  No, it's just that it's a hot keyword, and a whole lot of people can't be bothered to look up what it really means. And knowing Intel pretty well, their guys most likely know full well what it is, and they took the name as a taunt to anyone who would dare consider distributing workload instead of buying more server hardware and doing it the way that benefits Intel's bottom line.
Only 48? (Score:5, Funny)

by Kingrames ( 858416 ) writes: on Wednesday December 02, 2009 @04:38PM (#30303094)

Only 48 cores? I'd ask them to double that, but reasonably, 64 cores should be enough for anybody.

Share
twitter facebook
- Re: (Score:2)
  
  by Locke2005 ( 849178 ) writes:
  
  You do know why Asynchronous transfer mode uses 48 byte packets don't you? The advocates of 32 byte and of 64 byte packets could not reach agreement, so they compromised. Perhaps the Intel designers reached a similar accomadation. (As a software engineer, I too am frequently puzzled when hardware engineers do things that are not powers of 2, e.g. the triple channel memory that Intel's socket 1366 chips currently use, forcing you to by DDR RAM in multiples of 3.)
Obligatory (Score:2)

by cowtamer ( 311087 ) writes:

Imagine a Beowulf Cluster of These !!
- Obligatory "Fixed that for you" (Score:3, Funny)
  
  by powerlord ( 28156 ) writes:
  
  Imagine a Beowulf Cluster on one of These !!
  There, fixed that for you.
Great cost savings (Score:5, Funny)

by joeflies ( 529536 ) writes: on Wednesday December 02, 2009 @04:43PM (#30303216)

because now school administrators only have to install SETI@HOME on 100 48-core computers instead of 5000 standard computers.

Share
twitter facebook
- - Re: (Score:3, Funny)
    
    by HRbnjR ( 12398 ) writes:
    
    This is an Intel chip we are talking about here... you can just round off that result ;)
Synergy! (Score:5, Funny)

by HRbnjR ( 12398 ) writes: <chris@hubick.com> on Wednesday December 02, 2009 @04:49PM (#30303356) Homepage

This new Cloud processor should create synergies with my SOA Portal system and allow me to deploy Enterprise B2B Push based Web 2.0 technologies!

Share
twitter facebook
Is there enugh cpu to chipset bandwith to make use (Score:5, Interesting)

by Joe The Dragon ( 967727 ) writes: on Wednesday December 02, 2009 @04:53PM (#30303440)

Is there enough cpu to chipset bandwidth to make use of all this cpu power?

Share
twitter facebook
- Re: (Score:2)
  
  by V!NCENT ( 1105021 ) writes:
  
  If you need very little data per core but are executing sick calculations, then yes. But probably not anything realistic...
- Re:Is there enugh cpu to chipset bandwith to make (Score:4, Interesting)
  
  by Angst Badger ( 8636 ) writes: on Wednesday December 02, 2009 @06:34PM (#30305106)
  
  Is there enough cpu to chipset bandwidth to make use of all this cpu power?
  That's really going to depend on the intended use. And on whether the intended use involves problems that a) can be efficiently parallelized, and more importantly, b) actually have been efficiently parallelized. But unless each core gets its own memory bus and its own dedicated memory with its own cache, I rather expect that the only things that are going to be parallelized to their maximum potential are wait states. All that said, it will still probably run faster than a two- or four-core CPU for many tasks, but it won't be running 48 times faster. I would not, however, refuse a manufacturer's sample if one was handed to me. ;)
  On the positive side, if this beast actually makes it to market, it might help spur the development of new parallel software.
  
  Parent Share
  twitter facebook
Sun HAS a 64 thread processor: UltraSPARC T2 (Score:4, Informative)

by IYagami ( 136831 ) writes: on Wednesday December 02, 2009 @04:56PM (#30303474)

More info at:
http://www.sun.com/processors/UltraSPARC-T2/specs.xml [sun.com]

Share
twitter facebook
- Re: (Score:2)
  
  by RyuuzakiTetsuya ( 195424 ) writes:
  
  All intel has to do is re implement Hyper Threading in each core.
  48 cores = 96 threads, IIRC.
- Not the same thing (Score:4, Informative)
  
  by Sycraft-fu ( 314770 ) writes: on Wednesday December 02, 2009 @06:04PM (#30304678)
  
  Sun's processors are heavily multi-threaded per core. It is an 8 core CPU where each core can handle 8 threads in hardware. Intel's solution is 48 separate cores, doesn't say how many threads per core.
  The difference? Well lots of threads on one core leads to that core being well used. Ideally, you can have it such that all its execution units are always full, it is working to 100% capacity. However it leads to slower execution per thread, since the threads are sharing a core and competing for resources.
  Something like Sun's solution would be good for servers, if you have a lot of processes and you want to avoid the context switching penalty you get form going back and forth, but no process really uses all that much power. Web servers with lots of scripts and DB access and such would probably benefit from it quite a lot.
  However it wouldn't be so useful for a program that tosses out multiple threads to get more power. Like say you have a 3D rendering engine and it has 4 rendering threads. If all those threads got assigned to one core, well it would run little faster than a single thread running on that core. What you want is each thread on its own core to give you, ideally, a 4x speed increase over a single thread.
  So in general, with Intel's chips you see not a lot of thread per core. 1 and 2 are all they've had so far (P4s and Core i7s are 2 threads per core, Core 2s are 1 thread per core). They also have features such as the ability for a single core to boost its clock speed if the others are not being used much, to get more performance for one thread and still stay in the thermal spec. These are generally desktop or workstation oriented features. You aren't necessarily running many different apps that need power, you are running one or maybe two apps that need power.
  As for this, well I don't know what they are targeting, or how many threads/core it supports.
  
  Parent Share
  twitter facebook
Sounds like Sinclair's waffer scale intergration. (Score:2)

by LWATCDR ( 28044 ) writes:

It does sound a lot like it. Truth is that it is probably a lot more like the old Pentium D packages but still kind of interesting.
So how many Coretex A8 cores could you fit on one of these?
- - Re: (Score:3, Informative)
    
    by sznupi ( 719324 ) writes:
    
    The 48-core chip that Intel demonstrated is 45nm!
    Also, Cortex-A9: "For 2000 DMIPS of performance when designed in a TSMC 65 nanometer (nm) generic process the core logic costs less than 1.5 mm^2 of silicon." ( http://www.arm.com/products/CPUs/ARMCortex-A9SingleCore.html [arm.com] ) So it seems "up to 3 mm^2" in your quote really means "up to" (and for a much older core of course, when it was just launching 4 years ago)
    And Cortex-A9 "consumes less than 250mW per core"...
- Re:Code Name is Offensive (Score:5, Insightful)
  
  by eln ( 21727 ) writes: on Wednesday December 02, 2009 @04:32PM (#30302978)
  
  It was called Bangalore to remind you where to call if you need any support for it.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by Nadaka ( 224565 ) writes:
    
    I thought a bangalore was a man portable explosive, telescoping lance used to take out pill boxes in WW2?
    - Re:Code Name is Offensive (Score:5, Funny)
      
      by powerlord ( 28156 ) writes: on Wednesday December 02, 2009 @05:05PM (#30303638) Journal
      
      I thought a bangalore was a man portable explosive, telescoping lance used to take out pill boxes in WW2?
      That was an offshoot technology. They've finally got all the bugs ironed out and the CPU is much less prone to "uncontrolled exothermic reactions" then it use to be.
      
      Parent Share
      twitter facebook
  - - Re: (Score:2, Insightful)
      
      by Taco Cowboy ( 5327 ) writes:
      
      Oh please don't go over your head in this.
      India's tech field has improved, but not to the point of design such a chip yet !
      Without the West, India is still a big nothing !
      - Re: (Score:3, Insightful)
        
        by farlukar ( 225243 ) writes:
        
        Without the West, India is still a big nothing !
        And vice versa :p
      - Re: (Score:3, Informative)
        
        by TheRaven64 ( 641858 ) writes:
        
        Not sure why you think that. Intel's owes its current existence to their Israeli team, which was the only group producing working designs with a usable power envelope while the American design team was following the US automobile industry in concept. Most Intel products are codenamed based on a location near the design team. Several recent Intel chipsets have been designed in east Asia. Plugging 48 x86 cores onto a die, when you have access to Intel's designs, is not a particularly hard task compared to
        
        Re: (Score:3, Informative)
        
        by mrboyd ( 1211932 ) writes:
        
        Really? then wtf is that job offer on intel website for a CPU Architect in bangalore for?
        In this position, you will be responsible for architecting advanced client platforms for 2015 and beyond. We are now in the early research and pathfinding for the 2015 generation of CPU products. Our team engages in early architecture analysis, microarchitecture research and/or development, performance and/or power modeling and analysis, including detailed architecture validation versus RTL
        Here's what they do in Bangalore: http://www.intel.com/jobs/india/iidc/index.htm [intel.com]. Seems like some people in India have enough skills to design a CPU.
    - Re: (Score:2, Funny)
      
      by Sigilium ( 1611915 ) writes:
      
      It's like this: It's hot and loud and there's so many cores.
      
      As in The Big Bang Theory season 3 episode 4: "I don't want to go to India. It's hot and loud and there's so many people. You have no idea, they're everywhere."
- Re:Code Name is Offensive (Score:4, Funny)
  
  by MobileTatsu-NJG ( 946591 ) writes: on Wednesday December 02, 2009 @04:34PM (#30303040)
  
  Intel an American company, with the American economy in the shape it's in, I am offended at the codename Bangalore.
  As the last remaining operational Soong type android, I am offended by the name Bang-A-Lore.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by Tetsujin ( 103070 ) writes:
    
    Intel an American company, with the American economy in the shape it's in, I am offended at the codename Bangalore.
    As the last remaining operational Soong type android, I am offended by the name Bang-A-Lore.
    So you're B4, then?
    Well, I guess it was several years ago that you were known as B4... What's the name you're using these days... "Pryor", isn't it?
- Re: (Score:3, Funny)
  
  by Monkeedude1212 ( 1560403 ) writes:
  
  Does the fact that none* of the Apple Operating system names are of animals not native to America?
  *After 5.1, which is "Kodiak" - which can be found in Alaska.
  5.2 Mac OS X v10.0 "Cheetah"
  5.3 Mac OS X v10.1 "Puma"
  5.4 Mac OS X v10.2 "Jaguar"
  5.5 Mac OS X v10.3 "Panther"
  5.6 Mac OS X v10.4 "Tiger"
  5.7 Mac OS X v10.5 "Leopard"
  5.8 Mac OS X v10.6 "Snow Leopard
  - Re: (Score:2, Informative)
    
    by EdipisReks ( 770738 ) writes:
    
    Does the fact that none* of the Apple Operating system names are of animals not native to America? *After 5.1, which is "Kodiak" - which can be found in Alaska. 5.2 Mac OS X v10.0 "Cheetah" 5.3 Mac OS X v10.1 "Puma" 5.4 Mac OS X v10.2 "Jaguar" 5.5 Mac OS X v10.3 "Panther" 5.6 Mac OS X v10.4 "Tiger" 5.7 Mac OS X v10.5 "Leopard" 5.8 Mac OS X v10.6 "Snow Leopard
    there are pumas in the American west and in Florida, they are just called Mountain Lions or Cougars or Floida Panthers. same thing.
    - Re: (Score:2)
      
      by Monkeedude1212 ( 1560403 ) writes:
      
      I refuse to call a Mountain Lion a Puma.
      - Re: (Score:3, Funny)
        
        by snspdaarf ( 1314399 ) writes:
        
        Are you making up imaginary animals again?
      - Re: (Score:3, Funny)
        
        by clbyjack81 ( 597903 ) writes:
        
        I refuse to call a Mountain Lion a Puma.
        Just don't call a Warthog a Puma. Sarge doesn't like that.
    - Re: (Score:2, Redundant)
      
      by Grendel70 ( 1000350 ) writes:
      
      And last I checked, Alaska IS part of America.
  - - Re: (Score:3, Insightful)
      
      by powerlord ( 28156 ) writes:
      
      One could argue a Puma is essentially the same thing as a Cougar/Mountain Lion... Not much in the way of big cats native to the United States.
      True. We don't have many BIG cats in the U.S. ... just a lot of FAT cats (greater concentrations can be found in the vicinity of State Capitols and Washington D.C.).
- Re: (Score:2)
  
  by jcnnghm ( 538570 ) writes:
  
  How awful of them to use the name of San Fransisco's sister city, the "Silicon Valley" of India, as a product codename. Were you equally offended when Ibex Peak, Tylersburg, Alviso, Calistoga, Lakeport, Broadwater, Eaglelake, Crestline and Cantiga were used as codenames?
  You don't need to get your panties in a twist over this. Although it is worth mentioning that it makes you look like a racist when you assume that an innocuous naming decision is some form of racial bigotry or social commentary.
- - Re:Code Name is Offensive (Score:5, Funny)
    
    by Threni ( 635302 ) writes: on Wednesday December 02, 2009 @04:36PM (#30303052)
    
    > This post is copyrighted by Robert Nelson for the private use of his audience. Any other use of this post or of any pict
    Your sigfile is offensive. What have ye got against the Scots?
    
    Parent Share
    twitter facebook
    - Re: (Score:2, Funny)
      
      by sexconker ( 1179573 ) writes:
      
      What have ye got against the Scots?
      Damn Scots!
      They ruined Scotland!
      - Re: (Score:2)
        
        by zach_the_lizard ( 1317619 ) writes:
        
        Damn Scots! They ruined Scotland!
        Scotland? What is that? All I know of is Pictland.
- - Mummy? (Score:3, Funny)
    
    by Tetsujin ( 103070 ) writes:
    Insightful WTF? If you get offended that easily, you'd better:
    
    Not come out from your basement, lest you see something being worth upset over
    Go running to mummy so she can make it better
    Mummy?
    Are you my mummy?
    Mummm-myyy...
- Re: (Score:2)
  
  by eln ( 21727 ) writes:
  
  What's the big deal? You're going to need a single-chip cloud computer if you want to operate in the Semantic Web.
- Re:Windows 12 (Score:5, Interesting)
  
  by mikael ( 484 ) writes: on Wednesday December 02, 2009 @04:37PM (#30303072)
  
  Microsoft once had a podcast where they were talking about multi-core CPU kernels. Their belief was that once you had 50+ cores, you would be able to have a mutex for every single COM object element, simply because you could.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by poetmatt ( 793785 ) writes:
    
    my idea when you had enough was that each process thread can be on it's own processor, is the idea of a mutex for every com object similar? I don't really get 100% what a mutex is.
    - Re:Windows 12 (Score:5, Informative)
      
      by Rockoon ( 1252108 ) writes: on Wednesday December 02, 2009 @05:37PM (#30304204)
      
      A mutex (MUTually EXclusive) is a software methodology in which one thread or process can (usually temporarily) lock a resource (such as a memory location) so that another thread or process may not access it.
      
      It is most often required because resources are normally not 'atomic.' For instance, a string in memory is made up of many machine words and a CPU cannot read or write multiple machine word values in one operation. The danger is that while one CPU is writing to such a non-atomic collection of values, another might be trying to read from (or write to) it.. creating a situation where that second process reads part of the old data and part of the new data (essentially garbage data.)
      
      So the idea of a MUTEX is born, in which an atomic value is leveraged to allow a thread to reserve such resources, signaling others (if they respect the MUTEX as well) to wait their turn.
      
      Parent Share
      twitter facebook
    - - Re:Windows 12 (Score:4, Informative)
        
        by JWSmythe ( 446288 ) writes: <jwsmythe@NOsPAm.jwsmythe.com> on Wednesday December 02, 2009 @09:10PM (#30306622) Homepage Journal
        
        It doesn't matter much. The first sibling to grab key 1a is usually running for the car. Even if the other sibling grabbed key 1b, they'll be looking at an empty parking spot, complaining to mom. :)
        
        Parent Share
        twitter facebook
- Re: (Score:2)
  
  by somersault ( 912633 ) writes:
  
  Just imagine.. a Beowulf cluster^2!
- Re:Codenames (Score:4, Funny)
  
  by revlayle ( 964221 ) writes: on Wednesday December 02, 2009 @04:45PM (#30303284)
  
  Could you imagine a Beowulf cluster of Beowul.... *head explodes*
  
  Parent Share
  twitter facebook
  - Re: (Score:3, Funny)
    
    by Acapulco ( 1289274 ) writes:
    
    *head asplodes*
    
    There, fixed that for you.
- Re: (Score:2)
  
  by Knara ( 9377 ) writes:
  
  Intel has always had less than catchy code names, IMHO.
- Re:Codenames (Score:5, Informative)
  
  by azrael29a ( 1349629 ) writes: on Wednesday December 02, 2009 @05:20PM (#30303920)
  
  Why can companies not come up with decent code names. For instance, this would be the perfect case for it being codenamed "Beowulf".
  They're using geographical names (cities, places, lakes, rivers) to avoid having to register the codename as a trademark. Geographical names can't be trademarked so no one will use your codename for his trademark.
  
  Parent Share
  twitter facebook
- - - Re: (Score:3, Funny)
      
      by Chees0rz ( 1194661 ) writes:
      
      > They called it Bangalore because they are going to farm out your processes.
      To Maine..??
      (Oh, sorry, that's Bangor. My bad!)
      Uh, no. That's Bang-ah, Maine. Bangor is a Myth.
- Re: (Score:2, Informative)
  
  by Avtuunaaja ( 1249076 ) writes:
  
  Linux can handle 4096 cores without trouble in the main kernel tree, with support for much larger images already existing in trees forked by people who actually need such things.
  - Re:So ... (Score:4, Informative)
    
    by TheRaven64 ( 641858 ) writes: on Wednesday December 02, 2009 @06:40PM (#30305192) Journal
    
    Ugh, I hate seeing this repeated so often. The 4096-processor SGI machines that Linux works on run 'with the main tree' are clusters. They run a separate instance of Linux on each node and have some very complex hardware managing cache coherency between them. Architecturally, they are nothing like a standard SMP system.
    
    Parent Share
    twitter facebook
- Re: (Score:3, Funny)
  
  by Nadaka ( 224565 ) writes:
  
  All of them except windows.
- Re: (Score:2)
  
  by V!NCENT ( 1105021 ) writes:
  
  What? Vector units inside?
  - - Re: (Score:3)
      
      by V!NCENT ( 1105021 ) writes:
      
      Leaked? Dude I got the freaking instruction set in my mailbox. Want the public PDF? It's an ordinary x86-64 CPU that is capable of vector processing stuff...
- Re: (Score:2)
  
  by V!NCENT ( 1105021 ) writes:
  
  At about 20-30 fps, according to Intel, with Pixomatic 3 :')
- Re: (Score:3, Insightful)
  
  by afidel ( 530433 ) writes:
  
  x86-64 is actually a pretty good architecture with a decent tradeoff between registers and instruction compactness. Since the instructions are compact you can fit more of them per RAM clock cycle which is an advantage vs a pure RISC architecture which is why POWER has come much more towards the CISC side of thing then x86 has gone towards RISC (externally, internally it's pretty much a RISC machine).

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Meh. I'm holding out for a kilocore. (Score:2)

48 is sufficient for most Ph.D. dissertations. (Score:5, Interesting)

Rather than Larrabee, Intel should've focus on CPU (Score:2)

Re:48 is sufficient for most Ph.D. dissertations. (Score:4, Informative)

Re:48 is sufficient for most Ph.D. dissertations. (Score:5, Funny)

Re: (Score:3, Interesting)

Re:Meh. I'm holding out for a kilocore. (Score:5, Funny)

Re: (Score:2, Funny)

Advantages over just adding more FPUs? (Score:2)

Re: (Score:2, Funny)

Re: (Score:2)

Idle benchmarks (Score:5, Insightful)

Re: (Score:2)

Re: (Score:3)

Re: (Score:3, Insightful)

Re: (Score:2)

Re: (Score:2)

Re:Advantages over just adding more FPUs? (Score:4, Informative)

Re: (Score:3, Informative)

Re:Advantages over just adding more FPUs? (Score:4, Interesting)

Re:Advantages over just adding more FPUs? (Score:4, Interesting)

Re:Advantages over just adding more FPUs? (Score:4, Informative)

Re:Advantages over just adding more FPUs? (Score:5, Informative)

Re:Advantages over just adding more FPUs? (Score:5, Informative)

Re:Advantages over just adding more FPUs? (Score:4, Informative)

Re: (Score:2)

Re:Advantages over just adding more FPUs? (Score:5, Insightful)

Re: (Score:3, Interesting)

Re: (Score:3, Insightful)

Re: (Score:2)

Re:Advantages over just adding more FPUs? (Score:5, Interesting)

Re: (Score:3, Interesting)

Re: (Score:3, Interesting)

NUMA vs SMP (Score:3, Interesting)

Yet another cloud? (Score:5, Insightful)

Re:Yet another cloud? (Score:5, Insightful)

Re:Yet another cloud? (Score:5, Funny)

Re:Yet another cloud? (Score:5, Funny)

Re: (Score:3, Funny)

Re: (Score:3, Funny)

Re: (Score:3, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Only 48? (Score:5, Funny)

Re: (Score:2)

Obligatory (Score:2)

Obligatory "Fixed that for you" (Score:3, Funny)

Great cost savings (Score:5, Funny)

Re: (Score:3, Funny)

Synergy! (Score:5, Funny)

Is there enugh cpu to chipset bandwith to make use (Score:5, Interesting)

Re: (Score:2)

Re:Is there enugh cpu to chipset bandwith to make (Score:4, Interesting)

Sun HAS a 64 thread processor: UltraSPARC T2 (Score:4, Informative)

Re: (Score:2)

Not the same thing (Score:4, Informative)

Sounds like Sinclair's waffer scale intergration. (Score:2)

Re: (Score:3, Informative)

Re:Code Name is Offensive (Score:5, Insightful)

Re: (Score:2)

Re:Code Name is Offensive (Score:5, Funny)

Re: (Score:2, Insightful)

Re: (Score:3, Insightful)

Re: (Score:3, Informative)

Re: (Score:3, Informative)

Re: (Score:2, Funny)

Re:Code Name is Offensive (Score:4, Funny)

Re: (Score:2)

Re: (Score:3, Funny)

Re: (Score:2, Informative)

Re: (Score:2)

Re: (Score:3, Funny)

Re: (Score:3, Funny)

Re: (Score:2, Redundant)

Re: (Score:3, Insightful)

Re: (Score:2)

Re:Code Name is Offensive (Score:5, Funny)

Re: (Score:2, Funny)