SRI/Cambridge Opens CHERI Secure Processor Design 59

Posted by Unknown Lamer on Wednesday July 16, 2014 @07:45AM from the dreaming-of-hurd/coyotos dept.

An anonymous reader writes with some exciting news from the world of processor design: Robert Watson at Cambridge (author of Capsicum) has written a blog post on SRI/Cambridge's recent open sourcing of the hardware and software for the DARPA-sponsored CHERI processor — including laser cutting directions for an FPGA-based tablet! Described in their paper The CHERI Capability Model: Reducing Risk in an age of RISC, CHERI is a 64-bit RISC processor able to boot and run FreeBSD and open-source applications, but has a Clang/LLVM-managed fine-grained, capability-based memory protection model within each UNIX process. Drawing on ideas from Capsicum, they also support fine-grained in-process sandboxing using capabilities. The conference talk was presented on a CHERI tablet running CheriBSD, with a video of the talk by student Jonathan Woodruff (slides).

Although based on the 64-bit MIPS ISA, the authors suggest that it would also be usable with other RISC ISAs such as RISC-V and ARMv8. The paper compares the approach with several other research approaches and Intel's forthcoming Memory Protection eXtensions (MPX) with favorable performance and stronger protection properties. The processor "source code" (written in Bluespec Verilog) is available under a variant of the Apache license (modified for application to hardware). Update: 07/16 20:53 GMT by U L : If you have any questions about the project, regular Slashdot contributor TheRaven64 is one of the authors of the paper, and is answering questions.

SRI/Cambridge Opens CHERI Secure Processor Design

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 59 Comments Log In/Create an Account

Comments Filter:

To make things more secure, make them simpler (Score:1)

by Anonymous Coward writes:

Complexity is bad for security.
- Re: (Score:1)
  
  by Tmackiller ( 959837 ) writes:
  
  Agreed, the more complex something becomes, the more likely there is to be a blindspot or loophole in the design.
- Re: (Score:3, Informative)
  
  by Anonymous Coward writes:
  
  Capabilities are simpler than "simple" paged memory management. Every pointer carries permissions, enforced in hardware, instead of requiring several layers of software to perform their own ownership management.
  - Re: (Score:3)
    
    by TheRaven64 ( 641858 ) writes:
    
    Note that, in our implementation, capabilities are orthogonal to paged memory management. CHERI's capabilities are within a virtual address space. The MMU is a really useful abstraction for an OS wanting to isolate processes from each other and implement things like swapping, lazy allocation, and so on. It's a pretty poor abstraction for implementing memory safety and sandboxing within an application. We regard the two as complementary: CHERI provides both a traditional MIPS TLB and capabilities, and th
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  Complexity is bad for security.
  To paraphrase Einstein, "security should be as simple as possible, but no simpler."
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  Simplicity is an illusion we place on reality. In reality, simplicity is merely trading complexity for a predefined set of rules that only come about through even more complex ideas.
  This is exactly what they are doing. They are trading the complexity of 5 million different programmers trying to be secure for hardware designed by a few really smart people.
  In this case, things become simpler for everyone, including the hackers, as they only have to hack one model. However, since the compilers are in on it, th
- Re:To make things more secure, make them simpler (Score:4, Insightful)
  
  by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @10:04AM (#47466749) Journal
  
  Yes, we agree. That's why we have a fairly simple set of instruction set extensions requiring no complex logic (TCAMs, associative structures, complex data structure walkers, and so on) that we have formally verified provide the required security properties.
  
Laser cutting directions! (Score:5, Insightful)

by Gollum ( 35049 ) writes: on Wednesday July 16, 2014 @08:31AM (#47465991)

Breathless excitement!
The achievements in the rest of this paper far outweigh the existence of a tablet built on this foundation.
They've created their own 64-bit processor! They've implemented a compiler for it! They've ported FreeBSD to it! That's some seriously impressive stuff!
But the leader has to be the laser cut tablet assembly. :-(

- - - Re: (Score:1)
      
      by Impy the Impiuos Imp ( 442658 ) writes:
      
      Hogging out a hole : to reverse-3D print
    - Re: (Score:1)
      
      by Garridan ( 597129 ) writes:
      
      3d printing air into solid aluminum? I'm sold! Can you do transparent aluminum too?
- Re:Laser cutting directions! (Score:4, Informative)
  
  by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @10:02AM (#47466733) Journal
  
  To be fair to the submitter, Theo did some amazing work with the laser cutter to produce the tablets. Mine doesn't have the SRI and Cambridge logos etched into the front (or a battery, actually - it's one of the first models) but the it's still very nice. Not really competitive with the iPad, but definitely something we can plausibly use as a prototype.
  
- - Re: (Score:3)
    
    by TheRaven64 ( 641858 ) writes:
    
    Impressive is also the dare to implement a capability based model with some backing that could lead to consumer products eventually. How many decades this has been put to side now in the general market?
    
    Most of the early capability systems (with the exception of the M-Machine from MIT) were extreme-CISC chips. They implemented complex capability walking so you could have multiple levels of indirection before you got to the . We aim to provide enough hardware support that you can implement these things in software, in a way that can't be bypassed. We're also aiming for a single hardware abstraction that scales from per-object bounds checking up to large sandboxes (think NaCl, but cheap and hardware-enfor
    - - Re: (Score:2)
        
        by TheRaven64 ( 641858 ) writes:
        
        A lot of routers actually run FreeBSD/MIPS. The FreeBSD MIPS port was jointly done by Juniper and Cisco. The CHERI project is jointly funded by two DARPA programs, CRASH and MRC. The MRC-focussed part is looking as software-defined networking and switch fabrics. Part of this includes putting CHERI on the NetFPGA 10G boards (which have 4 10GigE ports, so can do a reasonable amount of packet pushing). For a switch, you typically want some hardware acceleration of lookup tables for routing and so on, but
    - Re: (Score:2)
      
      by Guy Harris ( 3803 ) writes:
      
      Most of the early capability systems (with the exception of the M-Machine from MIT) were extreme-CISC chips.
      Or CISC and extreme-CISC multi-chip processors, such as the Cambridge CAP computer [wikipedia.org], the Plessey System 250 [wikipedia.org], the Flex machine [wikipedia.org], and the IBM System/38 [wikipedia.org].
4chan comment (Score:2, Offtopic)

by bmo ( 77928 ) writes:

>CHERI processor
Really? Cheri?
http://cheri.com/ [cheri.com] You have been promoted!
NOT SAFE FOR WORK.
--
BMO
- Re: (Score:2)
  
  by bmo ( 77928 ) writes:
  
  I know it's poor form to bitch about moderation, but "offtopic?" Really?
  It never crossed anyone's mind to type the name into a web browser? You know, just to make sure that it didn't link to something like lemonparty?
  --
  BMO
Any questions? (Score:5, Informative)

by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @12:21PM (#47468085) Journal

I'm one of the authors of the paper (I did the LLVM support, some of the ISA design and the hardware implementation, and a token amount of OS work), so I'm happy to answer questions about it.

- source code of the processor? But software patents (Score:2)
  
  by raymorris ( 2726007 ) writes:
  
  A processor is a piece of hardware, correct?
  TFS says the Verilog source code of the processor is available. "Source code" sounds like software. I bet you can even run that source code on a general purpose CPU, can't you?
  So is the processor hardware, software, or are the people who scream about "software patents" utterly clueless about computer engineering?
  - Re:source code of the processor? But software pate (Score:5, Informative)
    
    by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @01:42PM (#47468895) Journal
    
    The processor is implemented as a softcore in BlueSpec SystemVerilog, which is a high-level hardware description language (HDL). The source code can be compiled to C for simulation, so you can run it on a general-purpose CPU. We get around 30K instructions per second doing this. It can also be compiled to verilog and then synthesized into gate layouts for an FPGA. We can run at 100MHz (probably 125MHz, but we don't push it) in an Altera Stratix IV FPGA, with around 1 instruction per clock (a bit less), so around 3000 times faster than simulation.
    In theory, you could also take the verilog and generate a custom chip. In practice, you wouldn't want to without some tweaking. For example, our TLB design is based on the assumption that TCAMs are very expensive but RAM is very cheap. This is true in an FPGA, but is completely untrue if you were fabbing a custom chip.
    Although we use the term 'source code', it's perhaps better to think of it as the code for a program that produces a design of a processor, rather than the source code for a processor.
    In terms of software patents, there's some annoying precedent that a software implementation of a architectural patent can be infringing. The MIPS architecture that we implement has LWR and LWL instructions that accelerate unaligned loads and stores. These were patented (the patents have now expired) and the owners of the patent won against someone who created a MIPS implementation where these two instructions caused illegal instruction traps and were emulated in software. The software implementations were found to infringe the hardware patent.
    
    - Re: (Score:1)
      
      by goarilla ( 908067 ) writes:
      
      In terms of software patents, there's some annoying precedent that a software implementation of a architectural patent can be infringing. The MIPS architecture that we implement has LWR and LWL instructions that accelerate unaligned loads and stores. These were patented (the patents have now expired) and the owners of the patent won against someone who created a MIPS implementation where these two instructions caused illegal instruction traps and were emulated in software. The software implementations were
  - Re: (Score:2)
    
    by angel'o'sphere ( 80593 ) writes:
    
    The 'source code' we are talking about is not running on a processor.
    It is written in a HDL, a 'hardware description language'. The compiler is generating a chip layout which is then burned into an FPGA, a 'field programmable gate array' (or was it 'free'?)
    So they use a 'programming language' to describe the logic, the circuits, of the processor.
    The final result ofc is hardware that is capable of running FreeBSD and user land applications.
    Hope that helped ;)
- - Re:How does it compare to Unisys MCP ? (Score:5, Informative)
    
    by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @02:21PM (#47469335) Journal
    
    There's a little bit of comparison to the Burroughs architecture that was one of the forerunners of the Unisys architecture in the paper. I'm not overfly familiar with the later Unisys MCP, so this may be wrong:
    Our approach was explicitly intended to work with language that are not memory safe (i.e. C and friends). If you have a memory-safe language, then there is some cost associated with enforcing the memory safety in software, which CHERI can assist with, but you don't see much of a win.
    As soon as you start mixing languages, you get the worst of all worlds. A typical Android application is written in C and Java (and some other things) and so gets all of the memory safety of C, plus all of the performance of Java. A single pointer error in the C code can corrupt the Java heap. One of my students last year implemented a modified JNI on CHERI that allows this sort of mixing but without the disadvantages. Java references are passed to C code as sealed capabilities, so the C code can pass them back to the JVM, but can't dereference them. The C code itself runs in a sandbox (C0 - the capability register that identifies the region of memory that normal MIPS loads and stores can use - is restricted to a smallish subset of the total [virtual] address space) so any pointer errors inside the C are constrained to only touch the C heap (of that particular sandbox - there can be a lot). He demonstrated running buggy and known-exploitable code in the JNI, without it compromising the Java code, with a very small overhead for calling in and out of the sandbox. Most interestingly, he was also able to enforce the Java security model for native code: the sandboxed code couldn't make system calls directly and had to call back into the currently active Java security monitor to determine whether it was allowed to.
    Another of my students implemented an accurate (copying, generational) garbage collector for capabilities. This can be used with C code, as long as the allocator is outside of the normal C0-defined address space (otherwise pointers can leak as integers into other variables and be reconstructed). In particular, you can use this to track references to high-level language objects as they flow around C code and either invalidate them or treat them as GC roots, so you don't get dangling pointer errors. Or you can just use his allocator in C and have fully GC'd C code...
    My understanding of MCP is that it uses high-level languages in the kernel, but does nothing to protect (for example) typesafe ALGOL code from buggy C code within the same userspace process.
    
    - - Re:How does it compare to Unisys MCP ? (Score:5, Informative)
        
        by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @02:45PM (#47469591) Journal
        
        Currently, Ohloh.net is aware of around 11 billion lines of C/C++ code. I'm including C++ because, while it's possible to write memory-safe C++, all of the evil pointer tricks that are permitted in C are also allowed in C++. Rewriting 11 billion lines of code, even in a language that is 100 times more expressive than C, would be a daunting task. Note that Ohloh only tracks open source code in public repositories: there's a lot more that's in private codebases.
        For a fully memory-safe language, you need things like bounds checks on array accesses. Some research from IBM about 10 years ago showed that, with JVMs of the time, the cost of bounds checks accounted for the vast majority of the difference in performance between C++ and Java implementations of the same algorithms. They did some nice work on abstractions that allowed most bounds checks to be statically elided, but their approach never made it into the language. The overhead is somewhat hidden by superscalar architectures (the bounds check is done in parallel and the branch is predicted not-taken), but not on low-power in-order chips like the Cortex A7 that you'll find in a lot of modern mobile phones.
        
        
        Re:How does it compare to Unisys MCP ? (Score:5, Informative)
        
        by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @03:16PM (#47469899) Journal
        
        I appreciate your comments, but would like to add that it appears to me that correctness+security has by now become more important for applied computer science than processing throughput.
        Yes, that's one of the motivations for our work. Now seems to be the time when it's possible to persuade CPU vendors that dedicating a little bit more hardware to security is valuable. It isn't quite the dichotomy that you describe though, because you can either implement security features in software or hardware and doing so in hardware can mean that you see a performance improvement for the sorts of use cases that people increasingly care about.
        
    - Re: (Score:2)
      
      by weav ( 158099 ) writes:
      
      Coming late to this party, I can only share this one datapoint: If I recollect right, it was the hardware on the Burroughs Large Systems that was memory-safe. Arrays were done as Segment Descriptors, and if you indexed outside the segment it raised an exception. Byte-addressed (for 6-bit, 8-bit, etc bytes) were done as "String Descriptors" that referred to the Segment Descriptor.
      The only other hardware I know of that did this lately was the UCSB P-system; there may be others since then.
      - Re: (Score:2)
        
        by TheRaven64 ( 641858 ) writes:
        
        Yup. Compiling for the Burroughs architecture was easier than many segment-based systems, because they allowed segment descriptors to be placed in main memory, with the CPU responsible for tracking the value type by updating a tag. We adapt this slightly so that we only require one tag bit per 256 bits of main memory (the paper describes the implementation of this in some detail, but I'm happy to answer questions) to be able to safely store capabilities in main memory. Our design also allows normal C dat
- - Re:Any questions? (Score:5, Informative)
    
    by TheRaven64 ( 641858 ) writes: on Wednesday July 16, 2014 @02:25PM (#47469365) Journal
    
    We hope so too. One of the things on my short-term todo list is define a set of requirements for our extensions to C so that we can have source compatibility across different CPUs implementing similar ISA extensions. Of course, this is (optimistically) assuming that other CPUs are going to provide similar extensions...
    
- Look this gifthorse in the mouth (Score:1)
  
  by Garridan ( 597129 ) writes:
  
  When old enemies arrive at your door bearing a gift horse large enough to fit a fireteam, don't stop at it's mouth, do a full cavity search.
  
  Are there independent third-party audits underway? If DARPA likes it, I'm suspicious.
  - Re: (Score:2)
    
    by TheRaven64 ( 641858 ) writes:
    
    I hope no one misses the irony of using a DARPA-funded network protocol to post about how evil DARPA is...
    - Re: (Score:1)
      
      by Garridan ( 597129 ) writes:
      
      I didn't say that DARPA was evil. I just said that this merits an independent audit. I take your flippant answer for a "no".
      - Re: (Score:2)
        
        by TheRaven64 ( 641858 ) writes:
        
        The flippant answer is all that your paranoia deserves. The work was undertaken by SRI and The University of Cambridge. The funding was provided by DARPA, but that's the extent of their involvement (other than creating a program with the goal of being able to redesign any aspect of computing with security in mind).
        The code is no more or less meriting an independent audit than any other open source code. Less, actually, because we don't anticipate anyone actually using our open source reference impleme
        
        Re: (Score:2)
        
        by Garridan ( 597129 ) writes:
        
        The flippant answer is all that your paranoia deserves.
        A healthy amount of paranoia is a must in the security industry. Every piece of security hardware or sofware demands third-party evaluation, even that made by people you trust completely. Not all flaws are deliberate. Thank you for your time, and (at last) thoughtful response.
- Re: (Score:2)
  
  by complete loony ( 663508 ) writes:
  
  Has anyone on the project thought about extending the design to include secure key storage? (I'm thinking specifically of something like TRESOR / TreVisor)
  - - Re: (Score:2)
      
      by TheRaven64 ( 641858 ) writes:
      
      This is an interesting design. Have you thought about maybe posting seL4 to it once that's open-sourced in a couple of weeks? /akr
      Yes, we're looking forward to the seL4 release. Somewhat cautiously, as there's no mention of a license (if it's GPL'd, we probably won't put any effort into it). In CHERI, we have store-capability and load-capability permissions on both capabilities and pages. To implement the seL4 model (where capabilities are stored in special pages), we'd only need to use the TLB-grandularity protections, but it would be more interesting to provide fine-grained capability storage, which would reduce a bit of the diff
  - Re: (Score:2)
    
    by TheRaven64 ( 641858 ) writes:
    
    We haven't specifically looked at key storage, but we are looking at potential ways of extending the approach to heterogeneous multicore systems, so that access to main memory could be delegated to accelerators (at varying levels of coupling) via capabilities and capabilities could be used to mediate access to accelerators. In this scheme, it would be possible to provide an encryption coprocessor that would store keys internally and perform encryption and decryption operations on blocks of memory delegated
- Re: (Score:2)
  
  by smallfries ( 601545 ) writes:
  
  Is the code for the LLVM support publically available / do you have a link for it?
  - Re: (Score:3)
    
    by TheRaven64 ( 641858 ) writes:
    
    Yup, it's on GitHub in two repositories, one for LLVM [github.com] and one for clang [github.com]. We've been pushing the fixes for MIPS IV upstream, but there are some changes to the mid-level optimisers to make them aware that not all pointers are integers and some extensions to the platform-independent code generators for the same.
    We've upstreamed all of the FreeBSD changes required for the base processor (BERI) and the development boards, so FreeBSD 10 will run out of the box on CHERI (it just won't expose any of the capabil
Acronyms with multiple meanings (Score:1)

by Damian J Pound ( 3635341 ) writes:

Multi-Pointer X, also called MPX

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

To make things more secure, make them simpler (Score:1)

Re: (Score:1)

Re: (Score:3, Informative)

Re: (Score:3)

Re: (Score:1)

Re: (Score:1)

Re:To make things more secure, make them simpler (Score:4, Insightful)

Laser cutting directions! (Score:5, Insightful)

Re: (Score:1)

Re: (Score:1)

Re:Laser cutting directions! (Score:4, Informative)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

4chan comment (Score:2, Offtopic)

Re: (Score:2)

Any questions? (Score:5, Informative)

source code of the processor? But software patents (Score:2)

Re:source code of the processor? But software pate (Score:5, Informative)

Re: (Score:1)

Re: (Score:2)

Re:How does it compare to Unisys MCP ? (Score:5, Informative)

Re:How does it compare to Unisys MCP ? (Score:5, Informative)

Re:How does it compare to Unisys MCP ? (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re:Any questions? (Score:5, Informative)

Look this gifthorse in the mouth (Score:1)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Acronyms with multiple meanings (Score:1)

Related Links Top of the: day, week, month.

Slashdot Top Deals