Nvidia GPU-Powered Autonomous Car Teaches Itself To See And Steer (networkworld.com) 54

Posted by BeauHD on Thursday April 28, 2016 @05:33PM from the homeschooling dept.

An anonymous reader quotes a report from Network World discussing Nvidia's project called DAVE2, where their engineering team built a self-driving car with one camera, one Drive-PX embedded computer and only 72 hours of training data: Neural networks and image recognition applications such as self-driving cars have exploded recently for two reasons. First, Graphical Processing Units (GPU) used to render graphics in mobile phones became powerful and inexpensive. GPUs densely packed onto board-level supercomputers are very good at solving massively parallel neural network problems and are inexpensive enough for every AI researcher and software developer to buy. Second, large, labeled image datasets have become available to train massively parallel neural networks implemented on GPUs to see and perceive the world of objects captured by cameras. The Nvidia team trained a convolutional neural network (CNN) to map raw pixels from a single front-facing camera directly to steering commands. Nvidia's breakthrough is the autonomous vehicle automatically taught itself by watching how a human drove, the internal representations of the processing steps of seeing the road ahead and steering the autonomous vehicle without explicitly training it to detect features such as roads and lanes.

Nvidia GPU-Powered Autonomous Car Teaches Itself To See And Steer

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 54 Comments Log In/Create an Account

Comments Filter:

Inherent shortcomings (Score:3)

by fyngyrz ( 762201 ) writes: on Thursday April 28, 2016 @05:37PM (#52008953) Homepage Journal

In situations that do not resemble the training data, the network's response is essentially undefined, as well as unknown (it's all unknown... an NN results in behaviors that are not deterministic in the sense that anyone planned them out -- they are what they are, that's all.)
Nice experiment, though. :)

- - Re:Inherent shortcomings (Score:5, Interesting)
    
    by fyngyrz ( 762201 ) writes: on Thursday April 28, 2016 @06:13PM (#52009205) Homepage Journal
    
    No, not like people. An NN of the type described doesn't learn further; won't generalize from other experiences (FI, this thing won't know what to do with a pedestrian unless it's observed the human driver stopping for some number of them. Otherwise it has no training that places value on a pedestrian. Same for cat, chunks of some trucks blown-off retread, potholes, etc.) A person will know from their far, far deeper experience that it's inherently a bad idea to run down the three year old that walked in front of the car, will usually at least try not to run down little Susie's pet cat, and will choose to go around the pothole if it is reasonable to do so.
    An NN is a low-dimensional approach to solving problems. But most non-trivial problems -- and driving a vehicle is an entirely representative proxy for such problems -- tend not to be uniformly low-dimensional. Many aspects come into play suddenly and unpredictably. Some might not be seen for years, or ever. But then again, they might. In order to deal with such things, more than low-dimensional problem solving is required. NN's can't do it. They're inherently limited.
    It's going to require very high level software. Not intelligent, but damned well informed and replete with a huge rulespace that can solve all of the types of reasonably solvable problems in the space.
    That's all IMHO, of course. But I'm right, so there's that. :)
    
    - Re: (Score:2)
      
      by laing ( 303349 ) writes:
      
      "Otherwise it has no training that places value on a pedestrian." /p It's been years since I played Death Race 2000 (the video game [wikipedia.org], not the movie [wikipedia.org]). How many points is a pedestrian worth these days?
      - Re: (Score:2)
        
        by fyngyrz ( 762201 ) writes:
        
        How many points is a pedestrian worth these days?
        um.... well, is it a mime? Also, African, or European?
        
        Re: (Score:2)
        
        by fyngyrz ( 762201 ) writes:
        
        Alluding to them, anyway.
    - Not a Luddite fear, but a valid fear (Score:3)
      
      by GPS Pilot ( 3683 ) writes:
      
      Here's an excellent article by Ashlee Vance about a self-driving neural net-based system created by one man, George Hotz: http://www.bloomberg.com/featu... [bloomberg.com]
      It contains one passage that sums up my fears:
      Hotz hadn't programmed any of these behaviors into the vehicle. He can't really explain all the reasons it does what it does. It's started making decisions on its own.
      This will come back to bite us. More than once. Systems can exhibit unexpected behavior even when the inventor has an excellent understanding of the invention; here, a very bright inventor seems to have no hope of fully understanding things.
      Unexpected behavior from the control system of an object that has a l
    - Re:Inherent shortcomings (Score:5, Informative)
      
      by lorinc ( 2470890 ) writes: on Friday April 29, 2016 @04:30AM (#52011489) Homepage Journal
      
      An NN is a low-dimensional approach to solving problems. But most non-trivial problems -- and driving a vehicle is an entirely representative proxy for such problems -- tend not to be uniformly low-dimensional. Many aspects come into play suddenly and unpredictably. Some might not be seen for years, or ever. But then again, they might. In order to deal with such things, more than low-dimensional problem solving is required. NN's can't do it. They're inherently limited.
      No, they're not. [wikipedia.org] The lack of proper response to an unseen before event (which is called bad generalization) can come from at least 2 things: a bad sampling for the training set (no pedestrian in the set), and the absence of transfer learning (use of another system trained on pedestrian to improve the first one). The first one is really easy to solve but time consuming, a lot of very bright people are working on the second.
      Honestly, automatic driving by visual cues is not longer a challenging computer vision problem. It is still a challenging engineering problem, but all the scientific tools needed to solve it are already there, and they are being actively used to solve it.
      There are a lot of computer vision tasks that are now to be considered as solved from the research point of view. Very few people seem to acknowledge the enormous progress that have been made in the field in the last 10 years. Even people working in the field. But it's there, and now the engineers have to use it.
      
      - Re: (Score:2)
        
        by fyngyrz ( 762201 ) writes:
        
        Yes, actually, they are -- because the things you list that would defuse that aren't achievable with a wide problem; only with a deep one.
        We have LDNLS [fyngyrz.com] pretty well solved. The rest is, at best, research-level stuff that just isn't ready for prime-time, and probably will never be until something more sophisticated is employed. Providing a training set won't ever reach competence. You have to provide the ability ti learn independently, and well. No one has that on the table. Yet.
- Re: (Score:2)
  
  by OzPeter ( 195038 ) writes:
  
  In situations that do not resemble the training data, the network's response is essentially undefined, as well as unknown (it's all unknown... an NN results in behaviors that are not deterministic in the sense that anyone planned them out -- they are what they are, that's all.)
  Nice experiment, though. :)
  While I tend to agree with you on the un trained aspect (and I hope they have their liability insurance in place for driving an "unknown" on a public road) it would be interesting to know if you can combine NN datasets and thus massively parallelize the learning process.
  - Re: (Score:2)
    
    by fyngyrz ( 762201 ) writes:
    
    That may well be a very good approach in terms of covering all the bases in a distributed fashion. However, now the problem arises of how to combine all of that training into one coherent, functional space so your vehicle knows everything it needs to know. While it seems obvious that a lot of hardware spread wide will gain lots of experience, and all taken together might consist of a very well covered solution space, as each NN system will be different, even in how it solves the same problems, how to winnow
- yeesh, I need a stiff drink or three (Score:2)
  
  by Thud457 ( 234763 ) writes:
  
  To be fair, if I was driving on the highway at night and a deer jumped out in front of the car I was driving,
  I too, would reboot myself.
  
  Once from the shock of the deer jumping in front of the car,
  once from the collision
  once from appraising the mess the collision created
  once from clearing the mess off the road (assuming my car was still drivable)
  once from the mental anguish of gruesomely killing a large charismatic animal.
- Re: (Score:1)
  
  by npslider ( 4555045 ) writes:
  
  As this technology continues to improve I'm sure we will hear more and more about this until we take it for granted as the new normal.
- Re: (Score:1)
  
  by scheveningen ( 305408 ) writes:
  
  Marketing. They are showcasing their hardware.
  - Re: (Score:1)
    
    by npslider ( 4555045 ) writes:
    
    A special chip for adaptive learning... heck yes, show that off!
- Re: (Score:3, Insightful)
  
  by Anonymous Coward writes:
  
  Last I checked NVIDIA's market cap was 20.3B$. You'd think they have the ability to do more than one thing at a time, especially if it returns a profit on investment in the future.
  This sort of AI usage is only going to increase, it would be dumb of NVIDIA not to at least divert a miniscule amount of manpower and resources to try and get their feet wet in this field. Anything they can patent or turn into a product will outweigh the paycheck for the engineers on this "worthless" venture.
- Re: (Score:1)
  
  by npslider ( 4555045 ) writes:
  
  It's nice to see how technology can be used in real world (ie on the road), not just on a store shelf.
Why a single camera? (Score:1)

by fustakrakich ( 1673220 ) writes:

Ever driven with one eye shut? The drunks needn't answer that one. But with two or more cameras and various other sensors, it seems that the "learning" process would go much smoother.
- Re: (Score:2)
  
  by OzPeter ( 195038 ) writes:
  
  Ever driven with one eye shut? The drunks needn't answer that one. But with two or more cameras and various other sensors, it seems that the "learning" process would go much smoother.
  If you read TFA you'd see that the learning process is done with 3 cameras. Only the actual driving is done with a single camera.
  But yes I drive with "one eye shut" all the time courtesy of being blind in one eye. However I prefer my automated systems to have some degree of redundancy - and a single camera failure would cripple this system.
  - Re: (Score:1)
    
    by npslider ( 4555045 ) writes:
    
    A human's two eyes gives depth perception by seeing the same object from two slightly different angles, the brain composites this and creates the perception of depth. Is there technology to allow a computer to understand the information 2 cameras provide like a human can?
Now there's a Rick and Morty line (Score:4, Funny)

by the_skywise ( 189793 ) writes: on Thursday April 28, 2016 @05:51PM (#52009057)

"What is my purpose?"
"You drive me places." ... "Oh God!"
"Welcome to the club, pal!"
https://www.youtube.com/watch?... [youtube.com]

in an ideal autonomous vehicle development world.. (Score:2)

by sittingnut ( 88521 ) writes:

ideally,
computing hardware running AI should be developed separately ( by many companies ),
so many deferent 'AI's can be created by many companies/groups separately, to run on those hardarews to interact with generic prototype vehicles/interfaces,
then many different vehicle producers can create many actual models to sell, enhancing those generics with added customized features that can be marketed.
doing all of the creating from start to selling, or at least most of that, by a single company, while other
- Re: (Score:2)
  
  by mspohr ( 589790 ) writes:
  
  This might work:
  Neural net on a stick.
  http://techcrunch.com/2016/04/... [techcrunch.com]
Computers don't learn (Score:1)

by Sam36 ( 1065410 ) writes:

Sorry, there is no algorithm that makes algorithms..
- Re:Computers don't learn (Score:5, Interesting)
  
  by slew ( 2918 ) writes: on Thursday April 28, 2016 @07:02PM (#52009483)
  
  Sorry, there is no algorithm that makes algorithms..
  Although there might not be an algorithm that makes algorithms, there are algorithms to configure a meta-algorithm implementations. And example meta-algorithm implementation would be a deep neural net, or a human brain. I don't think it is a stretch to call the algorithm used to configure meta-algorithm implementation "learning" (although commonly this is called training)...
  But this is merely a semantic point.
  
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    I wrote a fairly lengthy reply to the above chain but realised that every point I was going to make was made better by Douglas Hofstadter in Godel, Escher, Bach: an Eternal Golden Braid. It really should be essential pre-reading for any slashdotter engaged in a discussion about AI. If you haven't read it, go and do so now.
- Re: (Score:2)
  
  by wonkey_monkey ( 2592601 ) writes:
  
  Sorry, but-
  No, wait, I'm not sorry. Why would I be sorry? Why are you sorry?
  Anyway, yes there is.
- Re: (Score:2)
  
  by lorinc ( 2470890 ) writes:
  
  Definition of learning: The acquisition of knowledge or skills through experience, study, or by being taught
  The computer is acquiring the new skill driving, therefore the computer is learning. End of the story.
  By the way, the learning algorithm that optimizes the parameters of the neural net is making an algorithm. The neural net itself is an algorithm that takes a flow of images as inputs and outputs a steering decision. Therefore, the training/learning/optimization procedure that produces such neural net
A Scary Thought (Score:1)

by npslider ( 4555045 ) writes:

"Automatically taught itself by watching how a human drove..."
Oh my... and just what kind of driver was used as the role model?
- Re: (Score:2)
  
  by mspohr ( 589790 ) writes:
  
  Teenage boys... they (think) they're the best drivers.
  Or... maybe you could customize your driving:
  - teenage girl - texting and talking
  - old geezer - very slow reflexes
  - little old lady going to church on Sunday
  - soccer mom with 8 kids in the car running around after school to get everybody delivered
  - middle age guy with road rage
  - new driver with an "L" on the car
  The possibilities are endless
Heh (Score:4, Funny)

by MobileTatsu-NJG ( 946591 ) writes: on Thursday April 28, 2016 @06:51PM (#52009415)

We all know this car's running Windows cos Linux ain't got no good nVidia drivers!

- Re: (Score:3)
  
  by Trogre ( 513942 ) writes:
  
  Funny, but wrong.
  nVidia hardware performs at least as good under Linux as Windows, including CUDA processing.
"Directly" (Score:2)

by wonkey_monkey ( 2592601 ) writes:

The Nvidia team trained a convolutional neural network (CNN) to map raw pixels from a single front-facing camera directly to steering commands.
That's about as "direct" as... well, as direct as the connection between the photons that enter my eye and the hand movements I make to steer my car, I suppose. Not very direct at all.
DARPA's CPU (Score:2)

by Macdude ( 23507 ) writes:

Isn't this exactly the type of problem that DARPAs new CPU that's bad at math should be really good for?
Beemers (Score:2)

by Nethead ( 1563 ) writes:

I love the comment when there is a merging lane, "I just have to watch out for BMWs."
Who said Knight Rider? (Score:1)

by Paolo Redaelli ( 3749507 ) writes:

"KITT, drive me home!" I can't believe no one has't already cited Michael Knight and KITT of "Knight Rider" the TV series that launched David Hasselhoff
Starman (Score:2)

by DriveDog ( 822962 ) writes:

learned by observing, especially to accelerate hard to get through yellow lights.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Inherent shortcomings (Score:3)

Re:Inherent shortcomings (Score:5, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Not a Luddite fear, but a valid fear (Score:3)

Re:Inherent shortcomings (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

yeesh, I need a stiff drink or three (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:1)

Re: (Score:3, Insightful)

Re: (Score:1)

Why a single camera? (Score:1)

Re: (Score:2)

Re: (Score:1)

Now there's a Rick and Morty line (Score:4, Funny)

in an ideal autonomous vehicle development world.. (Score:2)

Re: (Score:2)

Computers don't learn (Score:1)

Re:Computers don't learn (Score:5, Interesting)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

A Scary Thought (Score:1)

Re: (Score:2)

Heh (Score:4, Funny)

Re: (Score:3)

"Directly" (Score:2)

DARPA's CPU (Score:2)

Beemers (Score:2)

Who said Knight Rider? (Score:1)

Starman (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals