Stories
Slash Boxes
Comments

News for nerds, stuff that matters

The Coming Wave of Gadgets That Listen and Obey

Posted by Soulskill on Sun Jan 27, 2008 12:36 PM
from the tea-earl-grey-hot dept.
dgan brings us a NYTimes piece about the development of speech recognition for common gadgets. Companies such as Vlingo and Yap are marketing their software to cellular carriers to give consumers a hands-free option for tasks like finding directions and text messaging. Quoting: "Vlingo's service lets people talk naturally, rather than making them use a limited number of set phrases. Dave Grannan, the company's chief executive, demonstrated the Vlingo Find application by asking his phone for a song by Mississippi John Hurt (try typing that with your thumbs), for the location of a local bakery and for a Web search for a consumer product. It was all fast and efficient. Vlingo is designed to adapt to the voice of its primary user, but I was also able to use Mr. Grannan's phone to find an address. The Find application is in the beta test phase at AT&T and Sprint. Consumers who use certain cellphones from those companies can download the application from vlingo.com."

Related Stories

This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.

The Coming Wave of Gadgets That Listen and Obey 25 Comments More | Login /

 Full
 Abbreviated
 Hidden
More | Login
Keybindings Beta
Q W E
A S D
Loading ... Please wait.
  • It may finally happen. (Score:4, Funny)

    by moogied (1175879) on Sunday January 27, @12:37PM (#22200576)
    5,000 years ago man relized it could not make women listen and obey. So he started a quest to make devices that could..

    Is it possible that all of mankinds dreams are coming true now?!

    • Re:It may finally happen. (Score:5, Interesting)

      by peragrin (659227) on Sunday January 27, @12:46PM (#22200634)
      nope. because we must select double delete them all.

      voice recognition is no where near reliable. I laugh at my brother as he tries to use voice dial on his cell phone, it takes two or three times to get it to work. I once sneezed and it dialed my father. a good throat clearing sounds like mother. I should try farting at it some time to see who that would Dial.

      Seriously try it sometime. delicately train the system for your voice, use it for a while, and then start throwing random noise at it. Or take a song which the music track is quiet enough to hear each word clearly and play that at the microphone. It should give you all the lyrics, yet they can't sort that out. The human ear can, but a computer can't yet. voice recognition is nearly useless until it can.
      [ Parent ]
      • Re:It may finally happen. (Score:5, Funny)

        by MonsterOfTheLake (880659) on Sunday January 27, @12:56PM (#22200694) Homepage
        I should try farting at it some time to see who that would Dial.

        #265532 [bash.org]:
        (Sabdo) on one of those speech-to-text programs my friend ripped ass onto the mic.
        (Sabdo) and it typed out "France"
        (Sabdo) we were like, wtf?
        [ Parent ]
      • Re:It may finally happen. (Score:5, Interesting)

        by ScrewMaster (602015) on Sunday January 27, @01:01PM (#22200736)
        The human ear can, but a computer can't yet. voice recognition is nearly useless until it can.

        Voice recognition is incredibly useful in the right context. A friend of mine is an attorney who happens to be disabled. He makes great use of voice recognition on his computer, does most of his legal work with it. Is it "conversational"? No, but it serves his purposes perfectly.

        So you're right, speech recognition systems aren't as generally versatile or accurate as the human brain, but they're getting better all the time. Give it ten years or so, with improved algorithms and a sixteen core processor to handle them I think we'll be interacting with computers on a much different level. Of course, by then you'll have to know Spanish or Mandarin to use one of them.
        [ Parent ]
          • Re: (Score:3, Interesting)

            ...what?
            Please mr. guru, tell me how this happens exactly.

            I not saying it is done that way, but it would be very easy to do it that way. Mobile phones have all the kit which is needed to digitise speech, and to send that digitised speech over a GPRS connection to a web service that does speech-to-text and retu

  • All I can think of is... (Score:5, Funny)

    by bennomatic (691188) on Sunday January 27, @12:44PM (#22200614) Homepage
    "Open the pod bay doors, HAL."

    "I'm afraid I can't do that, Dave."

    • Re: (Score:2, Funny)

      by Anonymous Coward
    • Re:All I can think of is... (Score:4, Interesting)

      by value_added (719364) on Sunday January 27, @01:51PM (#22201014)
      "Open the pod bay doors, HAL."
      "I'm afraid I can't do that, Dave."


      My take on the matter is that the reason that's all you can think of is that everything else is inappropriate, inefficient or simply too goofy for consideration.

      Not to anthropomorphise electronic devices (I know, they don't like it when you do that), but I think they'd prefer to be treated anonymously and respond the most basic of instructions only. And we'd prefer they remain that way, except in very limited circumstances where the device is named Lenore.

      In the Star Trek movies you'll find something similar to the above, with an occasional "Tea, Early Gray, Hot" for good measure, but the rest of the time everyone is interacting with devices using ... wait for it ... keys and buttons. And this is into the technologically advanced future where most everything is a device, including crew members. Seeing Picard, for example, say "Computer, send a message to Data telling him to work on his joke-telling skills", or to use the article's example, [asking] his phone for a song by Mississippi John Hurt, would be seen by everyone as a ridiculous use of technology and dismissed as absurd.

      Voice recognition, in the abstract, is fascinating and no doubt fun, but I wouldn't want to live in a Tourettes-like world where everyone is shouting out instructions to unthinking devices, let alone work in a cubicle where the next guy's phone conversation are competing with the noise of his regular work.

      So past opening and closing doors, keyboards it is. Or for those unskilled in the expressive art of the command-line, a mouse or function buttons.
      [ Parent ]
  • Fun with Gadgets (Score:3, Funny)

    by Knave75 (894961) on Sunday January 27, @12:44PM (#22200616)
    User: Please connect me with Hugh Jass
    Gadget: Sorry, I could not find a Hugh Jass
    User: *snicker*
  • I can imagine the day we speak the name of some legislation in the phone and say "vote yes" or "vote no". The results show up on our congressman's web site and some other third party sites that archive. This way we take control of a few and transfer it to
  • heard it all before (Score:4, Insightful)

    by debatem1 (1087307) on Sunday January 27, @12:45PM (#22200624)
    I maintain great skepticism about speech recognition as an interface. It just isn't much faster than typing, even on a cell phone- and its not that it takes so much longer to get an ideal rendering, its that even a minor error in translation results in about five seconds of prompting followed by reentry. Until they can get that figured out, or get accuracy up to a point where someone unused to giving dictation can use it, its just not that great a technology.
    • Re:heard it all before (Score:5, Interesting)

      by mdfst13 (664665) on Sunday January 27, @01:16PM (#22200800)

      It just isn't much faster than typing
      Sure, but it's a lot safer to do while, say, driving down the road. The problem with screen output and typed input is that you have to use both eyes and hands to operate the device. By contrast, using speech input and output only requires voice and ears. Of course, there are some circumstances where the screen/type method is superior, e.g. sending emails from your blackberry during meetings. However, there are many cases where speech is superior, e.g. driving down the road (or even just walking). Viewing speech as a replacement for screen/type is over zealous. It's really more of an alternative.

      It would probably help if advocates of the technology understood this. It doesn't have to be all or nothing. Two alternative solutions can add up to a more powerful solution than either would be alone.
      [ Parent ]
      • Re: (Score:3)

        Poeple definitely shouldn't be texting while they drive. People probably shouldn't be talking while they drive either.
  • I'll only be interested in gadgets which obey only what I tell them to do.
  • Limited phrasebook (Score:4, Interesting)

    by name*censored* (884880) on Sunday January 27, @12:49PM (#22200652)
    Limited phrasebook technology is a lot better than voice recognition technology in a lot of devices. Given that most (well, all) devices have limited functionality (not even Steve Jobs' iPod can do his taxes for him), there's very little point in giving the device the ability to understand possibly-misdirected phrases such as "Honey, have you seen the remote?". A good approach for this technology would be to limit it to understanding alternate ways of phrasing a particular command; "Device, Get Me A Beer"/"Device, Can I Have A Beer"/"I'm Really Thirsty". This way, we'd avoid misdirected speaking (the device thinking you're speaking to it instead of to another), and could also exploit the reduced set of understandable phrases to correct for people with colds/accents/quiet voices/etc, in much the same way as limited-phrasebook devices work (only with more flexibility).
    • Re: (Score:3, Funny)

      Instead, we should invent plot-directed recognition technology. I mean, you never see the computer on Star Trek misinterpreting the zillions of conversations as being directed toward it. Why? Because it would bog down the plot, except for those rare occ
    • Re:Limited phrasebook (Score:4, Funny)

      by niceone (992278) * on Sunday January 27, @01:37PM (#22200922) Journal
      "Honey, have you seen the remote?"

      Phone: Yeah, sure, it's cute enough, but I think I can do better.
      [ Parent ]
  • "Didn't I ask to to start the Roomba, Dear" (click) "Do it yourself, Roomba my ass...."
  • Seems to be a "layer mismatch". Analogy is the OSI model.

    I'll stick to using voice for "higher layer" communication with actual intelligences like humans and other animals. For "lower layer" comms you don't use your voice.

    If you ride a horse while you do t
  • Obedient, huh? Get a job and bring home some cash!
    • Re: (Score:3, Interesting)

      Everything. I personally don't give a rat's ass about cell phones - it's not really a big deal or very innovative until you just have a communicator built in. Everything else though, from doors, lights, running tasks on a computer, etc. is what's really
      • Re: (Score:3)

        Forget voice recognition/synthesis and all that crude claptrap ... I want a brain implant capable of accessing symbolic thought patterns directly. Just think about something and the machine will figure out what it is that you want to know, and feed the inf
    • Re:I wonder... (Score:5, Funny)

      by lgw (121541) on Sunday January 27, @01:00PM (#22200728) Journal
      I can't get over this "hands free text messaging" option! What engineer had the insight "we need to give customers a way to communicate over the phone just by talking"? It's a strange world.
      [ Parent ]
      • Re: (Score:3)

        One of the features of my new phone is "Voice SMS."

        Think about that for a moment. It's like a text message, but it's voice. On a phone.

        According to Sprint [sprintpcs.com], the reason this is better than a normal voice mail message is that you're guaranteed to leave a

    • Re: (Score:2, Interesting)

      There is no reason you couldn't set your car's speed with your cell phone using Blue Tooth. Just say 80 MPH please. Or reduce rapid, no break lights, 60. Or speak "reduce 3 spot 60 BL not." That means reduce speed to 60 in 3 seconds no break lights. Over