Slashdot Log In
The Coming Wave of Gadgets That Listen and Obey
Posted by
Soulskill
on Sun Jan 27, 2008 11:36 AM
from the tea-earl-grey-hot dept.
from the tea-earl-grey-hot dept.
dgan brings us a NYTimes piece about the development of speech recognition for common gadgets. Companies such as Vlingo and Yap are marketing their software to cellular carriers to give consumers a hands-free option for tasks like finding directions and text messaging. Quoting:
"Vlingo's service lets people talk naturally, rather than making them use a limited number of set phrases. Dave Grannan, the company's chief executive, demonstrated the Vlingo Find application by asking his phone for a song by Mississippi John Hurt (try typing that with your thumbs), for the location of a local bakery and for a Web search for a consumer product. It was all fast and efficient. Vlingo is designed to adapt to the voice of its primary user, but I was also able to use Mr. Grannan's phone to find an address. The Find application is in the beta test phase at AT&T and Sprint. Consumers who use certain cellphones from those companies can download the application from vlingo.com."
Related Stories
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
It may finally happen. (Score:4, Funny)
Is it possible that all of mankinds dreams are coming true now?!
Re:It may finally happen. (Score:5, Interesting)
voice recognition is no where near reliable. I laugh at my brother as he tries to use voice dial on his cell phone, it takes two or three times to get it to work. I once sneezed and it dialed my father. a good throat clearing sounds like mother. I should try farting at it some time to see who that would Dial.
Seriously try it sometime. delicately train the system for your voice, use it for a while, and then start throwing random noise at it. Or take a song which the music track is quiet enough to hear each word clearly and play that at the microphone. It should give you all the lyrics, yet they can't sort that out. The human ear can, but a computer can't yet. voice recognition is nearly useless until it can.
Parent
Re:It may finally happen. (Score:5, Funny)
#265532 [bash.org]:
(Sabdo) on one of those speech-to-text programs my friend ripped ass onto the mic.
(Sabdo) and it typed out "France"
(Sabdo) we were like, wtf?
Parent
Re:It may finally happen. (Score:5, Interesting)
Voice recognition is incredibly useful in the right context. A friend of mine is an attorney who happens to be disabled. He makes great use of voice recognition on his computer, does most of his legal work with it. Is it "conversational"? No, but it serves his purposes perfectly.
So you're right, speech recognition systems aren't as generally versatile or accurate as the human brain, but they're getting better all the time. Give it ten years or so, with improved algorithms and a sixteen core processor to handle them I think we'll be interacting with computers on a much different level. Of course, by then you'll have to know Spanish or Mandarin to use one of them.
Parent
Re: (Score:2)
processing speed has helped a lot and they are getting better but I think we need to be able to process more than one thing at a time first. parallel programming will help more than anything else.
Re: (Score:2)
speech recognition systems aren't as generally versatile or accurate as the human brain, but they're getting better all the time. Give it ten years or so, with improved algorithms and a sixteen core processor to handle them I think we'll be interacting with computers on a much different level.
I'll believe it when I see it. This is one of those areas where various folks have been promising "[five|ten] more years" since the late sixties. Trouble is, the only thing greater storage and processing capacity get you is bigger personalized dictionaries of memorized [words|phrases|phonemes]. You still have to invest time to train the system in recognizing your speech. The greater capacity/accuracy, the longer it takes to "fine tune" the dictionary. It just doesn't seem like simply a problem of lack of
Re: (Score:2)
Cell phone vs. server farm (Score:2, Informative)
Most of the cell phone systems described in the article are likely uploading the audio to a server farm, running recognition there, and then sending back the response.
Re: (Score:2)
Please mr. guru, tell me how this happens exactly.
Re: (Score:3, Interesting)
Please mr. guru, tell me how this happens exactly.
I not saying it is done that way, but it would be very easy to do it that way. Mobile phones have all the kit which is needed to digitise speech, and to send that digitised speech over a GPRS connection to a web service that does speech-to-text and returns the text would be trivial. Doesn't need a guru.
VLINGO? (Score:2)
All I can think of is... (Score:5, Funny)
"I'm afraid I can't do that, Dave."
Re: (Score:2, Funny)
Re:All I can think of is... (Score:4, Interesting)
"I'm afraid I can't do that, Dave."
My take on the matter is that the reason that's all you can think of is that everything else is inappropriate, inefficient or simply too goofy for consideration.
Not to anthropomorphise electronic devices (I know, they don't like it when you do that), but I think they'd prefer to be treated anonymously and respond the most basic of instructions only. And we'd prefer they remain that way, except in very limited circumstances where the device is named Lenore.
In the Star Trek movies you'll find something similar to the above, with an occasional "Tea, Early Gray, Hot" for good measure, but the rest of the time everyone is interacting with devices using
Voice recognition, in the abstract, is fascinating and no doubt fun, but I wouldn't want to live in a Tourettes-like world where everyone is shouting out instructions to unthinking devices, let alone work in a cubicle where the next guy's phone conversation are competing with the noise of his regular work.
So past opening and closing doors, keyboards it is. Or for those unskilled in the expressive art of the command-line, a mouse or function buttons.
Parent
Re: (Score:2)
Bomb #20: Well, of course I exist.
Doolittle: But how do you know you exist?
Bomb #20: It is intuitively obvious.
Doolittle: Intuition is no proof. What concrete evidence do you have that you exist?
Bomb #20: Hmmmm... well... I think, therefore I am.
Doolittle: That's good. That's very good. But how do you know that anything else exists?
Bomb #20: My sensory apparatus reveals it to me. This is fun."
Re: (Score:2)
Fun with Gadgets (Score:3, Funny)
Gadget: Sorry, I could not find a Hugh Jass
User: *snicker*
Re: (Score:2)
Gadget: Dialing: Mother
User: Hey!
The increasing rate of change Collective Power (Score:2, Interesting)
heard it all before (Score:4, Insightful)
Re:heard it all before (Score:5, Interesting)
It would probably help if advocates of the technology understood this. It doesn't have to be all or nothing. Two alternative solutions can add up to a more powerful solution than either would be alone.
Parent
Re: (Score:3)
Re: (Score:2)
Unfortunately, in practice, people are going to zone out, talk to their passengers, mess with their radio, etc. I'd much rather have them ask their car for a song or directions than have them look down to adjust the radio dials or check a map. That's what this technology is trying to address, and I would guess it will eventually make us safer, should they get it adopted and used in a wide
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
The studies you saw were specifically designed to find that cell phones are dangerous.
"At least the adult passengers can see the circumstances and have a chance to shut up if the situation is tight. Someone on the other end of the line isn't going to get that."
Not only are many passengers not adults, you cannot just hang up on an adult passenger if you need to.
"Also, an adult riding with
Re: (Score:2)
Ju
Just so we're clear (Score:2)
Limited phrasebook (Score:4, Interesting)
Re: (Score:3, Funny)
Same thing applies to the doors. The doors know exactly when someone is going to walk through them, because they are plot-directed. You can stand mere inches away from a door, facing it, but until the plot indicates that the time
Re:Limited phrasebook (Score:4, Funny)
Phone: Yeah, sure, it's cute enough, but I think I can do better.
Parent
Re: (Score:2)
The hardware problem isn't as big as the software one. Sure Steve Jobs' iPod can't do his taxes with stock firmware, however with a different OS I am sure that it could be done. It used to be that speech recognition would become a reality when your processor was fast enough, now we have quad-core CPUs running at 3 GHZ and it still hasn't been done reliably.
Oh the coming litigation (Score:2)
layer mismatch (Score:2)
I'll stick to using voice for "higher layer" communication with actual intelligences like humans and other animals. For "lower layer" comms you don't use your voice.
If you ride a horse while you do talk to the horse sometimes, the talking is for the "higher layer", you use reins and body for "lower layer".
The last I checked all these gadgets and devices are pretty stupid, definitely no real AI. So it'll be more gimmicky than actually useful.
For such t
Dang Lazy Gadget (Score:2)
The trouble with this is... (Score:2)
The trouble with this can be summed up like this: Would you typically go through your day with a 6 year old, giving the 6yr old instructions on who to dial, what emails to send etc.?
No? Then you can forget the voice recognition stuff. Voice recognition substitutes What? for the typical 6yr old's Why?
There are a lot of people who have VR dialing on their phone now. Do you ever see anyone using
What would make me happy.... (Score:2)
Re: (Score:2)
I think I saw a documentary about a prototype for this... it translated anything you said to helpful phrases such as "Free mustache rides" and "Suck it, bitch, suck it dry".
I Hate This Shit (Score:2)
It sucks and I hate it and it's bullshit and the charlatans selling this shit should be shot in the kneecaps. You're *garbage*.
Multiple voice recognition gadgets (Score:2)
Mississippi John Hurt/Lionel Trains voice command (Score:2)
I am not impressed. I will bet you a nickel that he tried that out prior to the demonstration, and made sure there was nothing similar that might come up by accident. I would be impressed if he had given the mike to reporter Michael Fitzgerald and Fitzgerald had tried it.
At trade shows, I used to watch all sorts of demonstrations of
User friendly GPS (Score:2)
Re: (Score:2)
But yes, that WOULD be a useful thing.
Re: (Score:3, Interesting)
Re: (Score:3)
Re:I wonder... (Score:5, Funny)
Parent
Re: (Score:3)
One of the features of my new phone is "Voice SMS."
Think about that for a moment. It's like a text message, but it's voice. On a phone.
According to Sprint [sprintpcs.com], the reason this is better than a normal voice mail message is that you're guaranteed to leave a message and not actually reach the person you're calling (which comes up how often?) and that the text message UI is easier to deal with than the voice mail system. (Then why not offer a voice mail UI?)
And, of course, it wastes both a text message and d
Re: (Score:2)
Yeah, we'll call them "Freudian slips" instead.
And just how are they going to say "LOL"?
Probably by saying "laughing out loud."
Re: (Score:2)
Re: (Score:2, Interesting)