Microsoft Voice Command Almost Here 292
PDA User writes "The new Microsoft Voice Command for Pocket PC isn't supposed to be out until the next Comdex, but someone inside the company posted details to Handango and Geekzone posted a preview.
The application notifies users of appointments, and answer simple English questions. It does not have "Do you want fries with that?" in the vocabulary though."
The voices (Score:5, Funny)
I'm very afraid. (Score:5, Funny)
Wear do ewe won 2 goatee day?
As I recall, voice recognition still ain't quite 100% yet...
Re:I'm very afraid. (Score:5, Funny)
And it never will be. Ever go through a drive through and successfully convey your order to the 17yo on the other end?
Re:I'm very afraid. (Score:5, Informative)
ain't quite 100% yet...
And regular speech is?
FYI it's at 97%, give or take a couple. Good enough for TellMe to increase ATT's 800 automation rates from 15% to 70% [nytimes.com]. Good enough to automate cop cars [yahoo.com]. It's been a long time coming, and noisy environments are still a challenge, but it's fo real, now.
(Shamelss plug: the really good stuff is running on the engine from Nuance [nuance.com].)
Re:I'm very afraid. (Score:3, Insightful)
Dude, human voice recognition isn't 100% either. How many times a day do you say "excuse me," or "what?" because your ears missed something, or somebody mubled?
Voice recognition will never be "100%" because speech isn't perfect. At least not until voice recognition software designers realize how flawed speech is, and program software to say "Can you repeat the part about the...stuff?"
Plug-in (Score:2)
Star Trek again? (Score:2, Interesting)
What gives?
Re:Star Trek again? (Score:2)
What, does Kirk have a RFID under his skin now? Or is there one in that funky badge of his?
Strangely (Score:5, Funny)
Re:Strangely (Score:2)
... at the presentation... (Score:5, Funny)
format c!
(then another one...)
enter!!
yes!!!
enter!!!!
That happened in '95, to the IBM Viavoice Guy (Score:4, Funny)
Unix version... (Score:5, Funny)
my version. (Score:2)
rm: `/': Permission denied
No big deal. My mail client and web browser get the same answer, unlike some software I know of.
I wonder if M$'s new talker will auto load email and blab it out for the user. Assides from the inapropriate nature of most M$N, Hotmail and other spam M$ users suffer under and would NEVER want read aloud, chances are a malicious email would blow the thing up, install keyloggers or what not. That's what happens when you run as root. Go figure why M$ ha
well maybe not really... (Score:2)
"I got in my car and drove to Washington." stops becoming "I gotten a minicart and dove to washing my son." I don't think it will really catch on.
Re:well maybe not really... (Score:2)
I saw a guy from MS demo this technology on his tablet PC a few months back. The demonstration was slightly spoilt by the look of utter surprise on the guy's face when it actually worked perfectly first time...
Now, I'm really scared (Score:4, Funny)
Oh, the pain. The pain.
Re:Now, I'm really scared (Score:4, Funny)
Re:Now, I'm really scared (Score:2)
Re:Now, I'm really scared (Score:2)
"You are trying to close Windows, I'm sorry, I can't let you do that"
Re:Now, I'm really scared (Score:2)
sure, enough pain for everyone. (Score:2)
I can see clippy trying to read my wife's email.
-BING- Spam with M$ signature lands on her desk.
Clippy: You have new mail!
Clippy: En-large UR Peniz! From Bill Gates.
My wife smacks her keyboard.
Clippy keeps reading the message.
Email loads up Gator through midi exploit.
Clippy keeps wagging his finger and spam blinks, but all is quiet while trickler starts to work.
Handheld detects Advert Avoidance flaw, YOU KILLED CLIPPY, you bastard, and slags it's core.
I
Gimic or Paradigm? (Score:4, Insightful)
I can't think of a lot of reasons I'd want something like this on a home system. Besides the creepy "I'm talking to the walls" feel, I can't imagine it would be any faster or more engaging to interact in this manner.
Public use cases seem like a good idea, until you realize that background noise and assholes shouting commands over your sholder could end up causing more problems than the system solved.
When this kind of thing can be attached to a really powerfull datamining engine and equiped with a much larger vocabularly I'll be impressed. Right now it looks like more of a toy.
Re:Gimic or Paradigm? (Score:3, Insightful)
-Chris
Re:Gimic or Paradigm? (Score:2)
Re:Gimic or Paradigm? (Score:2)
I mean come on, Microsoft's spell checker leaves much to be desired, imagine how much their voice recognition will suck.
"Microsoft is insecure" somehow translates to "Microsoft is secure."
Re:Gimic or Paradigm? (Score:2)
There are already rules that limit cell phone use in various public places (including the ferry on which I commute to work), precisely because it is damned annoying to hear people yacking all the time!
I know that if I'm at work trying to solve a particularly sticky pro
Re:Gimic or Paradigm? (Score:2)
You're forgetting how many handicapped people are in the world. This can be useful for blind, MS or even arthritic folks. Microsoft has done enough damage making web pages that can't be rendered except by IE, maybe this is a little payback.
Re:Gimic or Paradigm? (Score:2)
Not with today's audio processing capability. My cellphone only hears me when I'm talking on it, even when I'm standing in Starbucks on a Saturday evening with the espresso machine going - even I can't hear me then, but my phone filters out all the background noise no problem.
That being said, I agree that it will be a lo
Re:Gimic or Paradigm? (Score:2)
They can use the computer without the need for physical dexterity.
virus writers come hither... (Score:4, Funny)
Suddenly when bill was on the train with his new PocketPC, his speak starter blurting "I BROWSE PORN PORN PORN PORN PORN PORN......."
Re:virus writers come hither... (Score:2)
In my case, that means I've just hit a shortcut icon, and a well used one at that.
Yes! (Score:4, Funny)
To which he'll reply.. (Score:4, Funny)
Exciting new technology (Score:2)
It usually worked, too. If not with "start dopus", "staaart dopus" or "START DOOOPUS!!!1", taking a breath and pronouncing it calmly would execute the command eventually.
Uhhh...so? (Score:5, Interesting)
Re:Uhhh...so? (Score:2)
Office 2000 had support for voice recognition. Although admitidally it wasn't part of the OS and I don't know whether or not this was before Apple.
Re:Uhhh...so? (Score:3, Informative)
Bear in mind that this is Voice Recognition for a 400 mhz PocketPC handheld device and not some hulking great PowerMac with oddles of RAM and at least a Gigahertz processor.
It's not really that innovative--is it?
Maybe not, but still impressive given the limitation of your average PDA.
Re:Uhhh...so? (Score:2)
It is news that someone has finally implemented it in a PDA. However, it's only interesting from social/marketing context, not a technical one. It will be interesting to see how/if peopke will use it once the novelty wears off.
Re:Uhhh...so? (Score:2)
In other words, there is no technical reason why it won't work on a PocketPC, and it's not particularly amazing that it would. If it doesn't work when it's released, the only possible culprit is Micros
Re:Uhhh...so? (Score:2)
I guess what makes this new Microsoft technology so exciting is to find out who it is they stole it from.
I'll be watching the headlines for news of a lawsuit from a small voice recognition software company that 'entered negotiations' a few years ago with MS which were 'suddenly dropped without explanation' after said small company 'show
*Pocket* PC (Score:5, Insightful)
Number one it's much more difficult because of limited resources. I don't know of many who've actually accomplished useable voice recognition on a pocket platform.
Number two, think about the uses of this. The two major bottlenecks of handheld systems are input and output. If the speech recognition gets good enough (which I doubt at this point but which will probably happen sooner rather than later) you might not have to use the little pin-pusher thumb keyboard or handwriting recognition. Also, think of this being used on a combination pda/cell phone. You can use your hands free set for the phone to control the pda and also get information from it. You could use it driving much easier then.
I'm not getting all lathered up over this, but it is kind of cool.
Re:*Pocket* PC (Score:2)
Re:*Pocket* PC (Score:2)
Apple's had their voice recognition technology since 1994 - that's before they had PowerMacs. Back then they used a DSP, but on the first PowerMacs (66 MHz to 80 I thnk; I've used it on my 7200/90, a 90 MHz machine), it ran great on a 90 MHz machine. This is no big deal, and easy as pie on a 400 MHz machine. Which I'm sure has more RAM than my
Re:Uhhh...so? (Score:2)
For all intents and purposes Steve Jobs IS a cult leader, and he's running the company. Watch him set fire to the compound when the IRS closes in.
is there a three-finger salute replacement? (Score:5, Funny)
CONTROLLLLLLLLLLLLLLLLLL!
ALTTTTTTTTTTTTTTTTTTTTTT!
DELETEEEEEEEEEEEEEEEEEEE!
gives an entirely different meaning to "chording", eh?
knyuk knyuk knuk (Score:2)
HELLLOOOO!
HELOOOOOOO!!
HELLLOOOOOOOOO!!!
hello!
Oh, I'm sure it'll "help" your computer operation. (Score:4, Funny)
(Launches IE)
Me: LAUNCH MOZILLA
(Launches IE)
Me: FOR THE LOVE OF GOD, LAUNCH MOZILLA.
(Launches word, starts "Thank You, Microsoft" letter with help of Clippy... and uninstalls Mozilla because, well, it is obviously the cause of the frustration it senses).
You might laugh, but...
Re:Oh, I'm sure it'll "help" your computer operati (Score:2, Funny)
No, I wouldn't. You'd need to write something funny, first.
Re:Oh, I'm sure it'll "help" your computer operati (Score:2)
I used the Microsoft Desktop voice commander system in college (mostly to bother my roommate...i had no difficulty clicking on things). It had no problem opening Lotus WordPro, Netscape Communicator, Metrowerks CodeWarrior or Harvard Graphics, which were my big use programs at the time.
And this was well before the antitrust suit.
Re:Oh, I'm sure it'll "help" your computer operati (Score:2)
Don't worry, we won't.
Tech for Tech's sake (Score:5, Insightful)
The answer is not because the technology is not good enough. Speech command software has come a long way, and in most situations, with the right microphone, it can be very realiable...if you talk clearly.
The problem is that people don't naturally talk clearly. They repeat themselves, add in lots of "ummm"s and "errr"s and "like"s, and generally speak in ways that only another human could symantically understand. Because of this, using a keyboard or mouse to communicate with a computer is always going to me a more effecient mechanism.
Think about it...which is faster: Saying (in a clear, even tone) "Select the 3rd item in the list" or just clicking on it. Even in ideal environments using buttons on a PDA is going to faster and more reliable than voice command.
The only reason humans can use speech to effciently communicate with each other is that along with speech comes tone, body language, and symantic context that conveys as much (if not more) information than the vocal message itself. Computers cannot pick up on those kinds of things.
I seriously doubt that anyone will make extensive use of this feature.
Nope, just think outside the "box" (Score:5, Insightful)
You miss the point. Just like with Pen input, everyone wants to think of the tech as a replacement for keyboards when in reality it is simply an alternative form of input that will be extremely useful given the right situation/environment. One obvious one is allowing input/control for those who are disabled (or close to it). A friend of mine's mother had horrible arthritis in both hands, it really kept her from being able to use her computer much. Using voice input allowed her to utilize it much more than what she would have been capable of otherwise. Another general scenerio is someone who needs to use their hands, but would be convenient to "use" a computer while their doing their activity. Heck, this could be useful even driving. One of the biggest complaints about the fancy computer systems in higher end cars today (e.g. BMW) is their complexity. Well this is perfect since BMW uses WinCE (for now at least) and telling the car to "adjust the temperature to 72 degrees" is simpler than wading through the menus (or to "give directions to Hudson St").
So you are correct in that it's not likely to walk into a cubicle farm and hear "File|Save As|myresume.doc" and "10 of clubs under jack of diamonds" coming from the mouths of four dozen workers. But there are PLENTY of other very useful applications for this technology.
Re:Nope, just think outside the "box" (Score:2)
Considering the last time I played solitary, I think that 10 on jack would be a good command to recignise. At the very least highlight all 10s and jacks. (others are generally the one suggesting, and there is sometimes a good reason not to make a legal move)
Voice commands should be very useful, but it will take a lot of work to figgure out where, and what the right thing to do it. (I'd be very mad if the 10 moved from a pile with no cards under when all kings are placed, when the other 10 is hidden and
It has it's place in some circumstances (Score:3, Insightful)
Re:Tech for Tech's sake (Score:2)
Plus your wife might bust you having cybersex in an AOL chat room if you were actually talking to the computer.
Re:Tech for Tech's sake (Score:2)
You don't own a Mac? ;-) (for those who would rant on, this is an obvious joke, grow a sense of humor)
Re:Tech for Tech's sake (Score:2)
You... may... have... this... problem... but... some... of... us... have... already... mastered... the... required... skills... to... communicate... effectivly... with... the... computer.
</kirk>
Re:Tech for Tech's sake (Score:2)
I remember doing about 3 1/2 hour training sessions with Dragon speech-to-text or whatever it was called. I had it working fairly decently, it would get *most* words. I also had it set up to do some simple commands ("open wordperfect" etc).
Maybe a couple weeks after that, my hard drive crashed. That was the end of
Re:Why speech-based UI is doomed to limited uses (Score:2)
Uh oh... (Score:4, Funny)
"Who's Jonnhy?" she said, and smiled in her special way...
-Adam
Voice as a tool (Score:4, Interesting)
The alternative, tiny keyboards or crazy script can be good or bad, bu voice isn't going to be more than just another sub-division of users who think murmuring to their PDA is fun. In fact, there's no perfect input except for those crazy fsking monkeys and their mechanical arms!
Set Vocabulary? (Score:4, Insightful)
I have seen the same problems with automated phone systems that are supposed to recognize a generic voice and I can see the same thing happening here.
The main difference here though, is that when entering text, you know exactly what you input before pressing enter. With voice recognition software, how do you know that the software "hears" exactly what you say? If you say somethign like "What are my appointments for the thirteenth?" and it hears, "What are my appointments for the thirtieth?" you would be receiving the wrong information.
I hope this is a success but I don't have my hopes up.
I can see it right now: (Score:4, Insightful)
If Microsofts attempt on handwriting recognition is any indication, this thing will fail terribly. Have you ever tried to use a Tablet? You must be a real bad typer if this makes you more productive.
Just my guess, please proof me wrong since these things would be really cool if it worked.
cu,
Lispy
Yes, I have used the Tablet. (Score:2)
Already been done (Score:5, Funny)
http://www.theonion.com/3941/
Voice Recognition Software Yelled At
NEW YORK--Fidelity Financial Services' Gwen Watson, 33, shouted angrily at her IBM ViaVoice Pro USB voice-recognition software, sources close to the human-resources administrator reported Monday. "No, not Gary Friedman! Barry Friedman, you stupid computer. BARRY!" Watson was heard to scream from her cubicle. "Jesus Christ, I could've typed it in a hundredth of the time." After another minute of yelling, Watson was further incensed upon looking at her screen, which read, "Barely Freedman you God ram plucking pizza ship."
Re:Already been done (Score:5, Funny)
"Dial 7"
*pause*
"Dial 7!"
*pause*
(computer): "Command not recognized. Please try again"
"Dial 7"
*pause*
"DIA.."
(computer): "Cannot dial that number, please try again"
"Dial 7!"
(computer): "Dialing 5.... boo bee dee dat bap"
"DIAL SEVEN MOTHERFUCKER!"
It was a riot. I think that guy almost gave himself an aneurism testing that software.
Re:Already been done (Score:2)
"Computer!"
"Shut down!"
"Yes!"
Old farts will love this (Score:2, Insightful)
When I worked for a large law firm, that was one of the most aggravating and consistent comments we would get was "Why do I have to do all this typing? Can't you just load some software that'll listen to what I want?"
I've never seen a real practical use for this in an office environment.
*
Re:Old farts will love this (Score:2)
Not only will we have people giving out loads of private information public while talking on their celphones; we'll now get the priveledge of going even further into their private lives! Imagine listening to someone write a personal e-mail while on the train, or writing a report regarding something your boss did that cost the company a million
anyone for... (Score:2, Funny)
You killed it, you bastards (Score:2, Funny)
Slashdotted. But it leaves a nice advertisement for Microsoft on your screen.
Microsoft JET Database Engine error '80004005'Unspecified error
Insert Dilbert reference (Score:2)
Good example why we need Microsoft yet (Score:2, Insightful)
I can see it now ... (Score:5, Funny)
You: "Open http://www.kernel.org"
PDA: "Error 403: Forbidden. You are obviously attempting to circumvent my artificial intelligence by installing Linux -- which is currently legally owned by SCO, by the way -- and therefore I'm going to need to
OS/2 had this feature for years (Score:2, Informative)
Fine that MS is going to catch up.
How often are you in a quiet enough environment? (Score:2)
But it has one fatal flaw. I can't use it when I'm listenin
This is for Pocket PC PDAs, NOT PCs! (Score:5, Informative)
It's specifically targeted at Pocket PC Phone Edition devices, but will also work on non-Phone Edition Pocket PCs. I've been testing this for a while on my Pocket PC Phone Edition 2003 device.
On the Phone Edition, the ability to call any one of your contacts at any one of their numbers (work, home, cell, etc.) by just speaking the command ("Call John Smith at work"), with no recognition or name training at all, is pretty darn cool. Add in a hands free headset, and you can interact with your Pocket PC Phone, including making calls, checking your appointments, and listing to WMP, all while never taking the device out of your pocket. Or better yet, while in the car, never taking your eyes from the road.
The software isn't meant for Joe Laptop user, and it doesn't replace simple interactions with your PDA (there's no way to create new items, for example), but for Phone Edition devices especially, it does add a lot of functionality and even safety. It's a lot faster for me to call someone by saying "Call so-and-so's cell" than by tapping through my Contacts until I find them, and then tapping the number I want to call. And if I'm driving, it's a lot safer for me to speak what I want than to futz around tapping on the screen.
Re:This is for Pocket PC PDAs, NOT PCs! (Score:2)
It would be a lot safer for me if you made all your calls at home, or in the driveway, or any stopped position, rather than diverting your attention away from the road.
I will never understand the jackasses who can't seem to spend the 3-4 minutes at home making any calls, and instead decide to do it while going 60mph on a freeway.
What would be fun (Score:2)
I can only imagine (Score:3, Insightful)
>> Acknowleged, changing permissions to Administrator.
No. List my appointments.
>> Delete all records in database: Are you sure you want to do that?
NOOO!!!!!
>> Yes. Ok. Database deleted. Continue with disk format?
Menial jobs are safe... for now (Score:3)
Well, thank God. The day someone figures out that voice recognition systems are no worse than undermotivated teenagers and cheap-ass microphones at correctly understanding drive-through orders, millions will be out of work.
That's great... (Score:2)
Oh wait - that was IBM. Got my Star Trek references mixed up.
Anyone remember SAM? (Score:2)
"SAM is now activayted!"
Re:Ataris rule commiedores drool (Score:2)
SAM Speaks! [retrobits.net]
Been doing that in OS X for years (Score:2)
Loosely quoted from Headcrash (Score:2)
"Most people password their start-up sequence. I password my shutdown sequence. You only need to see study hall in the week before final projects are due, with some frat initiates running up and down the aisles yelling 'SHUT DOWN!' to understand why."
Or, from Dilbert, "Well, somebody has a voice activated PC. A lesser engineer would be jealous, but I'm just as happy without it. After all, I'd hate to accidently DELETE! a FILE!"
What? (Score:2, Funny)
Did my computer just say, "Bend over, fool! It's time for an upgrade!" ?!?
expecting too much? (Score:2)
Very high 90s % recognition rate on untrained voice with unlimited vocabulary in noisy environment: not going to happen.
Context dependent response of the system to arbitrary commands: not going to happen (we all knew this, otherwise there are a few Turing
Re:expecting too much? (Score:2, Interesting)
Since the speec rec engines return confidence scores, the application knows how confident it is in the recogniti
"Remind Me' (Score:2)
All it takes... (Score:4, Funny)
Their going to call the Office assistant... (Score:2)
At least one command hardwired: (Score:3, Funny)
Re:Perfect for office.. (Score:3, Funny)
Re:Voice "Command". (Score:2)
Re:first command (Score:2)
Re:Sci fi? (Score:2)
Re:Sci fi? (Score:4, Informative)
This is for PocketPC's.. (Score:2)
Who's behind who again?