Whistled Platform Upgraded With Word Recognition
An anonymous reader writes "A few weeks ago, Slashdot featured a cheap platform performing 80 FFTs per second to recognize whistles. The platform is open hardware/open source and is aimed at sound processing projects. To this end, the creator (limpkin) just implemented a simple proof-of-concept algorithm that controls your lighting once the platform hears a particular word. A short video has been made to explain the basic concepts of sound recognition and to encourage hobbyists to make their own."
Just installed! (Score:1)
I now have a way to wipe all my drives and trigger my thermite detonators in case the feds come knocking. Now I just need to make sure no good-looking women come near me so I don't wolf whistle... I think I'll be OK!
prior art (Score:2)
Prior Art -> http://www.youtube.com/watch?v=1HVA5MuWy8o [youtube.com]
Not A Criticism, But... So What? (Score:5, Interesting)
The author of the book, a high-school student, built (with the help of his engineer father, I don't want to downplay that) a small robot that had obstacle sensors, light sensors, and some basic mapping capability so it could find its own charging station (not so different from a Roomba today).
But my main point is: it also had effective voice recognition, for simple commands. And the implementation was pretty simple: the audio input was amplified, then sent through 3 notch filters to separate high, medium and low audio frequencies. Each of the 3 frequencies was digitally sampled at about 40kHz. Repeated samples were averaged and saved in a table in memory.
The CPU (and remember, this was a 1970s-era CPU; if I recall, an 8080A, a Z-80, or similar) constantly sampled incoming sounds, and when one sufficiently matched one of the stored templates, it meant "command received".
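For anyone who wants to play with the idea, here is a minimal sketch of that template scheme in Python. It is not the book's code: the frame layout (one low/mid/high level triple per time slice), the function names, and the 0.2 threshold are all assumptions made for illustration.

```python
from math import sqrt

def average_template(recordings):
    """Average several recordings of one command into a single template.

    Each recording is a list of frames; each frame is a (low, mid, high)
    tuple of band levels. This layout is an assumption for illustration.
    """
    n_frames = min(len(r) for r in recordings)
    template = []
    for i in range(n_frames):
        frame = tuple(
            sum(r[i][band] for r in recordings) / len(recordings)
            for band in range(3)
        )
        template.append(frame)
    return template

def distance(frames, template):
    """Mean Euclidean distance between corresponding frames."""
    n = min(len(frames), len(template))
    if n == 0:
        return float("inf")  # nothing to compare
    total = 0.0
    for a, b in zip(frames[:n], template[:n]):
        total += sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return total / n

def recognize(frames, templates, threshold=0.2):
    """Return the name of the closest template, or None if none is close enough.

    The threshold is a hypothetical value; it would need tuning on real data.
    """
    best_name, best_dist = None, threshold
    for name, template in templates.items():
        d = distance(frames, template)
        if d < best_dist:
            best_name, best_dist = name, d
    return best_name
```

You would call average_template() once per command while training, keep the results in a dict keyed by command name, and then call recognize() on each incoming window of frames.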
It was a simple scheme, and it worked fine. I don't want to detract from this inventor, but in essence he is doing a similar thing. Except instead of using notch filters, he's using FFTs to do the frequency analysis and build (and then compare to) the templates. The ideas aren't all that different.
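To make the comparison concrete, here is a rough Python sketch of getting the same kind of low/mid/high levels from a spectrum instead of analog filters. It is not limpkin's code: the band edges (300, 1000, 3000 Hz) and function names are assumptions, and it uses a plain O(n^2) DFT for clarity where a real device would run an FFT routine.

```python
import cmath
import math

def dft_magnitudes(samples):
    """Magnitudes of the DFT bins from 0 Hz up to the Nyquist frequency.

    Direct O(n^2) DFT for readability; an FFT gives the same values faster.
    """
    n = len(samples)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n)
            for t in range(n)
        )
        mags.append(abs(acc))
    return mags

def band_energies(samples, sample_rate, edges=(300.0, 1000.0, 3000.0)):
    """Sum spectral energy into [low, mid, high] bands.

    edges holds the upper limit of each band in Hz; these values are
    hypothetical and chosen only for illustration.
    """
    n = len(samples)
    bands = [0.0, 0.0, 0.0]
    for k, mag in enumerate(dft_magnitudes(samples)):
        freq = k * sample_rate / n  # centre frequency of bin k
        if freq < edges[0]:
            idx = 0
        elif freq < edges[1]:
            idx = 1
        elif freq < edges[2]:
            idx = 2
        else:
            continue  # above the highest band, ignored
        bands[idx] += mag * mag
    return bands
```

Feeding it a 500 Hz tone sampled at 8 kHz should put nearly all of the energy in the middle band. The resulting triples could then be averaged into templates and matched the same way as the filter outputs.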
But personally, I think I'd prefer the old method, as it demonstrably worked at least as well as this, used only a few dollars in hardware in addition to the CPU, and was almost certainly less compute-intensive.
Keep in mind: that was 40 years ago. Maybe this newer approach has more potential; I don't know. But it certainly doesn't look much different at this time.
Re: Not A Criticism, But... So What? (Score:2, Informative)
If you can't see why this has more potential, you know nothing about DSP. Yes, for voice, 3 well-chosen fixed bands is mostly enough. But for whistling (as per the original application) or various other sounds, those same three bands will be pretty crap.
Now we could make the 3 analog notch filters tunable via a DAC output, and get good results for one group (at a time) of a wide range of sounds, but that complicates it substantially. Yes, doing it with FFTs uses a lot more computational power -- but when th
Re: (Score:2)
"But for whistling (as per the original application) or various other sounds, those same three bands will be pretty crap."
I'm not suggesting otherwise. I think you're missing my point, which was that THIS was about voice recognition, not whistling, and he is using a high-tech solution for that where a lower-tech, simpler solution might actually be better.
Nor am I saying we shouldn't do it this way. I'm only saying there are alternatives that might work as well for THIS application, which are also simpler and cheaper.
Dead Parrot (Score:1)
pretty cool (Score:1)
http://21stdigitalhome.blogspot.ca/2013/06/vcp200-voice-recognition-ic.html [blogspot.ca]
But... (Score:1)
Oblig. Demolition Man (Score:3)
Simon Phoenix, after reprogramming Dr. Cocteau's house lights;
"Nah, I changed that."
"Illuminate"
"De-luminate"
"Ah, Isn't that much better?"
(I swear, the US is looking and feeling more and more like the fictional "San Angeles" from the movie with every day that passes.)
Strat
Re: (Score:2)
Excellent point. By way of reward, I'm taking you to Taco Bell.
U.S. nuclear missiles have had this for years... (Score:1)
...apparently. It's why they had to keep Kevin Mitnick in solitary confinement so he couldn't whistle launch codes into the prison payphones: http://en.wikipedia.org/wiki/Kevin_Mitnick#Arrest.2C_conviction.2C_and_incarceration [wikipedia.org]
just imagine... (Score:5, Funny)
...if he could get it to recognize, say, the sound of two handclaps... wouldn't that be something?
Even better... (Score:5, Funny)
...the sound of one hand clapping.
They have been shut down (Score:1)
Prometheus (Score:4, Funny)
The moment I have to blow anything to use a device I will retire and become a Luddite.