[Q] Audio decode, recognize a keyword - Java for Android App Development

Hello,
Constantly-on speech recognition listening for just one keyword.
I am trying to do is making my app constantly listen for one keyword that will fire an intent whenever the keyword is recognized.
I know that this will use a lot of battery. and I don't want to use google's speech recognition.
For example - you are talking with a person. Normal conversation. The phone is actively listening and recognizing every single said word and listening for the keyword.
Let's say the keyword is "cheese" in this instance.
Whenever you say "cheese," the application fires an intent that starts up another part of the app.
I tried to record myself saying "cheese" into wav file
And then comparing it to every word that was spoken..
my problem is finding the right tool that can help me perform this signal comparison in the simplest way so it can work on any device..
tried musicg library with fingerprint function but it does not work so well..
tried some other fft/cross-corelation/ect.. functions but I didn't get the result I expected..
any help (examples or some library would be best) that you can give me will be very appreciated ..
Thanks.

It looks like Java has its own speech recognition API (basic info about it here: http://en.wikipedia.org/wiki/Java_Speech_API).
I also found the official FAQ with instructions on how to download and use it etc. http://www.oracle.com/technetwork/java/jsapifaq-135248.html
Hopefully, it's pretty high quality and works well. Good luck!

marwan.kallal said:
It looks like Java has its own speech recognition API (basic info about it here: http://en.wikipedia.org/wiki/Java_Speech_API).
I also found the official FAQ with instructions on how to download and use it etc. http://www.oracle.com/technetwork/java/jsapifaq-135248.html
Hopefully, it's pretty high quality and works well. Good luck!
Click to expand...
Click to collapse
Thank you!!! I'll try it

Wow fantastic question,how about refering the sourse codes of google voice?So u can know what they have done....:thumbup:
.........................................
visit www.fb.com/softcrush

Just FYI, speech recognition is some complex stuff. I believe that Google uses some really advanced techniques such as deep neural networks for their speech recognition.
Here are some links if anyone's interested:
Wired Article
Google Research (Theory Based)

Related

Emailing voice clip

New member here… I usually find answers to my questions through searches/lurking. Unfortunately, I haven’t found any viable solutions to my particular need and wish to present it for possible suggestions.
Frequently during my morning commute to work, I have ideas and/or reminders race through my head that I would love to capture with a high degree of confidence that I will remember to follow-up on them when I arrive at work. (i.e. “hey what if we do it this way”, “don’t forget that they xxx report is due today”)
I would love to be able to assign a function to a button on my Wizard that, when pressed, would record a short 30-second voice clip to a .wav file and automagically email it to my work email address. All without any on-screen interaction requirements that would distract my driving.
I have investigated a lot of commercial and freeware voice dictation tools and similar general recording tools. Additionally, I have tinkered with various scripting (Mortscript) and macro recording (Pen commander) capabilities. Thus far, I have not found a 100% answer to my need.
I have come close to a solution via Mortscript but still lack a specific capability to complete the solution. Basically, I need a way of capturing a sound clip basically from a command-line interface. (i.e “\Windows\RecordVoiceClip.exe –seconds 30 –file TodaysDateTime.wav”)
I would appreciate any suggestions either for a application that would serve my purpose or a possibly a web source where I might find a solution.
…..BriansPhone

Voice Activation Software

I quite like the Cyberon voice dialer utility, and use it as much for the applications as for voice dialing (saves me navigating around the Wizard when I can't be bothered), but I kinda feel like it could do a little more and wondered if there was either an update (can't see one on the Cyberon webpage) or maybe some alternative software you guys could recommned?
I was thinking about one that uses voice commands to securely lock the phone for instance, or maybe one that can do macros, control functions within Media Player (pause, play, next, etc.), and other stuff. I realise this might be a bit hopeful, but some advice/recommendations on good voice activation software woud be nice, nonetheless.
Thanks in advance.
Mannish Boy said:
I quite like the Cyberon voice dialer utility, and use it as much for the applications as for voice dialing (saves me navigating around the Wizard when I can't be bothered), but I kinda feel like it could do a little more and wondered if there was either an update (can't see one on the Cyberon webpage) or maybe some alternative software you guys could recommned?
I was thinking about one that uses voice commands to securely lock the phone for instance, or maybe one that can do macros, control functions within Media Player (pause, play, next, etc.), and other stuff. I realise this might be a bit hopeful, but some advice/recommendations on good voice activation software woud be nice, nonetheless.
Thanks in advance.
Click to expand...
Click to collapse
There're several additioanl voice controller apps; see my earlier reviews linked from here.
This link does not work since the whole site has been upgraded
I have search the new FTP and could not find it...
Can you send it though PM?
This way I can try it and maybe help you to solve the issue you are facing
Tx
Eric
microsoft voice command will launch any app in the startmenu, you can rename the app to something easily recognised. you can add shortcuts to the startmenu or programs folder to point to any app with arguments, within the shortcut.
I don't know of any way to control a media player by voice other than simply launching the player. I need to use the headset for voice control if listening to audio, the recognition doesn't work if music is playing via the speaker.

Speech Controlled Application Development on PPC

Hello All,
I hope I can ask you this question in this section.
I am not a hardcore developer but I have made some custom applications for PPC. NOW, I have a project that I need to develop an application that can be controlled with voice.
The project is designed for visually impaired users and I am trying to do indoor guidance for them. I have all parts of my project worked out, but I need to have a development tool that I can embed in my PPC application to do speech recognition of ** my own custom command ** and maybe do speech synthesis.
I have worked with Speech SDK 5.1 from Microsoft on XP and Vista, but as I understand there is no SDK for Microsoft Voice Command 1.6.
Do you guys have any suggestions or know of any SR engine that I can embed and program into my application?
PS - I am using AT&T Tilt as my test platform at this time and the goal is to have the application made for all Windows 5.0/6.0/6.1.
Regards,
HoSsEiN
hgn842001 said:
Hello All,
I hope I can ask you this question in this section.
I am not a hardcore developer but I have made some custom applications for PPC. NOW, I have a project that I need to develop an application that can be controlled with voice.
The project is designed for visually impaired users and I am trying to do indoor guidance for them. I have all parts of my project worked out, but I need to have a development tool that I can embed in my PPC application to do speech recognition of ** my own custom command ** and maybe do speech synthesis.
I have worked with Speech SDK 5.1 from Microsoft on XP and Vista, but as I understand there is no SDK for Microsoft Voice Command 1.6.
Do you guys have any suggestions or know of any SR engine that I can embed and program into my application?
PS - I am using AT&T Tilt as my test platform at this time and the goal is to have the application made for all Windows 5.0/6.0/6.1.
Regards,
HoSsEiN
Click to expand...
Click to collapse
I may be a little out of touch with what your asking for but Microsoft Voice Command has both audio input and reply. You may want to check it out.
You are right ...
Curious,
Thats right. I do use Voice Command on my PDA, but I don't think that I can have it to respond to ** Custom ** commands!
I actually contacted Microsoft and asked them about this and they said there is no way to customize the Voice command and it is a closed source code and I cant hack into it.
In my application I want to have the user say: "I need to go to room 200" (or just "room 200") and my program should be able to understand it and using my indoor positioning system rout the blind person to that room.
So I am hoping to find an engine ... SDK ... or toolkit that I can embed in my application.
Do you know if you can hack into the Microsoft Voice Command to do this?
Thanks for your reply ...
HoSsEiN
If you can find a software package to handle the voice recognition, you may still have problems with background noise interfering. My previous phone had a voice-activated dialing application that didn't need to be trained for most names. However, I found that, if I was on a road with a lot of traffic, I generally had to wait for a pause in the traffic before the phone could recognize what I was saying. In the case of this indoor-guidance application, the main issue would probably be other voices in the background, such as in a busy hallway.
Ambient noise is an issue, you are right.
I may be able to use a hands free headset that does a bit of noise cancellation to alleviate this problem, but first I need to find the engine to get it working.
Just as "pie-in-the-sky" armchair development, but maybe you could do it client-server. I think this is how Microsoft Live Search works-- I'm guessing it records a sound, then sends that to the a speech recognition server. The server then sends back the best guess at what was said.
No idea what the back-end server is though. Maybe Microsoft Speech Server
Dromio said:
Just as "pie-in-the-sky" armchair development, but maybe you could do it client-server. I think this is how Microsoft Live Search works-- I'm guessing it records a sound, then sends that to the a speech recognition server. The server then sends back the best guess at what was said.
No idea what the back-end server is though. Maybe Microsoft Speech Server
Click to expand...
Click to collapse
Dromio,
Thanks for your comment.
I did a bit of a search for this one before. It sounds promising but I am not sure how easy it is to implement. I had an impression that the Speech Server was mainly build for telephony connection, but as you pointed out Microsoft Live search does that in multimodal way with data only. It is of course subject to be always connected to a network and with a slow connection would probably not work as it should.
Also, I've not been able to find a good working example of server-client SR so far. Maybe if someone has done it and are willing to share it with us, it would shed more light on this approach.
Regards,
HoSsEiN

Voice Dialing via Bluetooth button except Voice Commander

I've been fighting with the X1 for over 3 roms and searching the net and xda for some hours without any solution to this.
Simple as that, I want to launch a voice dialing software, (or any other application) excluding Voice Commander, from my bluetooth button, which will take input (voice) from the bluetooth microphone and not from the main unit.
Am I asking too much? This thing can happen quite easily to phones that are decades old. Voice commander is pointless and utter crap for non supported languages. I've found various recording apps for that purpose but I haven't managed to launch them using the bluetooth button. If I am to use the main unit, voice dialing misses its purpose.
So any hints or advice where to head to? I've seen many registry keys and bluetooth audio gateway dlls but none could actually execute a custom application instead of the damn Voice Commander.
Thanks.
For me, Microsoft Voice Command 1.6 works perfectly, but I guess you're saying that it's not available in Greek?
Galbadia Hotel
Click to expand...
Click to collapse
I do not think you should have that site listed in your signature, if you don't mind my saying so.
What happened when Google visited this site?
Of the 1156 pages that we tested on the site over the past 90 days, 10 page(s) resulted in malicious software being downloaded and installed without user consent. The last time that Google visited this site was on 2009-01-21, and the last time that suspicious content was found on this site was on 2009-01-21.
Malicious software includes 7 trojan(s), 5 scripting exploit(s). Successful infection resulted in an average of 4 new processes on the target machine.
Malicious software is hosted on 5 domain(s), including d0fus.com/, yinjoy.net/, ffshrie.com/.
1 domain(s) appear to be functioning as intermediaries for distributing malware to visitors of this site, including 59.60.31.0/.
This site was hosted on 3 network(s) including AS21844 (THEPLANET), AS19166 (ALPHARED), AS32748 (STEADFAST).
Click to expand...
Click to collapse
George Knighton said:
For me, Microsoft Voice Command 1.6 works perfectly, but I guess you're saying that it's not available in Greek?
Click to expand...
Click to collapse
Not only that but some people prefer prerecorded voice recognition than Voice Command. In any case it should be possible to launch any application you want with the bluetooth button voice dialing or not, the question is how.
I do not think you should have that site listed in your signature, if you don't mind my saying so.
Click to expand...
Click to collapse
The site once was a paradise of music but it's been down for some time. Removed the link until it is back up again.
So people anyone? I wouldn't have imagined that XPERIA can't record callers voice nor a configurable way of voice dialing with bluetooth. These things are for granted in other platforms...

Capture audio from other app?

Hi, I am new to android dev, but know java and other languages fairly well from before.
I have a simple question: is it possible to capture the audio from not only the mic or voice calls as defined in MediaRecorder.AudioSource, but instead from another app?
reb8 said:
Hi, I am new to android dev, but know java and other languages fairly well from before.
I have a simple question: is it possible to capture the audio from not only the mic or voice calls as defined in MediaRecorder.AudioSource, but instead from another app?
Click to expand...
Click to collapse
Like what kind of sounds. You could unpack the app and look for the audio files.
Suppose I want to build an application that broadcasts my telephone conversations on the net in real time. The infrastructure of such an app seems like a fairly straight-forward thing: use MediaRecorder to record the voice, throw it onto the net (ok, this is the opposite of simple, but you get the idea).
Now, suppose I'd like to broadcast the sound from an mp3 application instead. As I'd like to do it in real time, looking for it on disk is not the solution.

Categories

Resources