Natural Voice Engines (Text-To-Speech) - XPERIA X1 General

Is there any way to replace the native MS voice engine (michelle.dat) used by Windows Voice Command 1.6 with a natural one, such as AT&T Natural Voices, NeoSpeech Voices, or Cepstral Voices? Would I need to find the C++ source and recompile it so that the natural voice takes the place of the native one? I have a text-to-speech program designed for Windows Mobile devices that uses one of the NeoSpeech voices. The voice file (.dat) is about 15 MB. Any ideas on the least time-consuming way to simply replace the voice built into Voice Command 1.6 with that natural voice? I'm not sure whether Voice Command 1.6 was compiled to look specifically for michelle.dat or simply for the folder location and/or file extension.

Sorry if this doesn't address your question, but what text-to-speech software are you using?
From another blog, I found a registry setting for Voice Command to slow down the speech playback to make it sound a bit better but nothing on actually replacing the voice itself.

It's Text-To-Speech Pro for Pocket PC by digitalfuture.

I'd just say Cepstral. I'm still frustrated because I have a great TTS engine and Voice Command, and I can't connect them.

Related

Voice Command Dialer by Cyberon

Does anybody know how to extract the excellent Cyberon Voice Dialer that comes in the JASJAR ROM? I've emailed cyberon.com.tw and they told me that it is distributed exclusively in the JASJAR ROM.
It works fabulously, and I will try to put it on my QTEK S100.
Thanks
Fernando
Just got my K-JAM. Everyone is wiping Cyberon Voice off to replace it with Voice Command. So I don't know how to get you Cyberon, but what I can tell you is that Voice Command is much better, which is why everyone with Cyberon is installing Voice Command.
Yes, but Microsoft Voice Command 1.5 doesn't work well with the Portuguese language. Cyberon Voice Commander works very well, even in the car or in a noisy environment. I've already tested both, and although with Cyberon we have to make voice tags for contacts and programs, after that it works fabulously...
Thanks anyway.
I tried every voice dial program, and here is what I think of them:
Fonix 2: Quite accurate, but very few functions and very slow.
Fonix 3: Adds a lot more functions, but even slower and hard to get working correctly.
MS Voice Command: Fast response and accurate even with noise. Lots of functions, and it works very well with the PPC (since MS knows all the details of the OS). But it doesn't work for me since my contact list is quite big. It doesn't let me filter the candidates for voice dialing, which makes it hard to get the right contact back. In addition, I wish it provided an optional voice tag function.
Cyberon VC 1.2 (anyone got the latest English version?): Very accurate, but doesn't work very well with noise. It is still my choice for voice dialing since it lets me filter the contacts and applications available for voice command.
Read this and install Cyberon Voice Dialer. No doubt the best ever...
http://forum.xda-developers.com/viewtopic.php?p=172877#172877
For me, this is a trimmed-down version of Cyberon VC (VC without the automatic recognition). Unless I'm really tight on RAM, I prefer VC since I don't have to build a voice tag for every contact.
I've used MSVC and now I'm using the Cyberon Voice Dialer (from the JJ).
It's true that MSVC occupies more RAM, but it doesn't require voice tags and it's very accurate too.
CVD is also accurate and has a small footprint, but it requires voice tags, which will take up a lot of RAM if you have a lot of them.
IMO, voice tags should be created only for frequent contacts/programs etc. so that you keep the memory footprint small. If you have, say, 2000 contacts, would you want each and every one to be voice tagged, and do you think you could remember all of them?
So, Cyberon Voice Dialer is still the better one if you manage your voice tags properly.
Sorry, it should read '...frequently used contacts/programs...'
Where do I download Cyberon Voice Dialer? Please help me, I need a program where I can make voice tags.
Please read the link above.

Voice Activation Software

I quite like the Cyberon voice dialer utility, and use it as much for launching applications as for voice dialing (it saves me navigating around the Wizard when I can't be bothered), but I feel it could do a little more. I wondered whether there is an update (I can't see one on the Cyberon webpage) or some alternative software you guys could recommend.
I was thinking of one that uses voice commands to securely lock the phone, for instance, or one that can run macros, control functions within Media Player (pause, play, next, etc.), and other stuff. I realise this might be a bit hopeful, but some advice or recommendations on good voice activation software would be nice nonetheless.
Thanks in advance.
There are several additional voice controller apps; see my earlier reviews linked from here.
This link does not work since the whole site has been upgraded.
I have searched the new FTP and could not find it...
Can you send it through PM?
This way I can try it and maybe help you solve the issue you are facing.
Tx
Eric
Microsoft Voice Command will launch any app in the Start menu, and you can rename the app to something easily recognised. You can also add shortcuts to the Start menu or Programs folder that point to any app, with arguments included in the shortcut.
I don't know of any way to control a media player by voice other than simply launching the player. I need to use the headset for voice control when listening to audio; the recognition doesn't work if music is playing through the speaker.
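For anyone who wants to try the shortcut-with-arguments idea, here is a rough sketch of generating the contents of a Windows Mobile shortcut (.lnk) file. It assumes the commonly described plain-text layout of a character count, a `#`, and then the quoted target plus arguments. The function name `make_ce_shortcut` and the example paths are made up for illustration, and the layout should be checked against a shortcut that already works on your device.

```python
def make_ce_shortcut(target_path, arguments=""):
    """Build the text of a Windows Mobile .lnk file.

    Assumed layout (verify against an existing shortcut on the device):
    <number of characters after '#'>#"<target>" <arguments>
    """
    command = '"{}"'.format(target_path)
    if arguments:
        command += " " + arguments
    return "{}#{}".format(len(command), command)


# Hypothetical example: a shortcut that opens a playlist in the media player.
shortcut_text = make_ce_shortcut(r"\Windows\wmplayer.exe",
                                 r"\My Documents\playlist.asx")
print(shortcut_text)
```

Saving that text under a name that is easy to say (in the Start Menu folder) should then give Voice Command something to launch, if the layout assumption holds.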

Speech Controlled Application Development on PPC

Hello All,
I hope I can ask you this question in this section.
I am not a hardcore developer, but I have made some custom applications for PPC. Now I have a project in which I need to develop an application that can be controlled by voice.
The project is designed for visually impaired users, and I am trying to provide indoor guidance for them. I have all the other parts of my project worked out, but I need a development tool that I can embed in my PPC application to do speech recognition of ** my own custom commands ** and maybe speech synthesis.
I have worked with Speech SDK 5.1 from Microsoft on XP and Vista, but as I understand it there is no SDK for Microsoft Voice Command 1.6.
Do you guys have any suggestions, or know of any SR engine that I can embed and program into my application?
PS - I am using an AT&T Tilt as my test platform at this time, and the goal is to have the application work on all of Windows Mobile 5.0/6.0/6.1.
Regards,
HoSsEiN
I may be a little out of touch with what you're asking for, but Microsoft Voice Command has both audio input and reply. You may want to check it out.
You are right ...
Curious,
That's right. I do use Voice Command on my PDA, but I don't think I can get it to respond to ** custom ** commands!
I actually contacted Microsoft and asked them about this. They said there is no way to customize Voice Command, that it is closed source, and that I can't hack into it.
In my application I want the user to say "I need to go to room 200" (or just "room 200"), and my program should be able to understand it and, using my indoor positioning system, route the blind person to that room.
So I am hoping to find an engine, SDK, or toolkit that I can embed in my application.
Do you know if it is possible to hack into Microsoft Voice Command to do this?
Thanks for your reply ...
HoSsEiN
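Whatever engine ends up doing the recognition, the step after it (turning the recognized text into a destination) can be kept separate and small. A minimal sketch, assuming the engine hands back plain text; the pattern and the name `parse_room_command` are illustrative only and not part of any SDK:

```python
import re

# Hypothetical command grammar: any carrier phrase, then "room <number>".
ROOM_PATTERN = re.compile(r"\broom\s+(\d+)\b", re.IGNORECASE)


def parse_room_command(recognized_text):
    """Return the room number as a string, or None if no room was named."""
    match = ROOM_PATTERN.search(recognized_text)
    return match.group(1) if match else None


print(parse_room_command("I need to go to room 200"))
print(parse_room_command("room 200"))
print(parse_room_command("what time is it"))
```

Keeping the grammar this narrow also means the recognizer only has to tell a handful of phrases apart, which may help with accuracy.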
If you can find a software package to handle the voice recognition, you may still have problems with background noise interfering. My previous phone had a voice-activated dialing application that didn't need to be trained for most names. However, I found that, if I was on a road with a lot of traffic, I generally had to wait for a pause in the traffic before the phone could recognize what I was saying. In the case of this indoor-guidance application, the main issue would probably be other voices in the background, such as in a busy hallway.
Ambient noise is an issue, you are right.
I may be able to use a hands free headset that does a bit of noise cancellation to alleviate this problem, but first I need to find the engine to get it working.
Just as "pie-in-the-sky" armchair development, maybe you could do it client-server. I think this is how Microsoft Live Search works: I'm guessing it records a sound, then sends that to a speech recognition server. The server then sends back its best guess at what was said.
No idea what the back-end server is, though. Maybe Microsoft Speech Server.
Dromio,
Thanks for your comment.
I did a bit of searching on this before. It sounds promising, but I am not sure how easy it is to implement. I had the impression that Speech Server was mainly built for telephony connections, but as you pointed out, Microsoft Live Search does it in a multimodal way with data only. It of course depends on always being connected to a network, and with a slow connection it would probably not work as it should.
Also, I've not been able to find a good working example of client-server SR so far. Maybe if someone has done it and is willing to share it with us, it would shed more light on this approach.
Regards,
HoSsEiN
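For what it's worth, the round trip itself is simple to sketch; the hard part is the recognizer on the server. The sketch below only shows the shape of the exchange: the client sends recorded audio bytes, the server answers with its best-guess text. Everything in it is an assumption for illustration: the length-prefixed framing, the function names, and especially `fake_recognize`, which stands in for a real engine. It runs both ends in one process over a local socket pair rather than a real network.

```python
import socket
import struct
import threading


def send_msg(sock, payload):
    # Length-prefixed framing: 4-byte big-endian size, then the payload bytes.
    sock.sendall(struct.pack(">I", len(payload)) + payload)


def recv_exact(sock, n):
    # Keep reading until exactly n bytes have arrived.
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("socket closed early")
        buf += chunk
    return buf


def recv_msg(sock):
    (size,) = struct.unpack(">I", recv_exact(sock, 4))
    return recv_exact(sock, size)


def fake_recognize(audio_bytes):
    # Placeholder: a real server would run a speech recognition engine here.
    return "room 200" if audio_bytes else ""


def serve_one(sock):
    # Server side: read one utterance, reply with the best guess as UTF-8 text.
    audio = recv_msg(sock)
    send_msg(sock, fake_recognize(audio).encode("utf-8"))
    sock.close()


def recognize_remote(sock, audio_bytes):
    # Client side: upload the recording, then wait for the text reply.
    send_msg(sock, audio_bytes)
    return recv_msg(sock).decode("utf-8")


client, server = socket.socketpair()
worker = threading.Thread(target=serve_one, args=(server,))
worker.start()
result = recognize_remote(client, b"\x00\x01" * 8000)  # stand-in for audio
worker.join()
client.close()
print(result)
```

On a device the client half would run on the phone and the socket pair would be replaced by a real network connection, which is where the slow-connection concern comes in.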

Voice Dialling in 6.5 - not found

I cannot find it for the life of me, and searching brings no results either. I use voice dialling in conjunction with a Bluetooth hands-free kit in the car, so it's pretty important to me.
Assuming that it is simply missing from the cooked ROM, does anyone have the relevant .cab?
TIA
/bump
57 views, no suggestions..?
Search software/apps & themes forum for MSVC, i.e. MicroSoft Voice Command
Close, but no cigar. I'm not interested in running programmes by voice; I just want to say a name to the Bluetooth in the car and have it dial like it used to. However, MSVC is eventually mentioned in the same sentence as CVSD, which is the missing element.
It's a pain to find Cyberon Voice Speed Dial, which as far as I was concerned was part of Windows Mobile in the first place!
You can download the relevant cab file here:
http://forum.xda-developers.com/showpost.php?p=3191184&postcount=3
BTW. Where the hell are all the buttons gone for posting..?

[Q] Audio decode, recognize a keyword

Hello,
I want constantly-on speech recognition that listens for just one keyword.
What I am trying to do is make my app listen constantly for one keyword and fire an intent whenever that keyword is recognized.
I know that this will use a lot of battery, and I don't want to use Google's speech recognition.
For example: you are talking with a person in a normal conversation. The phone is actively listening, recognizing every spoken word and checking for the keyword.
Let's say the keyword is "cheese" in this instance.
Whenever you say "cheese," the application fires an intent that starts up another part of the app.
I tried recording myself saying "cheese" into a WAV file and then comparing it to every word that was spoken.
My problem is finding the right tool to perform this signal comparison in the simplest way, so that it can work on any device.
I tried the musicg library with its fingerprint function, but it does not work very well.
I tried some other FFT/cross-correlation/etc. functions, but I didn't get the results I expected.
Any help (examples or a library would be best) will be very much appreciated.
Thanks.
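To make the cross-correlation approach mentioned above concrete, here is a minimal sketch of sliding a recorded template over a longer signal and scoring each position with normalized cross-correlation. It is written in plain Python on synthetic data (a sine burst standing in for the recorded keyword), and the function name and `THRESHOLD` value are illustrative assumptions. Raw-waveform matching like this tends to be sensitive to changes in speaking speed and pitch, which may be part of why the results were disappointing.

```python
import math


def normalized_xcorr_peak(signal, template):
    """Slide template over signal; return (best_offset, best_score).

    Scores are normalized cross-correlation values in [-1, 1].
    """
    n = len(template)
    t_mean = sum(template) / n
    t_centered = [x - t_mean for x in template]
    t_norm = math.sqrt(sum(x * x for x in t_centered))
    best_offset, best_score = -1, -1.0
    for offset in range(len(signal) - n + 1):
        window = signal[offset:offset + n]
        w_mean = sum(window) / n
        w_centered = [x - w_mean for x in window]
        w_norm = math.sqrt(sum(x * x for x in w_centered))
        if t_norm == 0 or w_norm == 0:
            continue  # flat window or template: correlation is undefined
        dot = sum(a * b for a, b in zip(w_centered, t_centered))
        score = dot / (t_norm * w_norm)
        if score > best_score:
            best_offset, best_score = offset, score
    return best_offset, best_score


# Synthetic stand-in for audio: the "keyword" is a short sine burst,
# embedded at half amplitude in an otherwise silent signal.
template = [math.sin(2 * math.pi * 5 * i / 50) for i in range(50)]
signal = [0.0] * 120 + [0.5 * x for x in template] + [0.0] * 80

offset, score = normalized_xcorr_peak(signal, template)
THRESHOLD = 0.8  # illustrative value; would need tuning on real recordings
print(offset, round(score, 3), score >= THRESHOLD)
```

Because the score is normalized, the half-amplitude copy still matches strongly, so loudness differences matter less than timing differences.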
It looks like Java has its own speech recognition API (basic info about it here: http://en.wikipedia.org/wiki/Java_Speech_API).
I also found the official FAQ with instructions on how to download and use it etc. http://www.oracle.com/technetwork/java/jsapifaq-135248.html
Hopefully, it's pretty high quality and works well. Good luck!
Thank you!!! I'll try it
Wow, fantastic question. How about looking at the source code of Google Voice, so you can see what they have done? :thumbup:
Just FYI, speech recognition is some complex stuff. I believe that Google uses some really advanced techniques such as deep neural networks for their speech recognition.
Here are some links if anyone's interested:
Wired Article
Google Research (Theory Based)
