Hello All,
I hope I can ask you this question in this section.
I am not a hardcore developer, but I have made some custom applications for PPC. Now I have a project for which I need to develop an application that can be controlled by voice.
The project is designed for visually impaired users, and I am trying to provide indoor guidance for them. I have all the other parts of the project worked out, but I need a development tool that I can embed in my PPC application to do speech recognition of my own custom commands, and maybe speech synthesis as well.
I have worked with Microsoft's Speech SDK 5.1 on XP and Vista, but as I understand it there is no SDK for Microsoft Voice Command 1.6.
Do you have any suggestions, or know of any SR engine that I can embed in and program from my application?
PS - I am using an AT&T Tilt as my test platform at this time, and the goal is to make the application work on Windows Mobile 5.0/6.0/6.1.
Regards,
HoSsEiN
I may be a little out of touch with what you're asking for, but Microsoft Voice Command has both audio input and reply. You may want to check it out.
You are right ...
Curious,
That's right. I do use Voice Command on my PDA, but I don't think I can make it respond to custom commands!
I actually contacted Microsoft and asked them about this. They said there is no way to customize Voice Command; it is closed source and I can't hack into it.
In my application I want the user to say "I need to go to room 200" (or just "room 200"), and my program should understand it and, using my indoor positioning system, route the blind person to that room.
So I am hoping to find an engine, SDK, or toolkit that I can embed in my application.
Do you know whether Microsoft Voice Command can be hacked to do this?
Thanks for your reply ...
HoSsEiN
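Whichever engine ends up producing the text, turning a recognized phrase such as "I need to go to room 200" into a destination can be kept separate from the recognizer itself. Below is a minimal sketch of that step, written in Java purely for illustration. It assumes the engine hands back a plain string; the class and method names are hypothetical and not part of any speech SDK.

```java
import java.util.OptionalInt;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: extracts a room number from already-recognized text.
// The speech engine that produces the text is assumed and not shown here.
class RoomCommandParser {
    // Matches "room 200", "Room 12", "I need to go to room 200", and so on.
    private static final Pattern ROOM =
            Pattern.compile("\\broom\\s+(\\d{1,4})\\b", Pattern.CASE_INSENSITIVE);

    // Returns the room number, or an empty OptionalInt if no room was mentioned.
    static OptionalInt parseRoom(String recognizedText) {
        if (recognizedText == null) {
            return OptionalInt.empty();
        }
        Matcher m = ROOM.matcher(recognizedText);
        if (m.find()) {
            return OptionalInt.of(Integer.parseInt(m.group(1)));
        }
        return OptionalInt.empty();
    }
}
```

Calling `RoomCommandParser.parseRoom("I need to go to room 200")` yields 200, which could then be passed to the indoor positioning system. Keeping the grammar this small also reduces how much the recognizer has to get right.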
If you can find a software package to handle the voice recognition, you may still have problems with background noise interfering. My previous phone had a voice-activated dialing application that didn't need to be trained for most names. However, I found that, if I was on a road with a lot of traffic, I generally had to wait for a pause in the traffic before the phone could recognize what I was saying. In the case of this indoor-guidance application, the main issue would probably be other voices in the background, such as in a busy hallway.
Ambient noise is an issue, you are right.
I may be able to use a hands free headset that does a bit of noise cancellation to alleviate this problem, but first I need to find the engine to get it working.
Just as "pie-in-the-sky" armchair development, maybe you could do it client-server. I think this is how Microsoft Live Search works: I'm guessing it records a sound, then sends that to a speech recognition server. The server then sends back its best guess at what was said.
No idea what the back-end server is, though. Maybe Microsoft Speech Server.
Dromio,
Thanks for your comment.
I searched for this a bit before. It sounds promising, but I am not sure how easy it is to implement. I had the impression that Speech Server was mainly built for telephony connections, but as you pointed out, Microsoft Live Search does it in a multimodal way over data only. Of course, it depends on always being connected to a network, and with a slow connection it would probably not work as it should.
Also, I have not been able to find a good working example of client-server SR so far. If someone has done it and is willing to share, it would shed more light on this approach.
Regards,
HoSsEiN
Hey,
I'm trying to develop a program in VB.NET. I call it wDiary, which stands for Workout Diary.
The purpose is to let the user choose 4 (or more) sports to save progress/day notes for, and to view the text files like a daily diary in a calendar.
I have come to the part where I want to save a file in a specific directory.
For example, save:
"Jogging - " + date + ".txt" in %MyDocuments%\wDiary\Jogging\
and print TextBox1.Text on line one, and so on,
like this code:
Code:
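' Build a file name such as "\Jogging - <date>.txt" next to the executable.
Dim fileName As String = "\Jogging - " & Date.Today.ToString("yyyy-MM-dd") & ".txt"
FileOpen(1, My.Application.Info.DirectoryPath & fileName, OpenMode.Output)
PrintLine(1, TextBox2.Text)
' Write to the same file number (1) that FileOpen used; file 2 was never opened.
PrintLine(1, TextBox3.Text)
FileClose(1)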
But that's code I used for a Windows app. I want the same functionality on my Touch Pro (Windows Mobile 6.1) :/
Fraction
Just a quick one, and I understand English might not be your first language, but a dairy is where they bottle milk and make other dairy produce like cheese and cream.
I think your app should have been called wDiary - as in a diary of events.
I'm not taking the mick here, just offering a bit of help. As for the file editing, what language are you doing it in? C++.NET, C#.NET, VB.NET?
If it's VB.NET you're using - this site has some decent basic file routines (checking if a file exists, adding text, appending, closing, etc.).
Hope this helps
Doing a quick search, it turns out to be VB.NET indeed. Try stating which language you use next time.
The link in the above post should give you sufficient info.
FrederickUK, now I'm embarrassed. No, English is not my primary language, hehe.
I'm using Visual Basic .NET.
EVILRipper, I have edited the topic and the first post. Thanks, I forgot.
Fraction
You should be able to use the standard System.IO.StreamReader and System.IO.StreamWriter classes. I can code this in C# if you can translate it to VB.NET.
Change of direction
Actually, let me make another suggestion. Sometimes when writing a small app to collect data, I use a very lazy yet very useful feature of .NET DataSets.
I databind the controls to the DataSet by dropping the DataSet on the form and assigning the bindings via the autogenerated BindingSource component. All I need to read and save are the following lines of code.
Read:
Code:
dataSet1.ReadXml("mydb.xml");
Write:
Code:
dataSet1.WriteXml("mydb.xml", XmlWriteMode.WriteSchema);
It might not be pretty, but it does the trick. It also means less code and no need to parse XML.
Yes, please
I have one more thing, if somebody can help. The design looks perfect in Visual Basic, but when I deploy it to the emulator it looks small and rubbish. The buttons (PNG files) get stretched in width and squashed in height. So how can I design for 480x640 (VGA) for my Touch Pro? :/
I've set my emulator settings to USA WM 6.1 Classic and it works fine when deployed to my touch pro. If your screen resolution is fairly high (I use 1280x1024) you could use USA WM 6.1 VGA but it filled my entire screen. You might have better luck if you use 1600x1280 or whatever (or run multi-monitor)
Hope this helps
I also use 1280x1024, with the (USA?) WM 6.1 Professional image,
and yes, it fills my whole screen ;P
I've been fighting with the X1 across more than 3 ROMs and searching the net and XDA for hours without finding a solution to this.
It's as simple as this: I want to launch voice dialing software (or any other application) other than Voice Commander from my Bluetooth button, and have it take voice input from the Bluetooth microphone rather than from the main unit.
Am I asking too much? This is easy on phones that are decades old. Voice Commander is pointless and utter crap for unsupported languages. I've found various recording apps for that purpose, but I haven't managed to launch them using the Bluetooth button. If I have to use the main unit, voice dialing defeats its purpose.
So, any hints or advice on where to head? I've seen many registry keys and Bluetooth audio gateway DLLs, but none could actually execute a custom application instead of the damn Voice Commander.
Thanks.
For me, Microsoft Voice Command 1.6 works perfectly, but I guess you're saying that it's not available in Greek?
Galbadia Hotel
I do not think you should have that site listed in your signature, if you don't mind my saying so.
What happened when Google visited this site?
Of the 1156 pages that we tested on the site over the past 90 days, 10 page(s) resulted in malicious software being downloaded and installed without user consent. The last time that Google visited this site was on 2009-01-21, and the last time that suspicious content was found on this site was on 2009-01-21.
Malicious software includes 7 trojan(s), 5 scripting exploit(s). Successful infection resulted in an average of 4 new processes on the target machine.
Malicious software is hosted on 5 domain(s), including d0fus.com/, yinjoy.net/, ffshrie.com/.
1 domain(s) appear to be functioning as intermediaries for distributing malware to visitors of this site, including 59.60.31.0/.
This site was hosted on 3 network(s) including AS21844 (THEPLANET), AS19166 (ALPHARED), AS32748 (STEADFAST).
Not only that, but some people prefer prerecorded voice recognition to Voice Command. In any case, it should be possible to launch any application you want with the Bluetooth button, voice dialing or not; the question is how.
The site once was a paradise of music but it's been down for some time. Removed the link until it is back up again.
So, anyone? I wouldn't have imagined that the XPERIA can neither record a caller's voice nor offer a configurable way of voice dialing over Bluetooth. These things are taken for granted on other platforms...
hi,
Is there any chance to send a request to Google Navigation and start a route to a destination given by latitude and longitude?
I'm pretty sure this is possible, but there is no official API - you have to look into the code of Google Maps.
That's exactly what I thought, because Google Maps can start navigation with Google Navigation.
Is Google Maps open source? If yes, where can I download it?
No, but you could baksmali it. If you aren't too familiar with it, I could help you when I have some free time.
No, I have never heard of it. It would be very nice if you could help me.
Thanks in advance.
Ok, there was no need to look into smali:
Code:
I/ActivityManager( 118): Starting activity: Intent { act=android.intent.action.VIEW dat=google.navigation:///?q=Some%20place cmp=brut.googlemaps/com.google.android.maps.driveabout.app.NavigationActivity }
Is it implied that we have to use the brut maps package to accomplish this, or can I do something similar with the default Google maps package?
Replace brut.googlemaps with com.google.android.apps.maps and it should work just fine.
Hey Brut,
Sorry, I'm new to developing apps on Android. Could you help me get from that output to the code that I need to put in my app? I tried fixing the code in Eclipse to get it working, but I haven't worked enough with custom intents and activities (and especially the Maps API) to get from point A to point B. My ultimate plan is to have a list of buttons that each provide walking directions from your current location to a predetermined latitude/longitude coordinate. Any help is much appreciated. Thank you in advance to anyone who can point me in the right direction.
I can launch the driving navigation by:
Code:
startActivity(new Intent(Intent.ACTION_VIEW, Uri.parse("google.navigation:q=" + mLatitude + "," + mLongitude)));
However, I want to launch the walking navigation (new in Google Maps 4.5).
Does anybody know how to launch this?
Regards,
Alex
I found the answer myself after using the app CatLog (a great app for viewing logs on your phone).
You can launch walking navigation by:
Code:
startActivity(new Intent(Intent.ACTION_VIEW, Uri.parse("google.navigation:ll=" + mLatitude + "," + mLongitude + "&mode=w")));
Regards,
Alex
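For anyone assembling these URIs from double coordinates, here is a small sketch of a helper that builds the two strings quoted in this thread. The class and method names are made up for illustration, and the "google.navigation:" scheme itself is undocumented, so treat the formats as observed behaviour rather than a stable API. Formatting with a fixed locale is an assumption on my part: it avoids exponent notation such as 1.0E-4, which plain string concatenation can produce for coordinates very close to zero.

```java
import java.util.Locale;

// Hypothetical helper that only builds the URI strings quoted in this thread.
// The "google.navigation:" scheme is undocumented and may change between Maps versions.
class NavigationUri {
    // Driving navigation, using the "q=" form from the post above.
    static String driving(double latitude, double longitude) {
        return String.format(Locale.US, "google.navigation:q=%f,%f", latitude, longitude);
    }

    // Walking navigation, using the "ll=" form plus "&mode=w".
    static String walking(double latitude, double longitude) {
        return String.format(Locale.US, "google.navigation:ll=%f,%f&mode=w", latitude, longitude);
    }
}
```

In an Activity, the result would go where the concatenated string went before, for example `Uri.parse(NavigationUri.walking(mLatitude, mLongitude))` inside the same `startActivity(new Intent(Intent.ACTION_VIEW, ...))` call.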
Quick question regarding Google navigation brut.
Once you are navigating and, for example, you take a wrong turn (i.e. right instead of left), the navigation voice won't tell you that you have taken a wrong turn, or say that it is rerouting!
Any suggestions, or any possibility of implementing this?
Can somebody explain to me how to look into the source of Google Navigation with baksmali?
And what do you need to know? Everything is in the usage help and/or on the project site.
I'm sorry I didn't take the time to read the website before posting; it won't happen again.
Sorry for double posting, but...
I've been trying some things now, without any success.
Has anybody been able to get any sort of callback from the navigation intent?
What do you mean?
team up for a project with google navigation
Hello all,
My name is Luigi, and I am living in the Netherlands now (Wageningen).
My background is in Communication and International Development.
I am looking for people with expertise in programming for Google Maps and Google Navigation. That person will be responsible for developing an application; I will be responsible for building a community and securing funding. I am preparing a business plan and looking for partners.
The query sent to Google Maps/Navigation must be stored on a server.
The server gives back to the mobile device the coordinates of nearby locations.
Alternatively, could you please point me to materials or tutorials that would help me build such an application (or at least describe it better in draft form)?
I am new to programming, with only previous experience in ActionScript.
Please let me know if you are interested, along with your skills.
thanks!
Luigi
Awesome tip, thanks Alex.
Brut, when you start the navigation intent it returns immediately. If you want to monitor the status of the navigation you have to do it yourself.
Does anyone get a force close when they launch it from a method? Also, what types should the variables be? My longitude and latitude are of type double.
Hello,
Constantly-on speech recognition listening for just one keyword.
What I am trying to do is make my app constantly listen for one keyword and fire an intent whenever that keyword is recognized.
I know that this will use a lot of battery, and I don't want to use Google's speech recognition.
For example: you are talking with a person in a normal conversation. The phone is actively listening, recognizing every spoken word and checking for the keyword.
Let's say the keyword is "cheese" in this instance.
Whenever you say "cheese," the application fires an intent that starts up another part of the app.
I tried recording myself saying "cheese" into a WAV file
and then comparing it to every word that was spoken.
My problem is finding the right tool to perform this signal comparison in the simplest way, so that it works on any device.
I tried the musicg library with its fingerprint function, but it does not work very well.
I also tried some other FFT/cross-correlation/etc. functions, but I didn't get the results I expected.
Any help you can give me (examples or a library would be best) will be much appreciated.
Thanks.
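To make the cross-correlation idea concrete, here is a minimal sketch of sliding normalized cross-correlation between a short template (the recorded keyword) and a longer signal. It assumes mono PCM samples already converted to doubles; reading the WAV file and capturing the microphone are not shown, and the class name is hypothetical. One caveat: comparing raw waveforms like this is very sensitive to changes in speed, pitch, and volume, which may be part of why the results were disappointing. Keyword spotters more commonly compare spectral features rather than raw samples.

```java
// Sketch of sliding normalized cross-correlation on raw samples.
// Assumes mono PCM converted to doubles; audio I/O is intentionally omitted.
class KeywordCorrelator {
    // Returns the best normalized correlation score in [-1, 1] over all offsets,
    // or 0.0 if the template is empty or longer than the signal.
    static double bestScore(double[] signal, double[] template) {
        if (template.length == 0 || signal.length < template.length) {
            return 0.0;
        }
        double templateEnergy = 0.0;
        for (double t : template) {
            templateEnergy += t * t;
        }
        double best = -1.0;
        for (int offset = 0; offset + template.length <= signal.length; offset++) {
            double dot = 0.0;
            double windowEnergy = 0.0;
            for (int i = 0; i < template.length; i++) {
                double s = signal[offset + i];
                dot += s * template[i];
                windowEnergy += s * s;
            }
            // Normalizing by both energies makes the score independent of volume.
            double denom = Math.sqrt(templateEnergy * windowEnergy);
            double score = denom == 0.0 ? 0.0 : dot / denom;
            best = Math.max(best, score);
        }
        return best;
    }

    // True if some window of the signal correlates with the template at or above the threshold.
    static boolean containsKeyword(double[] signal, double[] template, double threshold) {
        return bestScore(signal, template) >= threshold;
    }
}
```

The threshold has to be tuned by experiment. This direct form costs signal length times template length operations, so a continuously listening app would likely need an FFT-based variant or a cheaper first-pass detector.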
It looks like Java has its own speech recognition API (basic info about it here: http://en.wikipedia.org/wiki/Java_Speech_API).
I also found the official FAQ with instructions on how to download and use it etc. http://www.oracle.com/technetwork/java/jsapifaq-135248.html
Hopefully, it's pretty high quality and works well. Good luck!
Thank you!!! I'll try it
Wow, fantastic question. How about referring to the source code of Google Voice? Then you can see what they have done.
Just FYI, speech recognition is some complex stuff. I believe that Google uses some really advanced techniques such as deep neural networks for their speech recognition.
Here are some links if anyone's interested:
Wired Article
Google Research (Theory Based)