Hello everyone,
After spending hundred of hours doing RnD on JSAPI, I have successfully created my first demo application that work on voice commands. Its just a simple application that accepts the voice commands and print numbers in a textbox by reading voice tokens. After working on it for so long, the things are pretty clear for me now.
Well, Im using Sphinx 4.0. Its a Speech Recognizer (no synthesizer) written in java. Sphinx is an implementation of Java Speech API (JSAPI) 1.0 and you must have JSAPI installed on your machine. Sphinx has quite different and easy implementation than others like FreeTTS etc. But the major fallback is, Its just recognizer. As wise people said, Something is better than nothing. Its much more than something, And according to me the best for beginners (Ofcourse for experts too).
About Sphinx :
Sphinx-4 is a state-of-the-art speech recognition system written entirely in the Java programming language. It was created via a joint collaboration between the Sphinx group at Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs (MERL), and Hewlett Packard (HP), with contributions from the University of California at Santa Cruz (UCSC) and the Massachusetts Institute of Technology (MIT).
Sphinx-4 is a very flexible system capable of performing many different types of recognition tasks. As such, it is difficult to characterize the performance and accuracy of Sphinx-4 with just a few simple numbers such as speed and accuracy. Instead, we regularly run regression tests on Sphinx-4 to determine how it performs under a variety of tasks.
Now, i’m working on a calculator application that works on Voice commands. I’ll be posting step by step tutorial to get sphinx work on windows and a simple example code of it.
Regards,
f5ne9dbstj
The appz is good as a starter : ^_^ .. looking forward for your next appz (that voice calculator) 🙂
http://www.youtube.com/watch?v=Xt9LC32414I
great job!
Hello from Russia!
Can I quote a post in your blog with the link to you?
Hello Polprav,
Yea sure, Why not. Just let me know once you are done.
Regards,
Have you posted a tutorial on configuring Sphinx and creating the speech-driven calculator?
Is there any way I can contact you so I can obtain source code to follow as an example in learning Sphinx and coding a speech-driven calculator?
I tried to use Sphinx for command recognition for simple application, but the accuracy was low. Can u make recognition system of isolated words with small vocabulary about 20 words?
I tried to TIDIGIT demo, but my english accent is bad or the recognition is bad.
can you tell me that ,how can i add these sphinx 4 in my java media player . I am facing difficulty to implement it.
Hey George,
Replied you on my Facebook page. See you there.
Regards,
Hi Puneet
i am developing a recognization and sysnthesis system
both of them are ready but when i am trying to integrate them i am getting some exceptions, so can i get a solution how to build a speech synthesis and recognization into a single program
please help me
hi i m building a application on sphinx 4.0 and jsapi for medicine name recognition , is it feasible with the accuracy