After spending hundreds of hours doing R&D on JSAPI, I have successfully created my first demo application that works on voice commands. It's just a simple application that accepts voice commands and prints numbers in a textbox by reading voice tokens. After working on it for so long, things are pretty clear to me now.
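To give a rough idea of the "reading voice tokens" part, here is a minimal sketch of turning recognized words into the digits shown in the textbox. It deliberately leaves out the recognizer itself: the class name VoiceTokens and the method toDigits are made up for illustration, and the input string stands in for whatever text the recognizer returns.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of Sphinx or JSAPI.
class VoiceTokens {
    // Spoken word -> digit character.
    private static final Map<String, Character> DIGITS = new HashMap<>();
    static {
        String[] words = {"zero", "one", "two", "three", "four",
                          "five", "six", "seven", "eight", "nine"};
        for (int i = 0; i < words.length; i++) {
            DIGITS.put(words[i], (char) ('0' + i));
        }
    }

    // Splits the recognized text into tokens and keeps only the digit words.
    static String toDigits(String recognizedText) {
        StringBuilder out = new StringBuilder();
        String[] tokens = recognizedText.trim().toLowerCase(Locale.ROOT).split("\\s+");
        for (String token : tokens) {
            Character digit = DIGITS.get(token);
            if (digit != null) {
                out.append(digit); // unknown tokens are silently skipped
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Assumed sample input; in the real app this comes from the recognizer.
        System.out.println(toDigits("one two three")); // prints 123
    }
}
```

In the demo, the returned string would simply be appended to the textbox. Skipping unknown tokens is one possible choice; showing an error for them would work just as well.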
Well, I'm using Sphinx 4.0. It's a speech recognizer (no synthesizer) written in Java. Sphinx relies on the Java Speech API (JSAPI) 1.0 for its JSGF grammar support, so you must have JSAPI installed on your machine. Sphinx has a quite different and easier implementation than other speech libraries like FreeTTS. But the major drawback is that it's just a recognizer. As wise people say, something is better than nothing. It's much more than something, and in my opinion the best for beginners (of course for experts too).
About Sphinx:
Sphinx-4 is a state-of-the-art speech recognition system written entirely in the Java programming language. It was created via a joint collaboration between the Sphinx group at Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs (MERL), and Hewlett Packard (HP), with contributions from the University of California at Santa Cruz (UCSC) and the Massachusetts Institute of Technology (MIT).
Sphinx-4 is a very flexible system capable of performing many different types of recognition tasks. As such, it is difficult to characterize the performance and accuracy of Sphinx-4 with just a few simple numbers such as speed and accuracy. Instead, we regularly run regression tests on Sphinx-4 to determine how it performs under a variety of tasks.
Now I’m working on a calculator application that works on voice commands. I’ll be posting a step-by-step tutorial on getting Sphinx to work on Windows, along with some simple example code.