Hello everyone,

After spending hundreds of hours doing R&D on JSAPI, I have successfully created my first demo application that works on voice commands. It’s just a simple application that accepts voice commands and prints numbers in a textbox by reading voice tokens. After working on it for so long, things are pretty clear to me now.
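
To give a rough idea of the "reading voice tokens" step, here is a minimal sketch in plain Java. It is not the demo's actual code. It assumes the recognizer hands back the recognized words as one space-separated string (in Sphinx-4 that would typically come from something like `Result.getBestFinalResultNoFiller()`), and the class and method names (`VoiceTokenDigits`, `toDigits`) are made up for illustration.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper: turns recognized word tokens into the digits shown in a textbox.
class VoiceTokenDigits {
    private static final Map<String, Character> DIGITS = new HashMap<>();

    static {
        String[] words = {"zero", "one", "two", "three", "four",
                          "five", "six", "seven", "eight", "nine"};
        for (int i = 0; i < words.length; i++) {
            DIGITS.put(words[i], (char) ('0' + i));
        }
        DIGITS.put("oh", '0'); // assumption: "oh" is accepted as a spoken zero
    }

    // Assumes recognizedText is a space-separated list of words from the recognizer.
    static String toDigits(String recognizedText) {
        StringBuilder out = new StringBuilder();
        for (String token : recognizedText.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            Character digit = DIGITS.get(token);
            if (digit != null) {
                out.append(digit); // unknown tokens are silently skipped
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(toDigits("one two three")); // prints 123
    }
}
```

In a real application the returned string would be appended to the textbox (for example a Swing `JTextField`) each time the recognizer reports a result.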

I’m using Sphinx 4.0, a speech recognizer (not a synthesizer) written in Java. Sphinx builds on the Java Speech API (JSAPI) 1.0, so you must have JSAPI installed on your machine. Sphinx works quite differently from other JSAPI-related libraries such as FreeTTS, and I found it easier to use. Its major drawback is that it is only a recognizer. But as wise people say, something is better than nothing, and Sphinx is much more than something. In my opinion it is the best choice for beginners (and, of course, for experts too).

About Sphinx:

Sphinx-4 is a state-of-the-art speech recognition system written entirely in the Java programming language. It was created via a joint collaboration between the Sphinx group at Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs (MERL), and Hewlett Packard (HP), with contributions from the University of California at Santa Cruz (UCSC) and the Massachusetts Institute of Technology (MIT).

Sphinx-4 is a very flexible system capable of performing many different types of recognition tasks. As such, it is difficult to characterize the performance and accuracy of Sphinx-4 with just a few simple numbers such as speed and accuracy. Instead, we regularly run regression tests on Sphinx-4 to determine how it performs under a variety of tasks.

Link to Sphinx 4.0 website

Now I’m working on a calculator application that works on voice commands. I’ll be posting a step-by-step tutorial on getting Sphinx to work on Windows, along with some simple example code.
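
As a rough illustration of the calculator idea, the sketch below evaluates a recognized phrase such as "two plus three". It is only a sketch under assumptions: the recognizer is assumed to return three space-separated words (digit, operator, digit), and the names `SpokenCalculator` and `evaluate` as well as the operator words are hypothetical, not part of Sphinx or JSAPI.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical evaluator for spoken commands of the form "<digit> <operator> <digit>".
class SpokenCalculator {
    // The index of each word in this list is its numeric value.
    private static final List<String> NUMBERS = Arrays.asList(
            "zero", "one", "two", "three", "four",
            "five", "six", "seven", "eight", "nine");

    static int evaluate(String command) {
        String[] tokens = command.trim().toLowerCase(Locale.ROOT).split("\\s+");
        if (tokens.length != 3) {
            throw new IllegalArgumentException("expected: <digit> <operator> <digit>");
        }
        int left = digit(tokens[0]);
        int right = digit(tokens[2]);
        switch (tokens[1]) {
            case "plus":  return left + right;
            case "minus": return left - right;
            case "times": return left * right;
            default:
                throw new IllegalArgumentException("unknown operator: " + tokens[1]);
        }
    }

    private static int digit(String word) {
        int value = NUMBERS.indexOf(word);
        if (value < 0) {
            throw new IllegalArgumentException("unknown number: " + word);
        }
        return value;
    }

    public static void main(String[] args) {
        System.out.println(evaluate("two plus three")); // prints 5
    }
}
```

Keeping the vocabulary this small (ten digits and a few operator words) also means the recognizer only has to choose among a handful of words.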



11 Responses

  1. Have you posted a tutorial on configuring Sphinx and creating the speech-driven calculator?

    Is there any way I can contact you so I can obtain source code to follow as an example in learning Sphinx and coding a speech-driven calculator?

    1. I tried to use Sphinx for command recognition in a simple application, but the accuracy was low. Can you make a recognition system for isolated words with a small vocabulary of about 20 words?

      I tried the TIDIGITS demo, but either my English accent is bad or the recognition is bad.

  2. Hi Puneet

    I am developing a recognition and synthesis system.

    Both parts are ready, but when I try to integrate them I get some exceptions. Can you suggest how to build speech synthesis and recognition into a single program?

    Please help me.

  3. Hi, I’m building an application on Sphinx 4.0 and JSAPI for medicine name recognition. Is it feasible with good enough accuracy?
