"I Think, I Conceptualize, I Analyze, I Design, and I Create." ~ Puneet Kalra

Cognitive Robotics Research Centre Of University Of Wales, Newport Puneet Kalra

Home Studies Research Projects Tutorials Portfolio

Puneet Kalra - www.puneetk.com - Socializing Robots

Speech Recognition System

September 11th, 2009 by Puneet Kalra Leave a reply »

Hello everyone,

After spending hundred of hours doing RnD on JSAPI, I have successfully created my first demo application that work on voice commands. Its just a simple application that accepts the voice commands and print numbers in a textbox by reading voice tokens. After working on it for so long, the things are pretty clear for me now.

Well, Im using Sphinx 4.0. Its a Speech Recognizer (no synthesizer) written in java. Sphinx is an implementation of Java Speech API (JSAPI) 1.0 and you must have JSAPI installed on your machine. Sphinx has quite different and easy implementation than others like FreeTTS etc. But the major fallback is, Its just recognizer. As wise people said, Something is better than nothing. Its much more than something, And according to me the best for beginners (Ofcourse for experts too).

About Sphinx :

Sphinx-4 is a state-of-the-art speech recognition system written entirely in the Java programming language. It was created via a joint collaboration between the Sphinx group at Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs (MERL), and Hewlett Packard (HP), with contributions from the University of California at Santa Cruz (UCSC) and the Massachusetts Institute of Technology (MIT).

Sphinx-4 is a very flexible system capable of performing many different types of recognition tasks. As such, it is difficult to characterize the performance and accuracy of Sphinx-4 with just a few simple numbers such as speed and accuracy. Instead, we regularly run regression tests on Sphinx-4 to determine how it performs under a variety of tasks.

Link to Sphinx 4.0 website

Now, i’m working on a calculator application that works on Voice commands. I’ll be posting step by step tutorial to get sphinx work on windows and a simple example code of it.

Regards,

f5ne9dbstj

Advertisement

9 Responses

  1. The appz is good as a starter : ^_^ .. looking forward for your next appz (that voice calculator) :)

  2. arunima says:

    great job!

  3. Polprav says:

    Hello from Russia!
    Can I quote a post in your blog with the link to you?

  4. Puneet Kalra says:

    Hello Polprav,

    Yea sure, Why not. Just let me know once you are done.

    Regards,

  5. Brian says:

    Have you posted a tutorial on configuring Sphinx and creating the speech-driven calculator?

    Is there any way I can contact you so I can obtain source code to follow as an example in learning Sphinx and coding a speech-driven calculator?

    • George Adams says:

      I tried to use Sphinx for command recognition for simple application, but the accuracy was low. Can u make recognition system of isolated words with small vocabulary about 20 words?
      I tried to TIDIGIT demo, but my english accent is bad or the recognition is bad.

  6. yuvam jain says:

    can you tell me that ,how can i add these sphinx 4 in my java media player . I am facing difficulty to implement it.

  7. Puneet Kalra says:

    Hey George,

    Replied you on my Facebook page. See you there.

    Regards,

Leave a Reply

Please note : I will help you only if you will show some efforts! Don't expect replies to simple queries, You can easily find answers for them through searching.

I'm a student and I work as well. So i might reply late here. If you have something important to discuss Or stuck somewhere in your project. Please post on my Facebook page for a quick response.