"I Think, I Conceptualize, I Analyze, I Design, and I Create." ~ Puneet Kalra

Cognitive Robotics Research Centre Of University Of Wales, Newport Puneet Kalra

Home Studies Research Projects Tutorials Portfolio

Puneet Kalra - www.puneetk.com - Socializing Robots

Sphinx 4.0 Video Tutorial ( High Quality )

October 30th, 2009 by Puneet Kalra Leave a reply »

Hello,

Here’s the same video tutorial but with higher video quality. For more info about the tutorial, please check the original post here.

Download Sphinx 4.0 Video Tutorial

File Details :

Name : Sphinx-Tutorial.avi
Size : 334 MBs
Duration : 26:14 Minutes
Dimensions : 1280 x 800

89 Responses

  1. Shredder says:

    I understand that the components provided for decoding will be:
    1. The decoder source code
    2. The language dictionary
    3. The filler dictionary
    4. The language model
    5. The test data
    Now, for decoding sphinx4 forms a trellis, which is noting but a prduct of language HMM and time. This trellis is noting but a acyclic graph(or a search graph as one might call it). What I am interested is in reducing the size of this structure(search graph), so as to increase the recognition capability of the system. What I am doing now is, I am passing the decoder updated language model(read grammar file). I would like to verify whether passing a smaller grammar file(pruned grammar file which suffices the need of the concerned sample to be decoded) will help reduce the size of the search graph.
    Please make suggestion or respond if I am not making myself not clear.
    I am looking forward towards this discussion.

  2. mrt says:

    hi, this is great.

    tnx

  3. Abey says:

    hey
    how to integrate sphinx4 in j2EE application .

  4. ntmartin says:

    Hey, I went through your tutorial, and am getting an error…

    Exception in thread “main” Property exception component:’dictionary’ property:’fillerPath’ – Can’t locate resource:/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/fillerdict
    edu.cmu.sphinx.util.props.InternalConfigurationException: Can’t locate resource:/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/fillerdict

    Any ideas what I might have done wrong? Pretty sure I did everything you said to in the tutorial…

  5. Puneet Kalra says:

    Hey Guys,

    @Abey, Sphinx will work as a module on Server side and JApplet as client side on your website.

    @mtmartin, I guess you have downloaded unstable version of Acoustic model, download stable one and try with that one.

    Hope that works !

  6. Sara says:

    Thank you so much for this tutorial! It was great! Very easy to follow and I got the HelloWorld working fine. I’m doing a learning game for Swedish children, so I’m going to check out your post about expanding the dictionary to get the 100 or so Swedish words I need my program to recognize and I hope it will work. If I understand correctly, this is possible although the words are not English? I have searched a little but not found any aucustic model for Swedish.

  7. Puneet Kalra says:

    Hello Sara,

    Yes, it is possible. But there will be some difference in pronunciation and you might have to play little with phonemes in dict file.

    Best of luck!

    ~Puneet Kalra

  8. elnagar says:

    @the heeloword.java
    symbol : constructor ConfigurationManager(java.lang.String)
    location: class edu.cmu.sphinx.util.props.ConfigurationManager

    …cm = new ConfigurationManager(args[0]);//edu.cmu.sphinx.sample.MyComponent
    1 error

  9. Shumaila says:

    Thank You Puneet Bhai for the tutorial
    it was great.
    i am Working on Speech Enabled Search engine.
    Can you help me because i just starting my work. this is my final year project i’ll be thankful to you.

  10. Puneet Kalra says:

    Hello Shumaila,

    Thanks for commenting, and yea, please feel free to post here if you get stuck anywhere.

    Or you can also post on my Facebook page, Me or any other person will respond you.

    Regards,
    Puneet Kalra

  11. Lauren says:

    Hi there, Please could you help me iv been trying to download your tutorial about 3 – 4 times now and every time i download it it gets stuck on 121MB i need to despriately make a speech recognition but i am so new to sphinx and eclipse…

  12. Lauren says:

    Hi again no worries I have sovled my problem :)

  13. Lauren says:

    Hi Puneet.. please could you tell me what iv done wrong i have followed your tutorial step by step and when i want to run the HelloWorld Demo i get the following error “main” java.lang.NoSuchFieldError: engineListeners

    when iopen the HellowWorld and check the main(String[]) it says the Jar file has no source added to it what does it mean. please help

  14. Setiadhi says:

    wow, great video tutorial. that’s very helpful for my project.

  15. Lauren says:

    Hi Puneet,

    Is it possible to edit HellonGram demo’s dictionary and only use your own words? Please could you get back to me thank you. I’ve looked at extending the Dictionary but i actually need to create my own and am not sure how to go about that, Thank you so much and a AWESOME WEBSITE WELL DONE :)

    • Puneet Kalra says:

      Hey Lauren,

      Use same Expanding Dictionary tutorial, but there will be one change, Replace your own words DIC file with original DIC file in acoustic model instead of copying your own words DIC file’s content into original DIC file.

      Or if your words going to be in thousands, you will need to work on Sphinxtrain.

      Regards,

      • Lauren says:

        Hi Puneet,

        Thank you soooo much for replying :)
        by the way i got the dictionary working its about 300 words. but i am having a huge problem. i want it to not try force match words that are not in the dictionary to the words in my dictionary if that makes sense. let me make an example “That is a blue car” ==> words in dictionary “That is a … car”
        ==> words NOT in dictionary “blue”

        so basically i want it to display you said: “that is a car” replaces words that are not in the dictionary with “unknown” .. have you created a video demo on this ? iv been battling for a month with this “out-of-grammar” i just dont know how to actually set it up there is no proper guidance. and im just getting it wrong over and over :(

        Please could you give me advice i really would appreciate it so much :) if you can email me please do.

        Thank you for all your hard work, your website has helped me out so much :)

  16. shailesh says:

    Please let me How I can incorporate Sphinx into C#.net Application.

    • Puneet Kalra says:

      Hey Shailesh,

      Use “Pocketsphinx” which is written in C, and load it in your c# program exactly same as you load DLL or EXE files.

      ~Puneet

      • lauren says:

        Hi, how do you go about using pocketsphinx i loaded it in Visual Studio 2010 C++ built sphinxbase and pocketsphinx have the dll’s in the bin folder but how do i actually use it in a C# application would i have to create a consol application or a winform?? please help i really need advise.

        thank so much Lauren

  17. Abey says:

    thanks puneet
    i have integrate sphinx with web using applet. I have develop a sphinx application which runs in client machine and recognized text will post on the web page using applet.
    I have seen that google translate page (pls use chrome :http://translate.google.com/?sl=en# ) and it is too good. Can u suggest how I can make web page like this using sphinx?

  18. Puneet Kalra says:

    Hey Abey,

    There are many ways of doing that. First step will be same for all and that will be recognising speech through Sphinx, which is integrated by applet.

    Second step will be sending recognised string plus translation details to server through networking, Or you can use Google Translate API but unfortunately that will be taken down by Dec 2011, so you might need to find another one.

    Third step will be getting output from Applet or from Server using Ajax depending on what you choose on second step.

    That’s the basic idea how it will work but there are many important aspects to consider before you make the final go.

    Best of luck, ~Puneet

  19. nikul says:

    Hi!

    Sphinx4 related topic
    When i running regression test its gives following error.

    [java] java.lang.NoClassDefFoundError: edu/cmu/sphinx/util/props/Configurable

    can you tell me which jar file contain above class definition ??

    Thanks

  20. Lauren says:

    Hi Puneetk,

    Do you know how to make your hellongram demo to only pick up the words in your dictionary and not try force match words that are not in the dictionary to words that are in the dictionary, Hope that makes sense. so it looks at the Out of Grammar fact. so for example i would like it to display “that is a car”

    what was said: “that is a ugly car”..

    Please could you help me thank you sooo much :D

    • lauren says:

      hi again i got the sphinx4 working lovely unfortunately i have to have the speech recognition in my C# project. so i really need some help please please please.. iv been battling for 3 months now trying to just get even a demo running in C++ please can anyone help me :’( please

  21. Rahulserver says:

    Mr.Punit, great work on this forum and thank you very much for your video tut. I think you are the best guy on earth to solve my problem.
    I copied the /demo/sphinx/HelloWorld source folder and pasted in netbeans’ project src folder.The folder has all the needed gram,xml and java files.But as I tried to run it following error comes:
    Problem configuring HelloWorld: Property Exception component:’jsgfGrammar’ property:’grammarLocation’ – Can’t locate resource:/helloworld.HelloWorld
    Property Exception component:’jsgfGrammar’ property:’grammarLocation’ – Can’t locate resource:/helloworld.HelloWorld
    at edu.cmu.sphinx.util.props.ValidatingPropertySheet.getResource(ValidatingPropertySheet.java:372)
    at edu.cmu.sphinx.jsapi.JSGFGrammar.newProperties(JSGFGrammar.java:272)
    at edu.cmu.sphinx.util.props.ConfigurationManager.lookup(ConfigurationManager.java:214)
    at edu.cmu.sphinx.util.props.ValidatingPropertySheet.getComponent(ValidatingPropertySheet.java:403)
    at edu.cmu.sphinx.linguist.flat.FlatLinguist.newProperties(FlatLinguist.java:248)
    at edu.cmu.sphinx.util.props.ConfigurationManager.lookup(ConfigurationManager.java:214)
    at edu.cmu.sphinx.util.props.ValidatingPropertySheet.getComponent(ValidatingPropertySheet.java:403)
    at edu.cmu.sphinx.decoder.search.SimpleBreadthFirstSearchManager.newProperties(SimpleBreadthFirstSearchManager.java:180)
    at edu.cmu.sphinx.util.props.ConfigurationManager.lookup(ConfigurationManager.java:214)
    at edu.cmu.sphinx.util.props.ValidatingPropertySheet.getComponent(ValidatingPropertySheet.java:403)
    at edu.cmu.sphinx.decoder.Decoder.newProperties(Decoder.java:71)
    at edu.cmu.sphinx.util.props.ConfigurationManager.lookup(ConfigurationManager.java:214)
    at edu.cmu.sphinx.util.props.ValidatingPropertySheet.getComponent(ValidatingPropertySheet.java:403)
    at edu.cmu.sphinx.recognizer.Recognizer.newProperties(Recognizer.java:93)
    at edu.cmu.sphinx.util.props.ConfigurationManager.lookup(ConfigurationManager.java:214)
    at helloworld.HelloWorld.main(HelloWorld.java:49)

    I also changed the hellworld.config.xml file’s grammarLocation property to hold the proper package name. But problem persists.
    Any help????????????????

  22. sanchez says:

    Hello friends!!!
    I am new one with sphinx (speech recognizer). And I want to understand all about sphinx… Can you help me???
    HelloWorld.java class is not running????

  23. akshay says:

    hi puneet. it was a great tutorial. my helloworld program works fine. but now i want to make a complete application to recognize common English words. i dont know how to make changes to config file. there are a lot of properties. plz help me. what should i do.

  24. madhulatha says:

    hii,
    i’m trying to execute the sample program of sphinx but i get the following error
    Exception in thread “main” java.lang.NoSuchFieldError: engineListeners
    at com.sun.speech.engine.recognition.BaseRecognizer.fireRecognizerSuspended(BaseRecognizer.java:922)
    at com.sun.speech.engine.recognition.BaseRecognizer.dispatchSpeechEvent(BaseRecognizer.java:1262)
    at com.sun.speech.engine.SpeechEventUtilities.postSpeechEvent(SpeechEventUtilities.java:201)
    at com.sun.speech.engine.SpeechEventUtilities.postSpeechEvent(SpeechEventUtilities.java:132)
    at com.sun.speech.engine.recognition.BaseRecognizer.postRecognizerSuspen
    ded(BaseRecognizer.java:912)
    at com.sun.speech.engine.recognition.BaseRecognizer.commitChanges(BaseRecognizer.java:358)
    at edu.cmu.sphinx.jsapi.JSGFGrammar.commitChanges(JSGFGrammar.java:536)
    at edu.cmu.sphinx.jsapi.JSGFGrammar.createGrammar(JSGFGrammar.java:243)
    at edu.cmu.sphinx.linguist.language.grammar.Grammar.allocate(Grammar.java:101)
    at edu.cmu.sphinx.linguist.flat.FlatLinguist.allocate(FlatLinguist.java:229)
    at edu.cmu.sphinx.decoder.search.SimpleBreadthFirstSearchManager.allocat
    e(SimpleBreadthFirstSearchManager.java:603)
    at edu.cmu.sphinx.decoder.AbstractDecoder.allocate(AbstractDecoder.java:
    67)
    at edu.cmu.sphinx.recognizer.Recognizer.allocate(Recognizer.java:157)
    at edu.cmu.sphinx.demo.helloworld.HelloWorld.main(HelloWorld.java:37)

  25. Matteo says:

    Hi, this tut is awesome, but the hd video to download is only first part :( are you try to upload HD on megavideo? :)

  26. Puneet Kalra says:

    Hey everyone,

    Here’s the link of video on Megavideo : http://www.megavideo.com/?v=C9WDSWCK

    Regards,
    PK

  27. littleboy says:

    hai puneet..

    i downloaded the tutorial..it contains only 1st part…then tried http://www.megavideo.com/?v=C9WDSWCK…its not working..it says some privacy warning..little help plzzzzzz..:-(

  28. Ryan Kareem says:

    Hey Puneet, Thanks for the tutorial. I could run the hello world demo pretty smoothly.

    In your tutorial, you took a file name “edu.cmu.sphinx.demo.helloworld” form a project named “VoiceCal” which contained the source file of the demos. Could you upload the download link of the source files of the demo or the source files please ..I googled them but couldn’t found any link so far.

    One more request, would you make a video tutorial on adding words inside the dictionary and making essential changes in .gram file? It would be very helpful.

    Thanks a lot,

    Ryan

  29. punit says:

    hey i m not able to download hd videos also not able to watch them,
    link is not working:
    http://www.megavideo.com/?v=C9WDSWCK
    it just showing me a piracy notice.

Leave a Reply

Please note : I will help you only if you will show some efforts! Don't expect replies to simple queries, You can easily find answers for them through searching.

I'm a student and I work as well. So i might reply late here. If you have something important to discuss Or stuck somewhere in your project. Please post on my Facebook page for a quick response.