State-of-the-art speech recognition system written entirely in the Java programming language.
Sphinx is a speaker-independent, large-vocabulary, continuous speech recognizer released under a BSD-style license.
It is also a collection of open source resources and tools that allows developers and researchers to build speech recognition systems.
Sphinx-4 is a state-of-the-art speech recognition system written entirely in the Java programming language.
It was created through a collaboration between the Sphinx group at Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs (MERL), and Hewlett Packard (HP), with contributions from the University of California at Santa Cruz (UCSC) and the Massachusetts Institute of Technology (MIT).
Sphinx-4 started out as a port of Sphinx-3 to the Java programming language, but evolved into a recognizer designed to be much more flexible than Sphinx-3, thus becoming an excellent platform for speech research.
The CMU Sphinx Group is releasing a set of reasonably mature, world-class speech components that provide a basic level of technology to anyone interested in creating speech-using applications without the once-prohibitive initial investment in research and development. The same components are open to peer review by all researchers in the field, and are used for linguistic research as well.
Note, however, that Sphinx is not a final product. Those with a certain level of expertise can achieve great results with the versions of Sphinx available here, but a naive user will certainly need further help. In other words, the software available here is meant for expert users, not for users with no experience in speech.
System requirements
- Java 2 SDK, Standard Edition 5.0 or later
- Ant 1.6.0 or later
- Subversion (svn), but only if you want to interact directly with the svn tree (which is recommended).
What's new in CMU Sphinx 4 1.0 Beta 5:
- Large arbitrary-order language models
- Simplified and reworked model loading code
- Raw configuration and demos
- HTK model loader
CMU Sphinx 4 1.0 Beta 5
- Runs on: Mac OS X (PPC & Intel)
- File size: 55.6 MB
- Main category: Audio