submit urlsubmit rss feedadd directory

article

Speech recognition (in many contexts, also known as automatic speech recognition, computer speech recognition or voice recognition) is the process of converting a speech signal to a set of words, by means of an algorithm implemented as a computer program. Speech recognition applications that have emerged over the last years include voice dialing (e.g., Call home), call routing (e.g., I would like to make a collect call), simple data entry (e.g., entering a credit card number), and preparation of structured documents (e.g., a radiology report).

Defining the Problem


According to "Survey of the State of the Art in Human Language Technology (1997) by Ron Cole et all" Speech recognition is the process of converting an acoustic signal, captured by a microphone or a telephone, to a set of words. The recognized words can be the final results, for such applications as commands & control, data entry, and document preparation. They can also serve as the input to further linguistic processing in order to achieve text formating or speech understanding.

Speech recognition systems can be characterized by many parameters as in the table below.

Parameters Range
Speaking Mode Isolated words to continuous speech
Speaking Style Read speech to spontaneous speech
Enrollment Speaker-dependent to Speaker-independent
Vocabulary Small (< 20 words) to large (> 20,000 words)
Language Model Finite-state to context-sensitive
Perplexity Small (< 10) to large (> 100)
SNR High (> 30 dB) to low (< 10 dB)
Transducer Voice-cancelling microphone to telephone
An isolated-word speech recognition system requires that the speaker pause briefly between words, whereas a continuous speech recognition system does not. Spontaneous, or extemporaneously generated, speech contains disfluencies and is much more dificult to recognize than speech read from script. Some systems require speaker enrollment (a user must provide samples of his or her speech before using them) whereas other systems are said to be speaker-independent, in that no enrollment is necessary. Some of the other parameters depend on the specific task. Recognition is generally more difficult when vocabularies are large or have many similar-sounding words. When speech is produced in a sequence of words, language models or artificial grammars are used to restrict the combination of words. The simplest language model can be specified as a finite-state network, where the permissible words following each word are explicitly given. More general language models approximating natural language are specified in terms of a context-sensitive grammar. One popular measure of the difficulty of the task, combining the vocabulary size and the language model, is perplexity, loosely defined as the geometric mean of the number of words that can follow a word after the language model has been applied. In addition, there are some external parameters that can affect speech recognition system performance, including the characteristics of the environmental noise and the type and the placement of the microphone.

More on [ Speech recognition ]


directory of related categories

 

 

 

 
Speech_Technology RSS feed
Speech Technology - Twitter Search

すごーい!RT @DLtoday New animatronic technology allows guests to see "Mr. Lincoln" enunciate every word of his speech: http://bit.ly/66mhAX
itsadisneyworld (MiMi ミミ) Wed, 16 Dec 2009 09:28:18 -0000
すごーい!RT @DLtoday New animatronic technology allows guests to see "Mr. Lincoln" enunciate every word of his speech: http://bit.ly/66mhAX
TBI-based speech-disorder study to begin with radical new technology. http://bit.ly/7Z5G8k
Club_Z (Sharon Lopez) Wed, 16 Dec 2009 02:27:24 -0000
TBI-based speech-disorder study to begin with radical new technology. http://bit.ly/7Z5G8k
Technology is going to make it a sad day for free speech. Nothing is private anymore....
rockercolu (Lance Rocker) Wed, 16 Dec 2009 00:49:08 -0000
Technology is going to make it a sad day for free speech. Nothing is private anymore....
Nuance brings its speech-recognition technology to the iPhone - http://shar.es/a8KJl
erkaufman (Elliot Kaufman) Tue, 15 Dec 2009 15:18:24 -0000
Nuance brings its speech-recognition technology to the iPhone - http://shar.es/a8KJl
RT @eMusing: Rt @wavety Google's latest uber disruptive technology text to speech... Awesome!!!! http://bit.ly/81SGyT #fb
danielcfng (Daniel Ng) Tue, 15 Dec 2009 15:00:50 -0000
RT @eMusing: Rt @wavety Google's latest uber disruptive technology text to speech... Awesome!!!! http://bit.ly/81SGyT #fb
Wagan EL2564 Bluetooth Wireless Handsfree Car Kit with Text to Speech Technology - Wagan EL2564 Bluetooth... http://tumblr.com/xrf4pik1i
bikess (Amazing Store) Tue, 15 Dec 2009 14:49:17 -0000
Wagan EL2564 Bluetooth Wireless Handsfree Car Kit with Text to Speech Technology - Wagan EL2564 Bluetooth... http://tumblr.com/xrf4pik1i

 
Subscribe to Speech_Technology RSS feed

directory of related sites

Answers For Executives - Site addressing typical questions executives might have about speech recognition for offices and companies. Contains product reviews, installation guidelines, provocative analysis of the industry for people who would rather use voice technology instead of typing.
Meta Description: [ VoiceWizard is a professional provider of speech recognition and language product information and services, including speech recognition software, language translation software, text to speech, developer tools, speech enabled web sites, music and sound compression, product evaluations and reviewe... ]

Applied Technologies on Language and Speech: IberVox ASR - Barcelona based firm offering speech recognition, TTS and VoiceXML products for Spanish, Catalan, Basque, Portuguese, and the varieties of Spanish spoken in Latin America.

Best Mobile Phone - Online Mobile Phone shop Offers a best mobile phone with best mobile phone tariff with latest feature on payg mobile phones. You can compare mobile phone and also mobile phone tariff with any other mobile phone offer

Conversay - Computational Computing Corp. sells application specific speech enabled products including voice responsive browsers, office messaging system, speech SDK's for mobile devices, telephony speech servers.

500 enCue Communications - Deploys Microsoft speech server technology focused on three software solutions and product areas: the Internet, mobile reach and call centre/cnterprise interfaces. Based South Africa.

Game Commander - Speaker independent (no training required) voice control software for Windows games replaces keystrokes with voice commands for popular games. Template files, patches, message boards, downloads, free trial version.
Meta Description: [ Speech recognition (voice command and control) for games. Use voice commands to send keystrokes to Windows games. ]

Grover Industries, Inc. - Provides command and control applications for internet and desktop contexts.
Meta Description: [ voice recognition software ]

Guardian Business Solutions, Inc. - Warehouse applications using voice, wireless, and wearable technologies to provide automation and productivity solutions for warehouse management, distribution, and manufacturing systems. Partnered with Syvox.
Meta Description: [ Providing voice and wireless technology to manufacturing, distribution, and warehouse management applications. Quality solutions through speech technology. ]

HAL Hits the Home - Voice recognition software (with product names HAL2000) for Windows 95/98 that supports air conditioning, telephony, infrared, Internet, X-10 and security - for use in home systems. Site gives audio examples of the interactions possible.

Hand Held - Site sells a large-vocabulary continuous speech recognizer that runs on a PDA. Current offering (free beta download) is a voice enabled address book for Win 95, Win 98, Win NT, Win CE, Pocket PC.

IBM Software - Speech Recognition - Big Blue's ViaVoice offerings in the desktop continuous speech dictation arena. Competes with Dragon Systems. Has mobile dictation and telephony products as well. Has continuous speech recognition for the Apple Macintosh.
Meta Description: [ ViaVoice technology, now available to consumers on the Windows, Macintosh and handheld computer platforms, can afford a 'multi-modal' environment, freeing users from dependence on the mouse, keyboard and stylus for many applications. ]

IMSI Software - IMSI Utilities Group licenses IBM ViaVoice technology to produce their own line of VoiceDirect dictation software.

Media Management - 20/20 Speech develops and supplies proprietary speech recognition and text to speech software products and solutions for portable devices and media management applications such as subtitle generation or synchronization of video to legal dispositions.
Meta Description: [ Aurix - experts in speech - speech recognition, audio mining, speech detection and alignment software - the first choice technology partner for integrators and contractors who are building systems that are either controlled or transacted by the human voice. ]

Mobilethink - Danish startup specializing in developing mobile phone speech solutions that are integrated with Internet information systems.
Meta Description: [ Mobilethink provides advanced device management solutions to mobile operators, service providers and handset manufacturers around the world. ]

Natspeak Information Pages - An unusual compendium of insider knowledge about Dragon Systems NaturallySpeaking speech recognition products. Downloadable utilities, tips for improved usage as well as a detailed, programmer oriented explanation of techniques for adding extended macro capabilities using Python code and custom grammar files. Generally oriented towards versions 4.0 and lower but still insightful for later versions. Hosted by Synapse Adaptive, provider of a wide range of assistive technologies.
Meta Description: [ NaturallySpeaking, naturallyspeaking, naturallyspeaking deluxe, dragon naturallyspeaking, dragon naturallyspeaking med, dragonlaw, dragontech, dragonmed, dragonextra!, speech, speech-to-text, voice-to-text, voice recognition, dragondictate is the best speach products that can be purchased., ... ]

Natural Language Recognition - Simplis, Inc, provides a Java based natural language speech recognition interface designed to simplify access to existing programs and web applications.

Open Source Speech Recognition System - Carnegie Mellon Sphinx project. Real-time continuous speech recognition system. Downloadable source for Linux/Unix and Windows NT or later.

PGPfone - Pretty Good Privacy internet phone allows encrypted talking over a network. MAC and PC versions available.

Philips Speech Processing - Worldwide provider of speech recognition solutions for telephony, voice portals, automotive and consumer embedded systems, medical and legal dictation with multi-lingual capabilities. SDK's available for inclusion of speech recognition in business systems.

Scansoft - Dragon Naturally Speaking - Acquired Lernout Hauspie, Dragon Systems speech recognition and synthesis resources and products. Also known for digital imaging products.

Speak Freely for Windows - A free Internet phone program for talking to someone PC-to-PC over a network, i.e., a voice chat program. No banner ads. Features encryption hooks, answering machine, text chat, cross platform versions for Unix/Linux. Optional facility for a buddy addressing server to list who else is on-line similar to the commercial instant messaging programs. Integrates with ICQ. Good voice quality. Optional C++ source code (free) for those interested in learning about with Internet speech protocols.

Speech and Handwriting Recognition for Wireless - Advanced Recognition Technologies, Inc. - designs, develops and distributes speech and handwriting recognition software products and technologies focusing on embedded software for cellular devices, mobile communicators, and PDAs.

Speech FX - An ongoing speech recognition project for the Apple Macintosh, currently concentrating on enhancing command and control.
Meta Description: [ Accettura.com is home to award winning software like Keep Me Online, and SpeechFX; it features MacPR, a press release post, and Macintosh eCards, to send virtual greetings to your friends. ]

Speech Recognition for In-car Use - Germany based Böhme Datentechnik deals in hands-free systems having echo and noise cancellation features.
Meta Description: [ Hands-free system and voice recognition system with echo and noise cancellation. One example is in-car use. ]

Speech Recognition News and Studies - TMA Associates publishes Speech Recognition Update, an industry newsletter on the business, products, markets, and companies in speech recognition, text-to-speech, and speaker verification. The site contains headlines and recent news, as well as descriptions of TMA conferences and market studies in speech recognition.
Meta Description: [ Speech recognition (voice recognition, ASR), text-to-speech, and speaker verification news, market analysis, Telephony VUI conference, and consulting. ]

Speech Technology Center - Russian organization providing unusual variety of speech processing products and services for research and development, speech recognition, voice verification, speaker identification, noise reduction in speech signals, noise cancellation, forensic examination, audio analysis, logging and communication channel protection.
Meta Description: [ Speech Technology Center : noise cancellation, noise reduction, DSP board, embedded solutions, voice identification, audio restoration, anti - terrorist ]

Speech Technology Magazine - Online edition of the magazine, plus information on an annual 'SpeechTEK' speech technology business exposition.

SpeechUp! Blog - A weblog for speech enthusiasts and professionals alike. Created to serve as an information exchange for the community of both speech recognition developers and business users.

Talking Desktop - Speech recognition, text-to-speech software transforms a Windows computer into a conversational desktop companion. Provides dictation, web navigation, voice email, web cams, on-line news, weather maps, stock ticker, X10 home automation, MP3 music player, 3D avatar, disabilities features. Project in progress.
Meta Description: [ Talking Desktop sells talking computer software that does dictation and operates your computer by voice command. This interactive program has many speech recognition, voice control and artificial intelligence features. This software will listen and respond verbally using natural text-to-speech. ]

Verbatim Careers Institute - Offers training for new careers made possible by advances in speech recognition technology.
Meta Description: [ Verbatim Careers Institute, offering training for new careers utilizing the latest advancements in voice recognition technology. Broadcast Captioning, Communication Access Realtime Translation (CART), and Court Reporting. ]

Voxware - Provides voice-based technology that enables warehouse workers to achieve higher levels of productivity and accuracy while reducing operational costs.
Meta Description: [ . ]

Warehouse Management Voice Technology Solutions - BCP's Accord warehouse management system uses speech technology to provide voice directed picking, goods receiving, pallet movements, and stock checking. Fully integrated voice software enables real-time interaction with the system.
Meta Description: [ BCP is a leading UK software house, specialising in Supply Chain Management solutions for the Retail and Wholesale Distribution industries. BCP's Accord software offers full Head Office, Depot, Branch and Store functions for Buying, Stock Control, Warehouse Management, Logistics, Order Management... ]

Wenr Corporation - A holding company of technology companies. Includes history and current portfolio.

Speech_Technology related videos

Inventor: Lines are blurring between humans and machines

Inventor and futurist Ray Kurzweil illustrates the exponential evolution of technology, predicting a sharp rise in computing capability, robotics and life expectancy within the next 15 years. He outlines the shocking ways we'll use technology to augment our own capabilities, forever blurring the lines between human and machine.

Speech_Technology related videos

 

HOMEADVERTISINGABOUT US

articlesartsbusinesscomputersgameshealthhospitalshomekids & teensnewsmobilephysiciansrecreationreferenceregionalscienceshoppingsocietysportsworld


Submit a Site About Become an Editor