Automatic Speech Recognition: The Development of the SPHINX System

Author:   Kai-Fu Lee
Publisher:   Springer-Verlag New York Inc.
Edition:   Softcover reprint of the original 1st ed. 1989
Volume:   62
ISBN:  

9781461366249


Pages:   207
Publication Date:   03 March 2013
Format:   Paperback
Availability:   Manufactured on demand   Availability explained
We will order this item for you from a manufactured on demand supplier.

Our Price $448.77 Quantity:  
Add to Cart

Share |

Automatic Speech Recognition: The Development of the SPHINX System


Add your own review!

Overview

Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates; slow response time (minutes to hours) to instantaneous response time. These characteristics taken together increase the computational complexity of the problem by several orders of magnitude. Further, speech provides a challenging task domain which embodies many of the requirements of intelligent behavior: operate in real time; exploit vast amounts of knowledge, tolerate errorful, unexpected unknown input; use symbols and abstractions; communicate in natural language and learn from the environment. Voice input to computers offers a number of advantages. It provides a natural, fast, hands free, eyes free, location free input medium. However, there are many as yet unsolved problems that prevent routine use of speech as an input device by non-experts. These include cost, real time response, speaker independence, robustness to variations such as noise, microphone, speech rate and loudness, and the ability to handle non-grammatical speech. Satisfactory solutions to each of these problems can be expected within the next decade. Recognition of unrestricted spontaneous continuous speech appears unsolvable at present. However, by the addition of simple constraints, such as clarification dialog to resolve ambiguity, we believe it will be possible to develop systems capable of accepting very large vocabulary continuous speechdictation.

Full Product Details

Author:   Kai-Fu Lee
Publisher:   Springer-Verlag New York Inc.
Imprint:   Springer-Verlag New York Inc.
Edition:   Softcover reprint of the original 1st ed. 1989
Volume:   62
Dimensions:   Width: 15.50cm , Height: 1.20cm , Length: 23.50cm
Weight:   0.355kg
ISBN:  

9781461366249


ISBN 10:   1461366240
Pages:   207
Publication Date:   03 March 2013
Audience:   Professional and scholarly ,  Professional & Vocational
Format:   Paperback
Publisher's Status:   Active
Availability:   Manufactured on demand   Availability explained
We will order this item for you from a manufactured on demand supplier.

Table of Contents

1. Introduction.- 2. Hidden Markov Modeling of Speech.- 3. Task and Databases.- 4. The Baseline SPHINX System.- 5. Adding Knowledge.- 6. Finding a Good Unit of Speech.- 7. Learning and Adaptation.- 8. Summary of Results.- 9. Conclusion.- Appendix I. Evaluating Speech Recognizers.- I.1. Perplexity.- I.2. Computing Error Rate.- Appendix H. The Resource Management Task.- II.1. The Vocabulary and the SPHINX Pronunciation Dictionary.- II.2. The Grammar.- II.3. Training and Test Speakers.- Appendix III. Examples of SPHINX Recognition.- References.

Reviews

Author Information

Tab Content 6

Author Website:  

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions
Latest Reading Guide

Aorrng

Shopping Cart
Your cart is empty
Shopping cart
Mailing List