Automatic language recognition via spectral and token based approaches
January 1, 2007
Book Chapter
Published in:
Chapter 41 in Springer Handbook of Speech Processing and Communication, 2007, pp. 811-24.
Summary
Automatic language recognition from speech consists of algorithms and techniques that model and classify the language being spoken. Current state-of-the-art language recognition systems fall into two broad categories: spectral-based and token-sequence-based approaches. In this chapter, we describe algorithms for extracting features and building models that represent these types of language cues, as well as systems that make recognition decisions using one or more of these cues. A performance assessment of these systems is also provided, in terms of both accuracy and computational considerations, using the National Institute of Standards and Technology (NIST) language recognition evaluation benchmarks.