• Studies on inter-speaker variability in speech and its application in automatic speech recognition

    • Fulltext

       

        Click here to view fulltext PDF


      Permanent link:
      http://www.ias.ac.in/article/fulltext/sadh/036/05/0853-0883

    • Keywords

       

      Vowel-normalization; vocal-tract length normalization; speech-scale; frequency-warping; linear transformation of cepstra; speaker-adaptation.

    • Abstract

       

      In this paper, we give an overview of the problem of inter-speaker variability and its study in many diverse areas of speech signal processing. We first give an overview of vowel-normalization studies that minimize variations in the acoustic representation of vowel realizations by different speakers. We then describe the universal-warping approach to speaker normalization which unifies many of the vowel normalization approaches and also shows the relation between speech production, perception and auditory processing. We then address the problem of inter-speaker variability in automatic speech recognition (ASR) and describe techniques that are used to reduce these effects and thereby improve the performance of speaker-independent ASR systems.

    • Author Affiliations

       

      S Umesh1

      1. Department of Electrical Engineering, Indian Institute of Technology-Madras, Chennai 600 036, India

© 2017 Indian Academy of Sciences, Bengaluru.