Digital Speech Processing (Prof. S K. Das Mandal, IIT Kharagpur) | Electronics and Electrical Engineering

Digital Speech Processing

Digital Speech Processing. Instructor: Prof. S K. Das Mandal, Advanced Technology Development Centre, IIT Kharagpur. Oral Speech may be the most natural, common and direct mode of human communication. Since the middle of the last century, Speech has become an area of intense and active research and development (R&D) to become a prime means of direct Human-Computer Interactions (HCI). The pace of such R&D has further got boosted with the general abundance of cheap computing power in the form of PC, PDA or Mobile Handset. While man to machine in speech mode is yet to reach the minimum threshold level for wide-spread deployment, spoken messages directly by machine. This need research in speech science and development of speech technology. The course provides the foundation knowledge on speech production and perception along with processing of speech signals in the digital domain. (from nptel.ac.in)

Introduction

Lecture 01 - Introduction to Digital Speech Processing

Lecture 02 - Digitization and Recording

Lecture 03 - Review of DSP Concepts

Lecture 04 - Review of DSP Concepts (cont.)

Lecture 05 - Human Speech Production and Source Filter Model

Lecture 06 - Place and Manner at Articulation

Lecture 07 - Articulatory and Acoustic Phonetics

Lecture 08 - Handson on Acoustic Phonetics

Lecture 09 - Uniform Tube Modeling of Speech Processing, Part I

Lecture 10 - Uniform Tube Modeling of Speech Processing, Part II

Lecture 11 - Uniform Tube Modeling of Speech Processing, Part III

Lecture 12 - Uniform Tube Modeling of Speech Processing, Part IV

Lecture 13 - Uniform Tube Modeling of Speech Processing, Part V

Lecture 14 - Uniform Tube Modeling of Speech Processing, Part VI

Lecture 15 - Uniform Tube Modeling of Speech Processing, Part VII

Lecture 16 - Speech Perception, Part I

Lecture 17 - Speech Perception, Part II

Lecture 18 - Speech Perception, Part III

Lecture 19 - Time Domain Methods in Speech Processing

Lecture 20 - Time Domain Methods in Speech Processing (cont.)

Lecture 21 - Introduction to Linear Prediction

Lecture 22 - Autocorrelation Method of LPC Analysis

Lecture 23 - Autocorrelation Method of LPC Analysis (cont.)

Lecture 24 - Lattice Formulations of Linear Prediction

Lecture 25 - Lattice Formulations of Linear Prediction (cont.)

Lecture 26 - Overview of Short-Time Fourier Transform (STFT)

Lecture 27 - Short-Time Fourier Transform Analysis

Lecture 28 - Short-Time Fourier Transform Synthesis

Lecture 29 - Lattice Formulations of Linear Prediction

Lecture 30 - Lattice Formulations of Linear Prediction (cont.)

Lecture 31 - Segmental and Supra-Segmental Features of Speech Signal

Lecture 32 - Cepstral Transform Coefficients (CC) Parameters Extraction

Lecture 33 - Mel Frequency Cepstral Coefficients

Lecture 34 - MFCC Features Vector

Lecture 35 - Fundamental Frequency (F0) Detection of Speech Signal

Lecture 36 - Frequency Domain Fundamental Frequency Detection Algorithms

Lecture 37 - Text to Speech Synthesis

Lecture 38 - Text to Speech Synthesis (cont.)

Lecture 39 - Automatic Speech Recognition

Lecture 40 - Statistical Modeling of Automatic Speech Recognition

Lecture 41 - Speech Based Technology Development for e-Learning

Lecture 42 - Prosody Modeling

Lecture 43 - Fundamental Frequency Contour Modeling

Lecture 44 - Fundamental Frequency Contour Modeling (cont.)

References

Digital Speech Processing
Instructor: Prof. S K. Das Mandal, Advanced Technology Development Centre, IIT Kharagpur. This course provides the foundation knowledge on speech production and perception along with processing of speech signals in the digital domain.