Digital Speech Processing
Instructor: Prof. S. K. Das Mandal, Advanced Technology Development Centre, IIT Kharagpur. Oral speech may be the most natural, common and direct mode of human communication. Since the middle of the last century, speech has been an area of intense and active research and development (R&D), with the aim of making it a prime means of direct Human-Computer Interaction (HCI). The pace of this R&D has been further boosted by the general abundance of cheap computing power in the form of PCs, PDAs and mobile handsets. Both man-to-machine communication in speech mode, which has yet to reach the minimum threshold level for widespread deployment, and spoken messages delivered directly by machine need research in speech science and the development of speech technology. The course provides foundational knowledge of speech production and perception, along with the processing of speech signals in the digital domain.
(from nptel.ac.in)
Lecture 01 - Introduction to Digital Speech Processing
Lecture 02 - Digitization and Recording
Lecture 03 - Review of DSP Concepts
Lecture 04 - Review of DSP Concepts (cont.)
Lecture 05 - Human Speech Production and Source Filter Model
Lecture 06 - Place and Manner of Articulation
Lecture 07 - Articulatory and Acoustic Phonetics
Lecture 08 - Hands-on Acoustic Phonetics
Lecture 09 - Uniform Tube Modeling of Speech Processing, Part I
Lecture 10 - Uniform Tube Modeling of Speech Processing, Part II
Lecture 11 - Uniform Tube Modeling of Speech Processing, Part III
Lecture 12 - Uniform Tube Modeling of Speech Processing, Part IV
Lecture 13 - Uniform Tube Modeling of Speech Processing, Part V
Lecture 14 - Uniform Tube Modeling of Speech Processing, Part VI
Lecture 15 - Uniform Tube Modeling of Speech Processing, Part VII
Lecture 16 - Speech Perception, Part I
Lecture 17 - Speech Perception, Part II
Lecture 18 - Speech Perception, Part III
Lecture 19 - Time Domain Methods in Speech Processing
Lecture 20 - Time Domain Methods in Speech Processing (cont.)
Lecture 21 - Introduction to Linear Prediction
Lecture 22 - Autocorrelation Method of LPC Analysis
Lecture 23 - Autocorrelation Method of LPC Analysis (cont.)
Lecture 24 - Lattice Formulations of Linear Prediction
Lecture 25 - Lattice Formulations of Linear Prediction (cont.)
Lecture 26 - Overview of Short-Time Fourier Transform (STFT)
Lecture 27 - Short-Time Fourier Transform Analysis
Lecture 28 - Short-Time Fourier Transform Synthesis
Lecture 29 - Lattice Formulations of Linear Prediction
Lecture 30 - Lattice Formulations of Linear Prediction (cont.)
Lecture 31 - Segmental and Supra-Segmental Features of Speech Signal
Lecture 32 - Cepstral Transform Coefficients (CC) Parameters Extraction
Lecture 33 - Mel Frequency Cepstral Coefficients
Lecture 34 - MFCC Features Vector
Lecture 35 - Fundamental Frequency (F0) Detection of Speech Signal
Lecture 36 - Frequency Domain Fundamental Frequency Detection Algorithms
Lecture 37 - Text to Speech Synthesis
Lecture 38 - Text to Speech Synthesis (cont.)
Lecture 39 - Automatic Speech Recognition
Lecture 40 - Statistical Modeling of Automatic Speech Recognition
Lecture 41 - Speech Based Technology Development for e-Learning
Lecture 42 - Prosody Modeling
Lecture 43 - Fundamental Frequency Contour Modeling
Lecture 44 - Fundamental Frequency Contour Modeling (cont.)