SPEECH AND AUDIO SIGNAL PROCESSING.pdf

Viewer
Transcript

PED – 136

*PED136*

II Semester M.E. (Electronics & Communication) Degree Examination, January 2015 (2K9 Scheme) EL 215 : SPEECH AND AUDIO SIGNAL PROCESSING Time : 3 Hours

Max. Marks : 100

Instruction : Answer any five full questions. 1. a) With a neat block diagram, explain the speech production mechanism in terms of i) excitation along with the type of speech sounds produced and ii) system (filter). b) Write a note on : Articulators.

8 3

c) Explain the following : i) Formants, ii) Pitch, and iii) Spectrograms.

9

2. a) Define short time autocorrelation function. Discuss the clipping autocorrelation pitch detector algorithm.

8

b) Explain the significance of short time energy and short time average zero crossing rate in the context of speech signal processing.

6

c) Show that the short time energy can be expressed recursively as, En = a2 En–1 + x2 (n), where the window h (n), is defined as h (n) = anu(n), x (n) is the speech signal, and 0 < a < 1. Draw a digital network diagram of this equation.

6

PED – 136

-2-

*PED136*

3. a) Explain short time Fourier transform. Discuss the Fourier transform interpretation of STFT, specifically in terms of i) periodicity, ii) & recovering the signal x (n) from its STFT.

6

b) Discuss the filter bank summation method of short time synthesis and derive the expression for necessary constraints.

8

STFT c) Let x(n) ←⎯ ⎯ ⎯→ Xn (e jω)

i) If v(n) = x (n) + y (n), then show that

( )

( ) ( )

Vn e jω = Xn e jω + Yn e jω .

ii) If v (n) = x (n – k), then show that

( )

( )

Vn e jω = e−jωkXn e jω .

6

4. a) Define complex cepstrum and real cepstrum of a sequence x (n). Show that the complex cepstrum of a sequence can be computed as,

xˆ (n) =

−1 2πnj

1 xˆ (0) = 2π

∫

∫

π

−π

π

x′(e jω ) jωn e dω, n ≠ 0 & x(e j ω )

log X(e jω ) dω

−π

where symbols have usual meanings.

6

*PED136*

PED – 136

-3-

b) Find the cepstrum of the sequence whose Z-transform is V(z) =

1+ 0.98z−1

π π j −j −1 6 6 (1 − 0.9e z )(1 − 0.9e z−1)

.

6

c) Explain with a block diagram steps involved in computing the real cepstrum of a speech signal. Draw block diagram to explain how pitch of the speech signal is estimated from the computed cepstrum and justify the steps.

8

5. a) Define a pth order linear predictor. Show that the following relation holds for a pth order linea predictor s (n)

p

∑ k =1αks(n − k) + e(n),

where { α k} are LP

coefficients and e (n) is the prediction error.

4

b) Explain the autocorrelation method of LP analysis and show that i)

p

∑ k =1α k Rn ( i− k ) = Rn (i),

1 ≤ i ≤ p, and

assuming p = 4, write the above equation in the matrix form. ii) minimum mean squared prediction error En = Rn(0) − ∑ pk =1 αkRn (k) .

8

c) Using Durbin’s recursive algorithm compute the transfer function of 2nd order LP model for a speech signal having autocorrelation coefficients R (0) = 1, R (1) = 0.5 and R (2) = 0.25.

8

6. a) Explain the following : i) JND, ii) frequency masking and iii) temporal masking. b) Discuss two broad classes of speech quality measures.

12 8

PED – 136

-4-

*PED136*

7. a) With a block diagram explain ADPCM system with feed-forward adaptive quantization.

6

b) Discuss the salient features of the LD_CELP G.728 speech compression standard.

6

c) With a block diagram explain MPEG audio compression algorithm.

8

8. a) Explain the following in the context of speech signal processing. i) Dynamic time warping and ii) Hidden Markov model. b) Write a short note on Text to speech synthesis.

__________________

12 8