Project Development: Speech Processing, Proj05

Thursday, April 22, 2010

Speech Processing, Proj05

//Purdue Cal - ECE 595C.
//Spring 2010.
//Project 05.
//Copyright @ 2010 antonio081014  ;
//All codes are in Matlab.

Report Link:

1. Getting the Mel-Scale Frequencies Cepstral Coefficient.
    1.1 Using overlapping Triangular window.
    1.2 Calculate the energy for each filter band. (16 in total).
    1.3 Set all values (except energy values on these 16 frequencies) to be zeros.
    1.4 Do Discrete Cosine Transform.
   Plot the first feature through all the frames for one utterance. (trajectories).

2. Dynamic Time Warping with simple Euclidean and cosine distance.
    Here, I didn't add any constraints in DTW.

3. Getting the features for equal spaced frequencies' magnitude.

    Use equal spaced frequencies is sure not better than Mel Scale frequencies. This is why we use MFCCs instead of using these. MFCCs can better approximate the human auditory system.

No comments:

Post a Comment