Each audio segment from audio segmentation has the same acoustic characteristics, and the wavelet domain MFCC feature is used for each audio segment as the input of the clustering algorithm. However, most of these divided audio segments are not only one second, some even tens of seconds, how to extract the most important feature vector from these audio segments of different sizes to represent the entire audio segment data
Each audio segment divided out of audio has the same acoustic characteristics for each audio segment enterprising wavelet domain MFCC characteristics as input to the clustering algorithm. But most of these split audio segments, not just one second, some even tens of seconds, extract the most important feature vectors from these audio segments of varying sizes to represent the entire audio segment data