A bio-inspired feature extraction for robust speech recognition.

Zouhir Y, Ouni K - Springerplus (2014)

Bottom Line: The proposed method is motivated by a biologically inspired auditory model which simulates the outer/middle ear filtering by a low-pass filter and the spectral behaviour of the cochlea by the Gammachirp auditory filterbank (GcFB). The evaluation results show that the proposed method gives better recognition rates than classic techniques such as Perceptual Linear Prediction (PLP), Linear Predictive Coding (LPC), Linear Prediction Cepstral Coefficients (LPCC) and Mel Frequency Cepstral Coefficients (MFCC). The recognition system used is based on Hidden Markov Models with continuous Gaussian Mixture densities (HMM-GM).


Affiliation: Research Unit: Signals and Mechatronic Systems, SMS, Higher School of Technology and Computer Science (ESTI), University of Carthage, Carthage, Tunisia.

ABSTRACT
In this paper, a feature extraction method for robust speech recognition in noisy environments is proposed. The proposed method is motivated by a biologically inspired auditory model which simulates the outer/middle ear filtering by a low-pass filter and the spectral behaviour of the cochlea by the Gammachirp auditory filterbank (GcFB). The speech recognition performance of the method is tested on speech signals corrupted by real-world noise. The evaluation results show that the proposed method gives better recognition rates than classic techniques such as Perceptual Linear Prediction (PLP), Linear Predictive Coding (LPC), Linear Prediction Cepstral Coefficients (LPCC) and Mel Frequency Cepstral Coefficients (MFCC). The recognition system used is based on Hidden Markov Models with continuous Gaussian Mixture densities (HMM-GM).
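
As a rough illustration of the cochlear stage named in the abstract, the sketch below generates the impulse response of a single gammachirp filter in its commonly cited analytic form, a gamma envelope multiplying a carrier with a logarithmic chirp term. The parameter values (`n=4`, `b=1.019`, `c=-2.0`), the ERB formula and the function names are assumptions made for illustration; the filterbank configuration actually used in the paper is not given in this excerpt.

```python
import numpy as np


def erb_hz(f_hz):
    """Equivalent rectangular bandwidth in Hz (linear approximation, assumed here)."""
    return 24.7 + 0.108 * f_hz


def gammachirp_ir(fr_hz, sample_rate, duration_s=0.025,
                  n=4, b=1.019, c=-2.0, phase=0.0):
    """Impulse response of one gammachirp filter, peak-normalised.

    g(t) = t^(n-1) * exp(-2*pi*b*ERB(fr)*t) * cos(2*pi*fr*t + c*ln(t) + phase)
    The default n, b and c are illustrative assumptions, not the paper's values.
    """
    n_samples = int(round(duration_s * sample_rate))
    # Start at one sample period so that ln(t) is defined.
    t = np.arange(1, n_samples + 1) / sample_rate
    envelope = t ** (n - 1) * np.exp(-2.0 * np.pi * b * erb_hz(fr_hz) * t)
    carrier = np.cos(2.0 * np.pi * fr_hz * t + c * np.log(t) + phase)
    g = envelope * carrier
    return g / np.max(np.abs(g))
```

Setting `c=0.0` removes the chirp term and reduces the response to a gammatone filter, which is a convenient sanity check. A filterbank would be built by calling `gammachirp_ir` once per channel frequency and convolving the low-pass-filtered signal with each response.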




Figure 3: Block diagram of PLP technique (Hermansky 1990).

Mentions: Perceptual Linear Prediction (PLP) is based on characteristics of the human auditory system. It is similar to the LPC technique, except that, before modelling by the autoregressive all-pole transfer function, the speech power spectrum is transformed by a Bark-scale filter bank, an equal-loudness pre-emphasis and an intensity-loudness conversion. The block diagram of the PLP technique is shown in Figure 3 (Hermansky 1990; Beigi 2011).
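
The processing chain described above can be sketched for a single speech frame as follows. This is a minimal illustration, not the authors' implementation: the Bark warping, critical-band curve and equal-loudness formula are the standard forms from the PLP literature, while the function names, the number of bands (`n_bands=17`), the model order (`order=8`), the Hamming window and the orientation of the asymmetric critical-band curve are assumptions made for the example.

```python
import numpy as np


def hz_to_bark(f_hz):
    """Bark warping used in PLP: 6 * asinh(f / 600)."""
    return 6.0 * np.arcsinh(np.asarray(f_hz, dtype=float) / 600.0)


def critical_band_weights(d):
    """Piecewise critical-band curve evaluated at Bark offsets d."""
    w = np.zeros_like(d)
    rising = (d >= -1.3) & (d <= -0.5)
    flat = (d > -0.5) & (d < 0.5)
    falling = (d >= 0.5) & (d <= 2.5)
    w[rising] = 10.0 ** (2.5 * (d[rising] + 0.5))
    w[flat] = 1.0
    w[falling] = 10.0 ** (-1.0 * (d[falling] - 0.5))
    return w


def equal_loudness(f_hz):
    """Equal-loudness pre-emphasis weight at frequency f_hz."""
    w2 = (2.0 * np.pi * np.asarray(f_hz, dtype=float)) ** 2
    return ((w2 + 56.8e6) * w2 ** 2) / ((w2 + 6.3e6) ** 2 * (w2 + 0.38e9))


def levinson_durbin(r, order):
    """Solve the autocorrelation normal equations for an all-pole model."""
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def plp_coefficients(frame, sample_rate, n_bands=17, order=8):
    """Return all-pole coefficients and residual energy for one frame."""
    # Short-term power spectrum of the windowed frame.
    power = np.abs(np.fft.rfft(frame * np.hamming(len(frame)))) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)

    # Bark-scale filter bank: band centres equally spaced on the Bark axis.
    bark_bins = hz_to_bark(freqs)
    centres = np.linspace(0.0, bark_bins[-1], n_bands)
    # Offset is (centre - bin) so the lower skirt is the shallower one
    # (an assumed orientation for this sketch).
    weights = critical_band_weights(centres[:, None] - bark_bins[None, :])
    band_energy = weights @ power

    # Equal-loudness pre-emphasis at the band centre frequencies.
    centre_hz = 600.0 * np.sinh(centres / 6.0)
    loud = equal_loudness(centre_hz) * band_energy
    # The edge bands (0 Hz and Nyquist) are replaced by their neighbours.
    loud[0], loud[-1] = loud[1], loud[-2]

    # Intensity-loudness conversion: cube-root amplitude compression.
    compressed = np.cbrt(loud)

    # Inverse DFT gives an autocorrelation-like sequence for the all-pole fit.
    autocorr = np.fft.irfft(compressed)[: order + 1]
    return levinson_durbin(autocorr, order)
```

Calling `plp_coefficients(frame, 8000)` on a 256-sample frame returns a coefficient vector of length `order + 1` whose first entry is 1, together with the prediction error energy. A full front end would apply this to overlapping frames and usually convert the all-pole coefficients to cepstral coefficients before recognition.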

