Clin Exp Otorhinolaryngol.  2012 Apr;5(Suppl 1):S65-S68.

Noise Reduction Using Wavelet Thresholding of Multitaper Estimators and Geometric Approach to Spectral Subtraction for Speech Coding Strategy

Affiliations
  • 1Department of Computer Science and Institute of Biomedical Engineering, National Chiao Tung University, Taiwan. c.t.choi@ieee.org
  • 2Faculty of Computer Systems & Software Engineering, Universiti Malaysia Pahang, Pahang, Malaysia.
  • 3Department of Electrical Engineering, National Chiao Tung University, Taiwan.

Abstract


OBJECTIVES
Noise reduction using wavelet thresholding of multitaper estimators (WTME) and geometric approach to spectral subtraction (GASS) can improve speech quality of noisy sound for speech coding strategy. This study used Perceptual Evaluation of Speech Quality (PESQ) to assess the performance of the WTME and GASS for speech coding strategy.
METHODS
This study included 25 Mandarin sentences as test materials. Environmental noises including the air-conditioner, cafeteria and multi-talker were artificially added to test materials at signal to noise ratio (SNR) of -5, 0, 5, and 10 dB. HiRes 120 vocoder WTME and GASS noise reduction process were used in this study to generate sound outputs. The sound outputs were measured by the PESQ to evaluate sound quality.
RESULTS
Two figures and three tables were used to assess the speech quality of the sound output of the WTME and GASS.
CONCLUSION
There is no significant difference between the overall performance of sound quality in both methods, but the geometric approach to spectral subtraction method is slightly better than the wavelet thresholding of multitaper estimators.

Keyword

Cochlear implant; Speech production measurement

MeSH Terms

Clinical Coding
Cochlear Implants
Noise
Signal-To-Noise Ratio
Speech Production Measurement

Figure

  • Fig. 1 HiRes 120 vocoder with noise reduction process.

  • Fig. 2 Structure of perceptual evaluation of speech quality model (6).

  • Fig. 3 Perceptual Evaluation of Speech Quality (PESQ) evaluation method for testing quality with environmental noise (7).

  • Fig. 4 The mean value of each signal to noise ratio (SNR) based on difference background environment.

  • Fig. 5 The overall mean value of each signal to noise ratio (SNR).


Reference

1. Boll S. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust. 1979; 4. 27(2):113–120.
Article
2. Berouti M, Schwartz R, Makhoul J. Enhancement of speech corrupted by acoustic noise. IEEE International Conference on Acoustics, Speech and Signal Processing. 1979 Apr 2-4; Washington, DC, USA. p. 208–211.
Article
3. Hu Y, Loizou PC. Speech enhancement based on wavelet thresholding the multitaper spectrum. IEEE Trans Speech Audio Process. 2004; 1. 12(1):59–67.
4. Lu Y, Loizou PC. A geometric approach to spectral subtraction. Speech Commun. 2008; 1. 50(6):453–466. PMID: 19122867.
Article
5. Martin R. Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans Speech Audio Process. 2001; 7. 9(5):504–512.
Article
6. Rix AW, Beerends JG, Hollier MP, Hekstra AP. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs. IEEE International Conference on Acoustics, Speech and Signal Processing. 2001 May 7-11; Salt Lake City, UT, USA. p. 749–752.
Article
7. International Telecommunication Union. Perceptual evaluation of speech quality (PESQ): an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs [Internet]. 2001. cited 2011 Nov 30. Geneva, Switzerland: International Telecommunication Union;Available from: http://www.itu.int/rec/dologin_pub.asp?lang=e&id=T-REC-P.862-200102-I!!SOFT-ZST-E&type=items.
8. Choi CT, Hsu CH, Tsai WY, Lee YH. A vocoder for a novel cochlear implant stimulating strategy based on virtual channel technology. 13th International Conference on Biomedical Engineering. 2008 Dec 3-6; Singapore. p. 310–313.
Article
9. Riedel KS, Sidorenko A. Minimum bias multiple taper spectral estimation. IEEE Trans Signal Process. 1995; 1. 43(1):188–195.
Article
10. Walden AT, Percival DB, McCoy EJ. Spectrum estimation by wavelet thresholding of multitaper estimators. IEEE Trans Signal Process. 1998; 12. 46(12):3153–3165.
Article
Full Text Links
  • CEO
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr