An Improved Bi-Level Thresholding Based Uncertainty Evaluation for Speech Enhancement in Non-Stationary Noises
-
2018-04-25 https://doi.org/10.14419/ijet.v7i2.24.12130 -
Speech enhancement, Noise estimation, EMD, Thresholding, Babble noise, Output SNR, and PESQ. -
Abstract
This paper proposes a new speech enhancement framework to improve the quality of speeches recorded under adverse acoustic environments based on the speech presence uncertainty. Since the uncertainty evaluation gives a more and clear discrimination about the speech and noise, this paper proposes a new uncertainty evaluation mechanism as a preprocessing mechanism to the noise suppression methods. This mechanism relates with energies of a noisy speech signal and classifies the speech segments and noise segments more perfectly. In addition to the quality enhancement, this approach also reduces the unnecessary computational burden over the speech processing system. Extensive simulations are carried out over the speech signals with different types of non-stationary noises like babble noise, exhibition noise, restaurant noise and train station noises and the performance is measured with the performance metrics namely the Output SNR, AvgSegSNR, PESQ and COMP. The comparative analysis of proposed approach over the conventional approaches shows an outstanding performance in all environments.
Â
 -
References
[1] S. F. Boll, “Suppression of acoustic noise in speech using spectral subtractionâ€, IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. 27, No. 2, pp. 113–120, 1979.
[2] Y. Hu, Subspace and multitaper methods for speech enhancement [Ph.D. dissertation], University of Texas at Dallas, Richardson, Tex, USA, 2003.
[3] Firas Jabloun and Benoît Champagne, “Incorporating the Human Hearing Properties in the Signal Subspace Approach for Speech Enhancementâ€, IEEE Transactions on Speech and Audio Processing, Vol. 11, No. 6, November 2003.
[4] D. O’Shaughnessy, Speech Communications Human and Machine, 2nd ed. New York: IEEE Press, 2000
[5] Philipos C. Loizou, “Speech Enhancement: theory and practiceâ€, 2nd Ed. CRC press, Taylor and Francis Group, 2013.
[6] R. Martin, “Noise power spectral density estimation based on optimal smoothing and minimum statistics,†IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504–512, July 2001.
[7] I. Cohen, “Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging,†IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 466–475, Sept. 2003.
[8] R. Yu, “A low-complexity noise estimation algorithm based on smoothing of noise power estimation and estimation bias correction,†in IEEE ICASSP, 2009, pp. 4421–4424.
[9] R. C. Hendricks, R. Heusdens, and J. Jensen, “MMSE based noise PSD tracking with low complexity,†IEEE ICASSP, pp.4266–4269, Mar. 2010.
[10] J. Taghia, N. Mohammadiha, J. Sang, V. Bouse, and R. Martin, “An evaluation of noise power spectral density estimation algorithms in adverse acoustic environments,†IEEE ICASSP, May 2011.
[11] Mahdi Parchami, Wei-Ping Zhu, Benoit CHampange, “Recent Developments in Speech Enhancement in the Short-Time Fourier Transform Domainâ€, IEEE Circuits and Systems Magazine, Volume: 16, Issue: 3, third quarter 2016.[12] J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, “Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors,†IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no.6, pp. 1741–1752, 2007.
[13] J. Jensen and R. C. Hendriks, “Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions,†IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 92–102, 2012.
[14] I. Cohen, “Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator,†IEEE Signal Process. Lett., vol. 9, no. 4, pp. 113–116, 2002.
[15] Van-Khanh MAI, Dominique PASTOR, Abdeldjalil AÃSSA-EL-BEY, Raphaël LE BIDAN, “Combined Detection and Estimation Based on Mean-Square Error Log-Spectral Amplitude for Speech Enhancementâ€, GRETSI 2017.
[16] Y. Hu and P. C. Loizou, “Techniques for estimating the ideal binary mask,†in Proc. 11th Int. Workshop Acoust. Echo Noise Control, 2008, pp. 154–157.
[17] U. Kjems et al., “Role of mask pattern in intelligibility of ideal binary-masked noisy speech,†J. Acoust. Soc. Amer., vol. 126, no. 3, pp.1415–1426, Sep. 2009.
[18] Y. Hu and P. C. Loizou, “Techniques for estimating the ideal binary mask,†in Proc. 11th Int.Workshop Acoust., Echo, Noise Control, 2008.
[19] G. Kim, Y. Lu, and P. C. Loizou, “An algorithm that improves speech intelligibility in noise for normal-hearing listener,†J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486–1494, Sep. 2009.
[20] Leo Lightburn, Enzo De sena, Alastair Moore, “Improving the perceptual quality of ideal binary masked speechâ€, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017.[21] Abhay Upadhyay, “Speech enhancement based on mEMD-VMD methodâ€, IEEE Electronics Letters, Vol.53, No.7, 2017.[22] Taufiq Hasan, “Suppression of Residual Noise From Speech Signals Using Empirical Mode Decompositionâ€, IEEE signal processing letters, Vol. 16, No. 1, 2009.
[23] S. Nageswara Rao, K.J. Shankar and C.D. Naidu, “An adaptive speech enhancement approach based on DCT and empirical mode decompositionâ€, In Proc. of International Conference on Communication and Signal Processing (ICCSP), 2016.[24] Yannis Kopsinis, Member, IEEE, and Stephen McLaughlin, “Development of EMD-Based Denoising Methods
[25] Inspired by Wavelet Thresholdingâ€, IEEE Transactions on Signal Processing, Vol. 57, No. 4, April 2009.[26] Taufiq Hasan and Md. Kamrul Hasan, “A Probabilistic Speech Enhancement Filter Utilizing the Constructive and Destructive Interference of Noiseâ€, 15th European Signal Processing Conference (EUSIPCO 2007), Poznan, Poland, September 3-7, 2007.
[27] Saggurti Nageswararao, K Jaya sankar, C.D Naidu, “Suppression of Non-Stationary Noises Through the Generalized Signal Detectorâ€, International Journal of Intelligent Engineering and Systems, Vol.11, No.1, 2018.
[28] S. S. Kalamkar and A. Banerjee, “On the Performance of Generalized Energy Detector under Noise Uncertainty in Cognitive Radioâ€, In: Proc. of National Conference on Communications (NCC), 2013.
[29] Jun Zhu, Yun Bai, “Analytical Optimization for Collaborative Double Threshold Energy Detection in Cognitive Radio Networkâ€, “Journal of Information & Computational Science 9: 13 (2012) 3875–3882.
[30] H.Urkowitz, “Energy detection of unknown deterministic signalsâ€, Proceedings of IEEE, vol.55, pp. 523-531, April 1967.
[31] Sun.C, Zhang.W, Letaief.K.B, “Cluster-based cooperative spectrum sensing in cognitive radio systemsâ€, in Proc. IEEE ICC’07, pp.2511-2515, 2007.
[32] Y. Hu and P. C. Loizou, “Evaluation of objective measures for speech enhancement,†in Proc. INTERSPEECH, Sep. 2006.
[33] Avinash Yadlapati, Dr. Hari Kishore Kakarla, “An Advanced AXI Protocol Verification using Verilog HDLâ€, Wulfenia Journal, ISSN: 1561-882X, Volume 22, Number 4, pp. 307-314, April 2015.
[34] P Ramakrishna, K. Hari Kishore, “Design of Low Power 10GS/s 6-Bit DAC using CMOS Technology “International Journal of Engineering and Technology(UAE), ISSN No: 2227-524X, Vol No: 7, Issue No: 1.5, Page No: 226-229, January 2018.
[35] A Murali, K. Hari Kishore, “Efficient and High Speed Key Independent AES Based Authenticated Encryption Architecture using FPGAs “International Journal of Engineering and Technology(UAE), ISSN No: 2227-524X, Vol No: 7, Issue No: 1.5, Page No: 230-233, January 2018.
[36] G.S.Spandana,K Hari Kishore “A Contemporary Approach For Fault Diagnosis In Testable Reversible Circuits By Employing The CNT Gate Library†International Journal of Pure and Applied Mathematics, ISSN No: 1314-3395, Vol No: 115, Issue No: 7, Page No: 537-542, September 2017.
[37] K Hari Kishore, CVRN Aswin Kumar, T Vijay Srinivas, GV Govardhan, Ch Naga Pavan Kumar, R Venkatesh “Design and Analysis of High Efficient UART on Spartran-6 and Virtex-7 Devicesâ€, International Journal of Applied Engineering Research, ISSN 0973-4562, Volume 10, Number 09 , pp. 23043-23052, June 2015.
[38] K Bindu Bhargavi, K Hari Kishore “Low Power BIST on Memory Interface Logicâ€, International Journal of Applied Engineering Research, ISSN 0973-4562, Volume 10, Number 08 , pp. 21079-21090, May 2015.
[39] Korraprolu Brahma Reddy, K Hari Kishore, “A Mixed Approach for Power Dissipation Reduction in Nanometer CMOS VLSI circuitsâ€, International Journal of Applied Engineering Research, ISSN 0973-4562 Volume 9, Number 18 , pp. 5141-5148, July 2014.
[40] Nidamanuri Sai Charan, Kakarla Hari Kishore "Reorganization of Delay Faults in Cluster Based FPGA Using BIST†Indian Journal of Science and Technology, ISSN No: 0974-6846, Vol No.9, Issue No.28, page: 1-7, July 2016.
[41] Sravya Kante, Hari Kishore Kakarla, Avinash Yadlapati,"Design and Verification of AMBA AHB-Lite protocol using Verilog HDL" International Journal of Engineering and Technology, E-ISSN No: 0975-4024, Vol No.8, Issue No.2, Page:734-741, May 2016.
[42] Bandlamoodi Sravani, K Hari Kishore, “An FPGA Implementation of Phase Locked Loop (PLL)â€, International Journal of Applied Engineering Research, ISSN 0973-4562, Volume 10, Number 14 , pp. 34137-34139, August 2015
[43] Avinash Yadlapati, Kakarla Hari Kishore,“Constrained Level Validation of Serial Peripheral Interface Protocolâ€, Proceedings of the First International Conference on SCI 2016, Volume 1, Smart Computing and Informatics, Smart Innovation, Systems and Technologies 77, ISSN No: 2190-3018, ISBN: 978-981-10-5544-7, Chapter No: 77, pp. 743-753, 25th December 2017.
[44] P Kiran Kumar, P Prasad Rao, Kakarla Hari Kishore, “Optimal Design of Reversible Parity Preserving New Full Adder / Full Subtractorâ€, IEEE SPONSORED 3rd INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS 2016), pp. 3465-3470, 25th and 26th February 2016.
[45] Y Avinash, K Hari Kishore ‘’Designing Asynchronous FIFO for Low Power DFT Implementation’’ International Journal of Pure and Applied Mathematics, ISSN No: 1314-3395, Vol No: 115, Issue No: 8, Page No: 561-566, September 2017.
[46] Mahesh Mudavathand K Hari Kishore "Design of RF Front End CMOS Cascade CS Low Noise Amplifier on 65nm Technology Process†International Journal of Pure and Applied Mathematics, ISSN No: 1314-3395, Vol No: 115, Issue No: 7, Page No: 417-422, September 2017.
[47] P. Sahithi K Hari Kishore, E Raghuveera, P. Gopi Krishna “DESIGN OF VOLTAGE LEVEL SHIFTER FOR POWER-EFFICIENT APPLICATIONS USING 45nm TECHNOLOGY†International Journal of Engineering and Technology(UAE), ISSN No: 2227-524X, Vol No: 7, Issue No: 2.8, Page No: 103-108, March 2018.
[48] N Bala Dastagiri K Hari Kishore “A 14-bit 10kS/s Power Efficient 65nm SAR ADC for Cardiac Implantable Medical Devices†International Journal of Engineering and Technology(UAE), ISSN No: 2227-524X, Vol No: 7, Issue No: 2.8, Page No: 34-39, March 2018.
[49] S.V.Manikanthan and T.Padmapriya “Recent Trends In M2m Communications In 4g Networks And Evolution Towards 5gâ€, International Journal of Pure and Applied Mathematics, ISSN NO:1314-3395, Vol-115, Issue -8, Sep 2017.
[50] S.V. Manikanthan , T. Padmapriya “An enhanced distributed evolved node-b architecture in 5G tele-communications network†International Journal of Engineering & Technology (UAE), Vol 7 Issues No (2.8) (2018) 248-254.March2018.
[51] S.V. Manikanthan, T. Padmapriya, Relay Based Architecture For Energy Perceptive For Mobile Adhoc Networks, Advances and Applications in Mathematical Sciences, Volume 17, Issue 1, November 2017, Pages 165-179
-
Downloads
-
How to Cite
Nageswara Rao, S., Jaya Sankar, K., & D. Naidu, C. (2018). An Improved Bi-Level Thresholding Based Uncertainty Evaluation for Speech Enhancement in Non-Stationary Noises. International Journal of Engineering & Technology, 7(2.24), 436-443. https://doi.org/10.14419/ijet.v7i2.24.12130Received date: 2018-04-25
Accepted date: 2018-04-25
Published date: 2018-04-25