• ISSN 0258-2724
  • CN 51-1277/U
  • EI Compendex
  • Scopus
  • Indexed by Core Journals of China, Chinese S&T Journal Citation Reports
  • Chinese S&T Journal Citation Reports
  • Chinese Science Citation Database
Volume 26 Issue 4
Aug.  2013
Turn off MathJax
Article Contents
TAN Xiaoheng, XU Ke, QIN Jiwei. Objective Evaluation Method of Speech Quality Based on Auditory Perceptual Properties[J]. Journal of Southwest Jiaotong University, 2013, 26(4): 756-760. doi: 10.3969/j.issn.0258-2724.2013.04.025
Citation: TAN Xiaoheng, XU Ke, QIN Jiwei. Objective Evaluation Method of Speech Quality Based on Auditory Perceptual Properties[J]. Journal of Southwest Jiaotong University, 2013, 26(4): 756-760. doi: 10.3969/j.issn.0258-2724.2013.04.025

Objective Evaluation Method of Speech Quality Based on Auditory Perceptual Properties

doi: 10.3969/j.issn.0258-2724.2013.04.025
  • Received Date: 11 Apr 2012
  • Publish Date: 25 Aug 2013
  • Based on Mel-frequency cepstral coefficients (MFCC), Mel-cepstral distance measure (Mel-CD) algorithm used for the objective evaluation of speech quality was analyzed. According to the theory of psychoacoustics, a human auditory model proposed by Johannesma and nonlinear compression were applied to extracting MFCC. Gammatone filter bank was used to simulate the basilar membrane. Mel-cepstral gammatone filter bank distance measure (Mel-GD) based on the improved MFCC was proposed, which was more in accordance with the auditory perceptual properties. Performance testing results showed that the proposed algorithm compared favorably with the Mel-CD in time complexity, the correlation degree between objective evaluation and subjective evaluation was improved by 4.9%, and estimation bias was decreased by 45.5%.

     

  • loading
  • 陈国,胡修林,张蕴玉,等. 语音质量客观评价方法研究进展[J]. 电子学报,2001,29(4): 1-5. CHEN Guo, HU Xiulin, ZHANG Yunyu, et al. Research advance on objective measures of speech quality[J]. Acta Electroncia Sinica, 2001, 29(4): 1-5.
    李薇,胡智奇,尚秋峰,等. 语音质量客观评价方法的研究[J]. 电力系统通信,2009,30(198): 64-67,71. LI Wei, HU Zhiqi, SHANG Qiufeng, et al. Research on objective evaluation of speech quality[J]. Telecommunications for Electric Power System, 2009, 30(198): 64-67, 71.
    Telecommunication Standardization Sector of ITU. ITU-T Recommendation P.830 Subjective performance assessment of telephone-band and wideband digital codecs[S]. Geneva: International Telecommunication Union, 1996.
    Telecommunication Standardization Sector of ITU. ITU-T Recommendation P.862 Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs[S]. Geneva: International Telecommunication Union, 2001.
    KUBICHEK R. Mel-cepstral distance measure for objective speech quality assessment[C]//Proceedings of IEEE Pacific Rim Conference on Communications, Computer and Signal Processing. Piscataway: IEEE Press, 1993: 125-128.
    DAVIS S B, MERMELSTEIN P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences[J]. IEEE Trans. on Acoustics, Speech and Signal Processing,1980, 28(4): 357-366.
    陈华伟,靳蕃. 基于感知模型的美尔谱失真测度[J]. 西南交通大学学报,2006,41(6): 723-728. CHEN Huawei, JIN Fan. Mel-spectral distortion measure based on perception model for objective speech quality assessment[J]. Journal of Southwest Jiaotong University, 2006, 41(6): 723-728.
    张军,张德运,傅鹏. 一种改进的心理声学语音质量客观评价算法[J]. 微电子学与计算机,2007,24(3): 203-206. ZHANG Jun, ZHANG Deyun, FU Peng. An improved psychoacoustics speech quality evaluation algorithm[J]. Microelectronics & Computer, 2007, 24(3): 203-206.
    陈明义,孙冬梅,何孝月. 基于改进MFCC语音特征参数的语音质量评估的研究[J].电路与系统学报,2009,14(3): 111-116. CHEN Mingyi, SUN Dongmei, HE Xiaoyue. Study on speech quality evaluation based on improved MFCC[J]. Journal of Circuits and Systems, 2009, 14(3): 111-116.
    邓宗元,杨震. 一种改进的语音质量客观评价参数[J]. 南京邮电大学学报:自然科学版,2008,28(2): 14-18. DENG Zongyuan, YANG Zhen. An improved object measure of speech quality[J]. Journal of Nanjing University of Posts and Telecommunications: Natural Science, 2008, 28(2): 14-18.
    梁超. 一种基于Gammatone滤波的语音质量评价算法[J].长春工业大学学报:自然科学版,2010,31(4): 432-436. LIANG Chao. An algorithm for objective speech quality assessment based on Gammatone filter[J]. Journal of Changchun University of Technology: Natural Science Edition, 2010, 31(4): 432-436.
    JOHANNESMA P I M. The pre-response stimulus ensemble of neurons in the cochlear nucleus[C]//Proceedings of the Symposium on Hearing Theory. Eindhoven: IPO, 1972: 58-69.
    陈世雄,宫琴,金慧君. 用Gammatone滤波器组仿真人耳基底膜的特性[J]. 清华大学学报:自然科学版,2008,48(6): 1044-1048. CHEN Shixiong, GONG Qin, JIN Huijun. Gammatone filter bank to simulate the characteristics of the human basilar membrane[J]. Journal of Tsinghua University: Science and Technology, 2008, 48(6): 1044-1048.
    李云鸿,胡修林,张蕴玉. 基于人耳听觉模型的语音质量客观评价方法[J]. 华中理工大学学报,2000,28(5): 63-65. LI Yunhong, HU Xiulin, ZHANG Yunyu. Objective evaluation method of speech quality based on human auditory model[J]. Journal of Huazhong University of Science and Technology, 2000, 28(5): 63-65.
    王炜,刘峰,吴淑珍. RASTA滤波在语音通信质量客观评价中应用的研究[J]. 北京大学学报:自然科学版,2003,39(5): 697-702. WANG Wei, LIU Feng, WU Shuzhen. A study for the application of RASTA on objective communication speech quality evaluation[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2003, 39(5): 697-702.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索
    Article views(988) PDF downloads(473) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return