在現(xiàn)代通信系統(tǒng)中,電話語(yǔ)音的頻帶被限制在300 Hz~4 kHz的范圍內(nèi),帶來(lái)了語(yǔ)音可懂度和自然度的降低。為了在不增加額外成本的前提下提高語(yǔ)音的可懂度和自然度,進(jìn)行了電話語(yǔ)音頻帶擴(kuò)展的研究。提出了一種改進(jìn)的基于碼本映射的語(yǔ)音帶寬擴(kuò)展算法:在碼本映射的過(guò)程中,使用加權(quán)系數(shù)來(lái)得到映射碼本。客觀測(cè)試結(jié)果表明,用此算法得到的寬帶語(yǔ)音的譜失真度比用一般的碼本映射降低至少2%。主觀測(cè)試結(jié)果表明,用此算法得到的寬帶語(yǔ)音具有更好的可懂度和自然度。
Abstract:
In modern communication systems, the bandwidth of telephone speech is limited from 300Hz to 4 kHz, which reduces the intelligibility and naturalness of speech. Telephone speech bandwidth extension is researched to get wideband speech and to improve its intelligibility and naturalness, without increasing extra costs. This paper put forward an improved algorithm of speech bandwidth extension based on codebook mapping. In the process of codebook mapping, weighted coefficients were used to get mapping codebook. Objective tests show that spectral distortion of wideband speech obtained by this algorithm reduces at least 2%, comparing to conditional codebook mapping. Subjective tests show that the wideband speech obtained by this algorithm has better intelligibility and naturalness.
標(biāo)簽:
映射
帶寬
擴(kuò)展
語(yǔ)音
上傳時(shí)間:
2014-12-29
上傳用戶:15501536189