公共文化服务平台

中文耳语元音的声调特征研究被引量：3: 2011年; 声带准周期振动的缺失,使得汉语耳语音成为了一种特殊的发音模式,也使得耳语声调无法用基音周期表征。目前用于语音识别和声纹识别的常规语音特征,包含声调信息较少,所以在声调识别实验中很难获得良好的效果。本文提出一种新的特征参数来模拟正常语音的基频声调轨迹,即以人的听觉特性为出发点,研究人的声调敏感Bark频带,发现部分扩散Bark谱能量归一化比例拟合曲线,能够呈现出类似正常语音的基频轨迹,这说明在某些方面该轨迹或多或少包含了耳语音的声调信息。在以该轨迹和语音短时能量曲线为特征,以神经网络为模型的耳语声调识别实验中获得了较高的识别正确率,汉语四声的总体识别正确率高达78%,这也为对耳语音的进一步处理提供了很多有力依据。; 潘欣裕赵鹤鸣; 关键词：耳语音声调

Research of whispered speech vocal tract system conversion based on universal background model and effective Gaussian components被引量：1: 2013年; Directing to the weakness of the present fixed values mapping methods （method_F）, a vocal tract system conversion method based on the universal background model （UBM） is proposed for improving the performance of the speech conversion system from Chinese whis- pered speech to normal speech. For the numerous components of UBM, the errors produced by the acoustical probability density statistical model can＇t be ignored. Thus an effective Gaus- sian mixture components chosen method based on the posterior probability summation of the minimum spectral distortion is developed to optimizing the system performance. The proposed method （method_U） is analyzed and compared using the performance index （PI） based on Itakura-Saito spectral distortion measure. It is shown experimentally that the performance of method_U is more stability for different speakers and different phonemes than that of method_F. The average PI of method_U is better than method_F. It is shown that by selecting effective Gaussian mixture components, the PI of method_U can be further improved 5.11%. Subjective auditory tests also show that the proposed method can improve the definition and intelligibility of conversion speech.; CHEN XueqinZHAO Heming

Vocal Tract Modeling in 1D and 2D Digital Waveguide Methods: The digital waveguide method is a technique commonly used in the modeling of room acoustics and musical instru...; Yao JuHeming Zhao; 关键词：DOMAIN SIMULATION

基于约束方差的噪声谱估计算法被引量：2: 2012年; 为了进一步提高非平稳环境下噪声估计的准确性,提出了一种基于约束方差的噪声谱估计算法,通过约束方差计算得出平滑参数,对噪声功率谱进行估计。实验结果表明,相对于其他三种算法,该算法能较低时延地跟踪背景噪声的轨迹,且它的噪声谱估计均方误差较小,在非平稳噪声及噪声突变环境下尤为明显。; 周成燕周强顾济华赵鹤鸣陶智; 关键词：约束方差语音增强

利用经验模态分解的QRS综合波检测: 2010年; 在分析心电信号能量分布特点基础上,利用经验模态分解(EMD)方法的自适应数据分析特性提出了一种QRS综合波检测方案.采用完全数据驱动的EMD方法将心电信号分解成一组内蕴模式函数(IMFs).求取各内蕴模式函数的能量并归一化,根据能量分布的特点确定信号是否存在电平整体偏置、基线漂移、高大T波等影响QRS综合波检测的因素,并采取相应的抑制措施.然后以较简单的可调阈值进行判定,并采取一系列防错检、防漏检措施来准确定位QRS综合波.采用MIT-BIH Arrythmia Database对该方法进行了验证,平均正确检测率高达99.91%.结果表明,该方法是一种高效的QRS综合波检测算法.; 朱伟芳赵鹤鸣陈小平; 关键词：心电信号 QRS综合波经验模态分解

有效高斯分量通用背景模型下耳语音声道系统转换研究被引量：5: 2013年; 为了改善耳语音转换中声道系统的转换性能,针对定值转换方法在非特定人耳语音转换系统中效果不理想的情况,提出使用通用背景模型建立独立于说话人的声道系统转换模型。进一步针对在通用背景模型中由于较大分量数产生的声学概率密度统计模型的误差问题,提出基于最小谱失真度的后验概率和有效高斯分量选择方法优化特征矢量的转换性能。定义了板仓一斋田谱失真测度的性能指标对该模型进行分析比较,实验表明,基于通用背景模型的转换特征矢量平均谱失真度性能指标优于定值偏移方法,且稳定性明显好于定值偏移方法。通用背景模型基础上有效高斯分量选择方法可进一步将性能指标提高5.11%,主观听觉测试表明本文方法可改善转换语音的清晰度和准确度。; 陈雪勤赵鹤鸣; 关键词：耳语音声道高斯性能指标特征矢量

模型与特征混合补偿法及其在耳语说话人识别中的应用被引量：4: 2012年; 为了提高信道差异下短时耳语说话人的识别率,提出了一种在模型域和特征域进行混合补偿的方法。该方法首先在模型训练阶段以联合因子分析法为基础,通过估计训练语音的说话人空间和信道空间,提取出说话人因子,消除信道因子,其次在测试阶段,将测试语音的信道因子映射到特征空间,实施特征补偿,从而在模型和特征两方面去除信道信息,提高识别率。实验结果显示,在三种不同的信道训练环境下,混合补偿法都取得了相似的识别率,且新方法对短时耳语音的测试效果要优于联合因子分析法。; 顾晓江赵鹤鸣吕岗; 关键词：说话人识别特征域补偿法

Conversion from whispered speech to normal speech using the extended bilinear transformation method被引量：1: 2013年; A method of conversion from whispered speech to normal speech using the extended bilinear transformation was proposed. On account of the different deviation degrees of the whisper＇s formants in different frequency bands, the spectrum of the whispered speech will be processed in the separate partitions of this paper. On the basis of this spectrum, we will establish a conversion function able to usefully convert whispered speech to normal speech. Because of the whisper＇s non-linear offset in relation to normal speech, this paper introduces an expansion factor in the bilinear transform function making it correspond more closely to the actual conversion demands of whispered speech to normal speech. The introduction of this factor takes the non-linear move of the spectrum and the compression of the formant bandwidth into consideration, thus effectively reducing the spectrum distortion distance in the conversion. The experiment results show that the conversion presented in this paper effectively improves both the sound quality and the intelligibility of whispered speech.; TAO ZhiZHAO HemingTAN XuedanGU JihuaZHANG XiaojunWU Di; 关键词：LSP

Whispered speaker identification based on feature and model hybrid compensation被引量：1: 2012年; In order to increase short time whispered speaker recognition rate in variable chan- nel conditions, the hybrid compensation in model and feature domains was proposed. This method is based on joint factor analysis in training model stage. It extracts speaker factor and eliminates channel factor by estimating training speech speaker and channel spaces. Then in the test stage, the test speech channel factor is projected into feature space to engage in feature compensation, so it can remove channel information both in model and feature domains in order to improve recognition rate. The experiment result shows that the hybrid compensation can obtain the similar recognition rate in the three different training channel conditions and this method is more effective than joint factor analysis in the test of short whispered speech.; GU Xiaojiang ZHAO Heming Lu Gang

A method of whispered speech enhancement based on speech absence probability and modified mel-domain masking model: 2011年; Whispered speech enhancement using auditory masking model in modified Mel- domain and Speech Absence Probability （SAP） was proposed. In light of the phonation char- acteristic of whisper, we modify the Mel-frequency Scaling model. Whispered speech is filtered by the proposed model. Meanwhile, the value of masking threshold for each frequency band is dynamically determined by speech absence probability. Then whispered speech enhancement is conducted by adaptively rectifying the spectrum subtraction coefficients using different masking threshold values. Results of objective and subjective tests on the enhanced whispered signal show that compared with other methods; the proposed method can enhance whispered signal with better subjective auditory quality and less distortion by reducing the music noise and background noise under the masking threshold value.; TAO Zhi~(1,2) ZHAO Heming~2 WU Di~1 CHEN Daqing~1 ZHANG Xiaojun~1 (1 School of Physical Science and Technology,Soochow University Suzhou 215006) (2 School of Electronics and Information Engineering,Soochow University Suzhou 215006)

渝B2-20050021-1　渝公网安备 50019002500403号　违法和不良信息举报中心　互联网出版许可证　新出网证(渝)字10号

国家自然科学基金(61071215)

文献类型

领域

主题

机构

作者

传媒

年份

用户反馈

国家自然科学基金(61071215)

文献类型

领域

主题

机构

作者

传媒

年份

用户登录

用户反馈