A new method for estimating gain factors in amplitude panning system is proposed. The method is based on particle ve- locity and balanced sound energy formulation. A scale factor is employed in amplitude panning system and thus, an overdeter- mined system of equation is derived in particle velocity equation. To obtain the analytic solution of the overdetermined equation, the sound energy identical formula is considered and then the unique gain factors are estimated. The proposed method is able to repro- duce sound source direction and control the distance perception in a flexible twoor three-dimension loudspeaker setup. Subjective evaluations show that the proposed technique in an aspheric loudspeaker setup maintains the sound direction and controls the distance perception at the listening point.
针对小样本环境下音频信号分类精度急需提高的问题,首先提出自适应梅尔滤波算法提取具有更高区分度的梅尔谱图,再提出循环残差结构并结合迁移和微调构建循环残差网络频谱分类器,融合自适应梅尔滤波算法和循环残差网络频谱分类器生成一种主要用于小样本环境的音频信号分类模型。以ESC-50、music speech、Free ST Chinese Mandarin Corpus(FSCMC)为源数据集模拟四个不同属性的小样本环境。仿真显示在各小样本环境下生成模型的分类精度与MF-VGG16、10 layers CNN、CRBM等模型相比均有一定程度的提高,且精度曲线更平滑,性能更稳定。
丢包现象严重影响VoIP的通话语音质量。WSOLA(Waveform Similarity Based Overlap-Add)算法是一种基于接收端的丢包隐藏方法,可以较好地提高语音质量。在介绍WSOLA算法原理的基础上,针对该算法中计算互相关系数所需计算量较大,会增加过多计算延时的问题,提出一种互相关系数计算的改进方法。最后通过仿真对重建语音信号质量进行了对比。