公共文化服务平台

基于曲线波隐马尔可夫模型的人脸检测被引量：3: 2011年; 提出了一种基于曲线波隐马尔可夫模型的混合人脸检测算法。曲线波变换是一种多尺度几何变换,具有很好的方向性,能用极少的非零系数精确表示图像的边缘,是一种最稀疏的表示方法。根据隐马尔可夫模型对人脸拓扑结构的约束,采用3状态的隐马尔可夫模型进行从粗到细的人脸检测。实验结果表明,这种算法具有较好的检测速度与正确率及鲁棒性。; 王吉林叶建隆赵力邹采荣; 关键词：人脸检测

Perceptual video coding method based on JND and AR model被引量：1: 2010年; In order to achieve better perceptual coding quality while using fewer bits, a novel perceptual video coding method based on the just-noticeable-distortion （JND） model and the auto-regressive （AR） model is explored. First, a new texture segmentation method exploiting the JND profile is devised to detect and classify texture regions in video scenes. In this step, a spatial-temporal JND model is proposed and the JND energy of every micro-block unit is computed and compared with the threshold. Secondly, in order to effectively remove temporal redundancies while preserving high visual quality, an AR model is applied to synthesize the texture regions. All the parameters of the AR model are obtained by the least-squares method and each pixel in the texture region is generated as a linear combination of pixels taken from the closest forward and backward reference frames. Finally, the proposed method is compared with the H.264/AVC video coding system to demonstrate the performance. Various sequences with different types of texture regions are used in the experiment and the results show that the proposed method can reduce the bit-rate by 15% to 58% while maintaining good perceptual quality.; 王翀赵力邹采荣

Efficient fundamental frequency transformation for voice conversion: 2012年; In order to improve the performance of voice conversion, the fundamental frequency （F0） transformation methods are investigated, and an efficient F0 transformation algorithm is proposed. First, unlike the traditional linear transformation methods, the relationships between F0s and spectral parameters are explored. In each component of the Gaussian mixture model （GMM）, the F0s are predicted from the converted spectral parameters using the support vector regression （SVR） method. Then, in order to reduce the over- smoothing caused by the statistical average of the GMM, a mixed transformation method combining SVR with the traditional mean-variance linear （MVL） conversion is presented. Meanwhile, the adaptive median filter, prevalent in image processing, is adopted to solve the discontinuity problem caused by the frame-wise transformation. Objective and subjective experiments are carried out to evaluate the performance of the proposed method. The results demonstrate that the proposed method outperforms the traditional F0 transformation methods in terms of the similarity and the quality.; 宋鹏金赟包永强赵力邹采荣

一种基于二代曲波系数乘积的图象去噪方法: 2011年; 曲线波变换是一种多尺度变换,对于具有光滑曲线奇异性的目标函数,曲线波提供了稳定的、高效的和近于最优的表示.在第二代曲线波的基础上,利用曲线波分解中不同尺度的系数也具有相同的特点,提出了基于第二代曲波的系数乘积去噪算法.实验结果表明,提出的算法明显优于小波图像去噪方法,也优于曲线波的阈值方法.; 卞金洪周锋赵力; 关键词：小波多尺度图像去噪

Emotional speaker recognition based on prosody transformation被引量：1: 2011年; A novel emotional speaker recognition system （ESRS） is proposed to compensate for emotion variability. First, the emotion recognition is adopted as a pre-processing part to classify the neutral and emotional speech. Then, the recognized emotion speech is adjusted by prosody modification. Different methods including Gaussian normalization, the Gaussian mixture model （GMM） and support vector regression （SVR） are adopted to define the mapping rules of F0s between emotional and neutral speech, and the average linear ratio is used for the duration modification. Finally, the modified emotional speech is employed for the speaker recognition. The experimental results show that the proposed ESRS can significantly improve the performance of emotional speaker recognition, and the identification rate （IR） is higher than that of the traditional recognition system. The emotional speech with F0 and duration modifications is closer to the neutral one.; 宋鹏赵力邹采荣

一种改进的基于自适应时频分解的实值离散Gabor变换算法: 本文将自适应时频变换思想应用到实值离散Gabor变换中,提出了一种改进的基于自适应时频表示的实值离散Gabor变换算法,在该算法中无须计算双正交窗函数,从而大大扩展了实值离散Gabor变换算法的应用范围。实验结果表明,本...; 周健陶亮; 关键词：实值离散GABOR变换信号重建

认知水声通信中的分布式压缩频谱感知算法(英文)被引量：2: 2012年; 在认知水声通信中,频谱感知是动态频谱接入和动态频谱共享的基础.相比于陆地环境,水下环境变化剧烈:如严重的频率选择性衰落、低的声波传播速度和多径效应等.因此,许多可用于认知无线电的频谱感知算法不能直接用于认知水声通信.除此之外,水下用户或节点均用电池供电,而基于融合中心(融合中心可能与感知用户相隔很远)的频谱感知算法需要将各个感知用户的感知数据传送到融合中心,由于功率受限并且计算资源有限,该方法几乎是不可行的.类似于无线通信系统,水声通信系统中的频谱使用率也很低,这使得水声通信信号在频域是稀疏的.研究结果表明,压缩感知算法可以有效的恢复稀疏信号.基于此,为了克服前述困难,本文提出了分布式压缩频谱感知算法.在该算法中,多个认知用户通过协作的方式获得空间分集增益来克服水声信道的严重衰落,并利用联合稀疏性来增强恢复稀疏信号的能力.通过分布式计算,该算法将协作频谱感知转化为去中心的局部优化问题,对于每个感知用户而言,只需要与其相邻的感知用户进行数据交互,这大大减少了每个感知用户的计算量和传输数据所需的功率消耗.本文对所提出的算法进行了仿真,并与其他算法进行了比较.实验结果证明了本算法在认知水声通信中检测频谱的有效性.; 左加阔包永强赵力邹采荣陶文凤; 关键词：频谱感知压缩感知分布式计算

织物图像增强技术的研究被引量：2: 2011年; 对具有不同特性的织物数字图像利用二维离散傅立叶变换进行图像增强和图像复原等操作,能有效地改善图像的质量,突出所需要的细节,为织物密度的自动测量提供最佳质量的图像。研究了二维离散快速傅立叶变换算法,以及利用该算法在频率域中进行图像增强和图像复原的新方法。实验结果分析表明,利用提出的方法可以较好地改善织物图像的质量。; 张素贞叶建隆邹采荣; 关键词：图像增强图像复原傅里叶变换频率域

基于图像亚像素处理的电缆护套厚度精确测量被引量：4: 2011年; 基于图像亚像素定位的精密测量系统是一个很有应用前景的研究方向。文中着重分析了影响测量系统精度的几大因素:彩色图像亮度信息的完整采集、图像梯度值计算精度、多类型边缘亚像素重定位精度。然后提出了综合应用主轴分析、多尺度梯度、通用亚像素重定位等方法,设计出适合做电缆护套厚度检测的简单有效算法流程。; 周锋卞金洪赵力

Whisper intelligibility enhancement based on noise robust feature and SVM被引量：2: 2012年; A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine （SVM） classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients （GFCCs） extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio （SNR） environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech.; 周健赵力梁瑞宇方贤勇

渝B2-20050021-1　渝公网安备 50019002500403号　违法和不良信息举报中心　互联网出版许可证　新出网证(渝)字10号

国家自然科学基金(60975017)

文献类型

领域

主题

机构

作者

传媒

年份

用户反馈

国家自然科学基金(60975017)

文献类型

领域

主题

机构

作者

传媒

年份

用户登录

用户反馈