龙艳花 副研究员

发布者:信机学院网站管理员发布时间:2017-05-08浏览次数:3098


  


龙艳花,博士
上海师范大学,电气信息系,副研究员
Email:yanhua@shnu.edu.cn
主要学科领域:信号与信息处理
研究方向:智能语音及语言信号处理,包括语音识别,声纹识别,语种识别等,统计建模,机器学习,数据挖掘。
教育背景:
2006.09- 2011.07:硕博连读,中国科学技术大学,电子工程与信息科学系,科大讯飞语音实验室。
2002.09-2006.07:本科,西南科技大学,信息工程学院,通信工程系,学士学位
工作经历:
2013.06-至今:上海师范大学,电气信息系,副研究员
2011.10-2013.05:英国剑桥大学,工程系,机器与智能实验室,语音研究组,博士后,导师:Phil Woodland教授,Mark Gales教授,研究课题:自然语音技术-Natural Speech Technology(NST)。
2009.10-2010.02:微软亚洲研究院,语音组,导师:Frank Soong(宋謌平),研究课题:基于谐波噪声模型(HNM)的高质量语音分析与合成。
2008.07-2009.01:新加坡南洋理工大学(NTU),以及研究所-A*STAR,I2R (Institute for Infocomm Research),NTU访问学生,导师:Chng Eng Siong教授,研究所实习生,Human Language Technology(HLT) Department,I2R,A*STAR,导师:Haizhou Li教授
科研项目:
1.基于深度学习的声纹识别方法研究,2014/07-2017/06,在研,主持。
2.语音识别中副语言信息的自动转写方法,上海市高校青年教师培养资助计划,2014/12-2017/06,在研,主持。
3.多语种混合语音识别开发,上海市联盟计划,在研,主持。
4.面向语音识别的副语言信息标注算法研究,上海师范大学理工科校级科研一般项目,2014/01-2015/12,结题。
获奖情况:
1.2008 NIST Speaker Recognition Evaluation (SRE),在核心测试任务中,作为关键技术人员及组长带领的团队获得EER、 minDCF 两项国际第一名,DCF第三名,综合成绩国际第一,该成果被国家自然科学基金委,中国科学院网站等 100 多家媒体报导。
2.2009 NIST Language Recognition Evaluation,团队在通用语种测试中各项指标综合排名国际第二;同时,在更具挑战性的8组方言对测试中,有6组方言对测试性能均远远超过了其他参赛单位,综合排名国际第一。
3.2010 NIST Speaker Recognition Evaluation,作为关键技术人员及组长带领的团队获得EER,minDCF,DCF指标综合成绩国际第二名。
专利情况:
1.采用声纹和语音识别进行个性化电视语音唤醒的方法,专利申请号:201410840544.9
2.一种文件夹加密方法,专利申请号:201410784456.1
3.声纹识别银行转账APP,软件著作权登记,登记号:015SR141402
4.基于Matlab的能量分类语音端点检测软件V1.0,软件著作权登记,登记号:2016SR074002
5.基于Matlab的双门限语音端点检测软件V1.0,软件著作权登记,登记号:2016SR074008
已发表的部分论文:
1.Yanhua Long, et.al. Domain Compensation Based on Phonetically Discriminative Features for Speaker Verification, Computer Speech & Language, 2017, (41): 161-179.
2.Haoran Wei, Yanhua Long, et.al. Improvements on self-adaptive voice activity detector for telephone data, International Journal of Speech Technology, 2016, 19(3):623-630.
3.龙艳花,倪继锋,叶宏. “基于深度神经网络的说话人信道自适应方法”,2016, 48(2): 151-155.
4.Yan-Hua Long, Hong Ye. Filled Pause Refinement Based on the Pronunciation Probability for Lecture Speech, PLos One, 10(4):2015, e0123466.doi: 10.1371/ journal.pone. 0123466.
5.Bo Li, Yanhua Long, Hong Ye. Outlier Detection and Cluster Center Initialization for K-means Algorithm, Journal of Computational Information Systems, 11(12): 2015, 4333–4342.
6.龙艳花,戴礼荣. “采用M-矢量和支持向量机的说话人确认系统”. 华中科技大学学报(自然科学版),2014,42(8):63-68.
7.Y. Long, M.J.F. Gales, P. Lanchantin, X. Liu, M.S. Seigel, P.C. Woodland. “Improving Lightly Supervised Training for Broadcast Transcription”. Interspeech, pp.2187-2191, 2013.
8.P. Lanchantin, P. Bell, M.J.F. Gales, T. Hain, X. Liu, Y. Long, J. Quinnell, S. Renals, O. Saz, M.S. Seigel, P. Swietojanski, P.C. Woodland. “Automatic transcription of multi-genre media archives”. SLAM, pp.26-31, 2013.
9.P. Bell, M. Gales, P. Lanchantin, X. Liu, Y. Long, S. Renals, P. Swietojanski, P.C. Woodland, “Transcription of Multi-Genre Media Archives Using Out-of-domain data”, SLT, pp.324-329, 2012.
10.Yanhua Long, Zhi-Jie Yan, Frank K Soong, et.al. “Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model”, pp.373-376, INTERSPEECH, 2011.
11.Yanhua Long, Zhi-Jie Yan, Frank K Soong, et.al. “Speaker Characterization using Spectral Subband Energy Ratio based on Harmonic Plus Noise Model”, pp.4520-4523, ICASSP, 2011.
12.Ying XU, Yan Song, Yan-Hua Long, et.al.” The Description of iFlyTek Speech Lab System for NIST2009 Language Recognition Evaluation”, pp.157-161, ISCSLP, 2010.
13.Yanhua Long, LiRong Dai, Er-yu Wang, et.al. “Non-negative matrix factorization based discriminative features for speaker verification”, pp.291-295, ISCSLP, 2010.
14.Wu Guo, Yanhua Long, Eryu Wang,er.al. “IFly speech lab 2010 speaker recognition evaluation system description”. NIST SRE2010, system description paper. (NIST SRE2010 Evaluation paper)
15.Yanhua Long, LiRong Dai, Bin Ma, Wu Guo. “Effects of the Phonological Relevance in Speaker Verification”, pp. 2130-2133, INTERSPEECH, 2010.
16.Wu Guo, Zhao Zhang, Yanhua Long, Lirong Dai. “N-gram Nearest Neighbor Algorithm for Voice Password System”, pp.4438-4441, ICASSP, 2010.
17.Yanhua Long, Bin Ma, Haizhou Li, et. al. “Exploiting Prosodic Information for Speaker Recognition”, pp. 4225- 4228, ICASSP, 2009.
18.Wu Guo, Yanhua Long, Yijie Li, et.al. “iFLY system for the NIST 2008 speaker recognition evaluation”, pp. 4209 – 4212, ICASSP, 2009.
19.Yanhua Long, Wu Guo, Bin Ma, et. al. “Subspace Construction and Selection for Speaker Recognition”, pp.1-4, ICICS, 2009.
20.Yanhua Long, Wu Guo, Lirong dai. “A PCA Method Based on Speaker Session Variability”, Journal of Pattern recognition and artificial intelligence, pp.270-274, No. 22, Issue 2, 2009.
21.Yanhua Long, Wu Guo, LiRong Dai. “To Balance Training Data for SVM Based Speaker Verification “, Journal of Chinese Information Processing, pp.76-80, No. 5, Issue 3, 2008. (Chinese core journals)
22.Yanhua Long, Wu Guo, LiRong Dai.” An SIPCA-WCCN Method for SVM-based Speaker Verification System”, pp.1295–1299, ICALIP, 2008
23.Yanhua Long, Wu Guo, LiRong Dai.” Interfusing the Confused Region Score of Speaker Verification Systems”, pp.1-4, ISCSLP2008.
24.Yanhua Long, Wu Guo, LiRong Dai.” Sequence Kernel for SVM based Speaker verification system”, Journal of Tsinghua University (Science and Technology), pp.688-692, Vol.48, No.S1, 2008.