Faculty


Jiangyan Yi (易江燕)

Associate Researcher
Institute of Brain and Cognitive Science


Education


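September 2015 to June 2018, Institute of Automation, Chinese Academy of Sciences, Pattern Recognition and Intelligent Systems, Ph.D.

September 2007 to July 2010, Graduate School of Chinese Academy of Social Sciences, Computational Linguistics, master's degree

September 2003 to July 2007, School of Computer Science and Information Technology, Yunnan Normal University, Computer Science and Technology, bachelor's degree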



Work Experience


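November 2024 to present, Department of Automation, Tsinghua University, Associate Researcher

October 2020 to October 2024, Institute of Automation, Chinese Academy of Sciences, Associate Researcher, master's and doctoral supervisor

July 2018 to October 2020, Institute of Automation, Chinese Academy of Sciences, Assistant Researcher

September 2011 to November 2014, Alibaba (China) Network Technology Co., Ltd., Senior Algorithm Engineer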


Academic Service


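November 2024 to present, the EAAI journal, Board Editor

August 2024 to present, IEEE Signal Processing Society Speech and Language Processing Technical Committee (SLTC), Member

November 2020 to present, Asia-Pacific Signal and Information Processing Association Speech, Language, and Audio Technical Committee (SLATC), Member

October 2018 to present, China Computer Federation (CCF) Technical Committee on Speech Dialogue and Auditory Processing, Executive Committee member and Standing Committee member

August 2019 to present, Standing Committee of the National Conference on Man-Machine Speech Communication (NCMMSC), Member

Area Chair for the leading international speech conferences ICASSP 2024 and Interspeech 2020/2022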



Research Areas


Speech information processing; personalized speech synthesis and detection


Research Overview


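Jiangyan Yi has long conducted systematic, in-depth research on the problem of insufficient robustness in speech modeling technologies such as personalized speech simulation and speech information security. Yi has led more than 10 projects, including grants from the National Natural Science Foundation of China, major projects of the Ministry of Science and Technology, and international cooperation projects. Yi has published more than 80 papers in leading international journals and conferences, including IEEE TASLP, AI, PR, ICML, AAAI, ACM MM, and ICASSP, and holds more than 60 granted invention patents (including 9 US invention patents). The research results have been applied by national agencies in public security and cyberspace administration and by numerous companies such as Huawei and China Telecom. The work received the Special Prize of the Wu Wenjun Artificial Intelligence Technology Invention Award from the Chinese Association for Artificial Intelligence in 2022, the First Prize of the Beijing Invention Patent Award in 2023, and the First Prize of the China Association of Inventions Achievement Award in 2024.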


Awards and Honors


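2023, Recipient of the National Science Fund for Excellent Young Scholars

2022, Special Prize, Wu Wenjun Artificial Intelligence Science and Technology Invention Award, Chinese Association for Artificial Intelligence

2024, First Prize, Achievement Award, China Association of Inventions

2023, First Prize, Beijing Invention Patent Award

2021, First place, ICASSP 2021 international multi-speaker voice cloning challenge

2019, Best Paper, 19th National Conference on Signal Processing

2019, Best Student Paper, 13th National Conference on Man-Machine Speech Communication

2018, Intel AIDC Beijing Best Poster Award

2024, Outstanding Communist Party Member, Institute of Automation, Chinese Academy of Sciences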


Academic Achievements


Representative Publications

[1] Jiangyan Yi, Chenglong Wang, Jianhua Tao, Chu Yuan Zhang et al.: SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection. Pattern Recognition (2024)

[2] Jiangyan Yi, Jianhua Tao, Ruibo Fu, Tao Wang, Chu Yuan Zhang, Chenglong Wang: Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction with Multi-Modal Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2963-2973 (2023)

[3] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ye Bai: Language-Adversarial Transfer Learning for Low-Resource Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 621-630 (2019)

[4] Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Cunhang Fan: Transfer knowledge for punctuation prediction via adversarial training. Speech Communication. 149: 1-10 (2023)

[5] Jiangyan Yi, Zhengqi Wen, Jianhua Tao, Hao Ni, Bin Liu: CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition. J. Signal Process. Syst. 90(7): 985-997 (2018)

[6] Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li: ADD 2022: the First Audio Deep Synthesis Detection Challenge. ICASSP 2022: 9216-9220

[7] Jiangyan Yi, Ye Bai, Jianhua Tao, Haoxin Ma, Zhengkun Tian, Chenglong Wang, Tao Wang, Ruibo Fu: Half-Truth: A Partially Fake Audio Detection Dataset. INTERSPEECH 2021: 1654-1658

[8] Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Ye Bai, Cunhang Fan: Focal Loss for Punctuation Prediction. INTERSPEECH 2020: 721-725

[9] Jiangyan Yi, Jianhua Tao, Ye Bai: Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition. ICASSP 2019: 6071-6075

[10] Jiangyan Yi, Jianhua Tao: Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings. ICASSP 2019: 7270-7274

[11] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ye Bai: Adversarial Multilingual Training for Low-Resource Speech Recognition. ICASSP 2018: 4899-4903

[12] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ya Li: Distilling Knowledge from an Ensemble of Models for Punctuation Prediction. INTERSPEECH 2017: 2779-2783

[13] Tao Wang, Jiangyan Yi*, Ruibo Fu, Jianhua Tao, Zhengqi Wen, Chu Yuan Zhang: Emotion selectable end-to-end text-based speech editing. Artificial Intelligence. 329 (2024)

[14] Cunhang Fan, Jiangyan Yi*, Jianhua Tao, Zhengkun Tian, Bin Liu, Zhengqi Wen: Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 198-209 (2021)

[15] Ye Bai, Jiangyan Yi*, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Shuai Zhang: Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1340-1351 (2021)

[16] Ye Bai, Jiangyan Yi*, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang: Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1897-1911 (2021)

[17] Tao Wang, Jiangyan Yi*, Ruibo Fu, Jianhua Tao, Zhengqi Wen: CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2241-2254 (2022)

[18] Tao Wang, Ruibo Fu, Jiangyan Yi*, Jianhua Tao, Zhengqi Wen: NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation. IEEE ACM Trans. Audio Speech Lang. Process. 30: 865-878 (2022)

[19] Zhengkun Tian, Jiangyan Yi*, Jianhua Tao, Shuai Zhang, Zhengqi Wen: Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition. IEEE Signal Processing Letters. 29: 762-766 (2022)

[20] Xiaohui Zhang, Jiangyan Yi*, Jianhua Tao, Chenglong Wang, Chu Yuan Zhang: Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection. ICML 2023: 41819-41831

[21] Xiaohui Zhang, Jiangyan Yi*, Chenglong Wang, Chu Yuan Zhang, Siding Zeng, Jianhua Tao: What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection. AAAI 2024: 19569-19577

[22] Hao Gu, Jiangyan Yi*, Chenglong Wang, Yong Ren, Jianhua Tao, Xinrui Yan, Yujie Chen, Xiaohui Zhang: Utilizing Speaker Profiles for Impersonation Audio Detection. ACM Multimedia 2024: 1961-1970