教育背景
1993年、1996年 分别获南京大学理学学士、硕士学位
2001年 清华大学工学博士学位
工作履历
2001.07-2003.02 清华大学计算机系 助理研究员
2003.03-2022.12 中国科学院自动化研究所 历任副研究员、研究员、模式识别国家重点实验室副主任、学术委员会委员、学位委员会委员
2022.12-至今 清华大学自动化系 长聘教授
学术兼职
Board Member, International Speech Communication Association(ISCA )
Steering Committee Member, IEEE Transactions on Affective Computing
Associate Editor, Machine Intelligence Research (MIR)
Chairperson, ISCA SIG-CSLP
中国人工智能学会,常务理事、情感智能专委副主任
中国计算机学会,会士、常务理事、语音对话听觉专委副主任
中国图形图像学会,理事、人机交互专委主任
计算机研究与发展,编委
国家标准化委员会人工智能分委员会,副主任、智能感知集成工作组组长
研究领域
智能信息融合与处理、语音处理、情感计算、大数据分析
研究概况
国家杰出青年基金项目,多通道融合的言语分析与生成理论和方法研究,项目负责人,2015.01-2019.12
国家自然科学基金重点项目,连续状态空间个性化语音情感识别,项目负责人,2019.01-2023.12
国家自然科学基金重点项目,高性能人机对话建模技术研究,项目负责人,2022-2025
科技部国家重点研发计划,类脑听觉前端模型与系统研究,项目负责人,2021-2026
科技部国家重点研发计划,基于云计算的移动办公智能交互技术与系统,项目负责人,2018-2021
科技部 863项目,自然人机交互中口语产生新方法,负责人,2006-2008
科技部863项目,多方言的高表现力情感语音交互系统,项目负责人,2015-2017
中国科学院战略性先导科技专项C类项目,大数据分析关键技术及应用,项目负责人,2018 -2023
北京市科委,高精度、低延时的多通道AR虚实融合交互系统研发,项目负责人,2022-2023
国家发改委项目,音视频内容分析网关及服务器平台产业化,负责人,2013-2015
奖励与荣誉
2022年,第九届中国电子学会十佳优秀科技工作者
2021年,获得中国电子学会技术发明奖一等奖
2021年,中国科学院大学研究生优秀课程
2020年,中国科学院大学唐立新教学名师奖
2018年,获得中国电子学会科学技术奖一等奖
2018年,国务院政府特殊津贴人员
2017年,国家万人计划科技创新领军人才
2015年,国家杰出青年科学基金
2014年,获得北京市科学技术奖二等奖
学术成果
主要论著代表
[1] Jianhua Tao, Tieniu Tan (Eds),Affective Information Processing,Springer,2008
[2] Keikichi Hirose,Jianhua Tao,Speech Prosody in Speech Synthesis:Modeling and generation of prosody for high quality and flexible speech synthesis,Springer,2015
[3] Pengpeng Shao, Guohua Yang, Dawei Zhang, Jianhua Tao, Feihu Che, Tong Liu,Tucker decomposition-based Temporal Knowledge Graph Completion,Knowledge-Based Systems,Volume 238,2022
[4] Xiao Sun,Jingyuan Li,and Jianhua Tao,Emotional Conversation Generation Orientated Syntactically Constrained Bidirectional-Asynchronous Framework,IEEE TRANSACTIONS ON AFFECTIVE COMPUTING,VOL.13,NO. 1,2022
[5] Wang,Tao,Fu,Ruibo,Yi,Jiangyan,Tao,Jianhua,Wen,Zhengqi,NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation, IEEE/ACM Transactions on Audio, Speech, and Language Processing ,30:865-878,2022
[6] Tao Wang,Jiangyan Yi,Ruibo Fu,Jianhua Tao, Zhengqi Wen,CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing, IEEE/ACM Transactions on Audio, Speech, and Language Processing,Volume30,2022
[7] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Shuai Zhang; Zhengqi Wen, Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition, IEEE Signal Processing Letters,Volume29,2022
[8] Zheng Lian, Bin Liu, Jianhua Tao, PIRNet: Personality-enhanced Iterative Refinement Network for Emotion Recognition in Conversation, IEEE Transactions on Neural Networks and Learning Systems,2022
[9] Zheng Lian, Bin Liu, Jianhua Tao,SMIN: Semi-supervised Multi-modal Interaction Network for Conversational Emotion Recognition, IEEE Transactions on Affective Computing,DOI:10.1109,2022
[10] Ke Xu, Bin Liu, Jianhua Tao, Zhao Lv, Cunhang Fan, Leichao Song,AHRNN: Attention‐Based Hybrid Robust Neural Network for emotion recognition,Cognitive Computation and Systems,4(1):85-95,2022
[11] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen and Shuai Zhang,Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data,IEEE/ACM Transactions on Audio, Speech, and Language Processing,Vol 29,1340-1351,2021
[12] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen and Shuai Zhang, Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT, IEEE/ACM Transactions on Audio, Speech, and Language Processing,Vol 29,1897-1911,2022
[13] Feihu Che, Jianhua Tao, Guohua Yang, Tong Liu, Dawei Zhang,Multi-aspect self-supervised learning for heterogeneous information network,Knowledge-Based Systems,Volume 233,2021
[14] Yongwei Li, Jianhua Tao, Donna Erickson, Bin Liu, Masato Akagi,F0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model,IEEE/ACM Transactions on Audio, Speech, and Language Processing,Vol.29, 3375-3383.2021
[15] Zheng Lian, Bin Liu, Jianhua Tao, CTNet: Conversational Transformer Network for Emotion Recognition, IEEE/ACM Transactions on Audio, Speech, and Language Processing,Vol 29,985-1000,2021
[16] Zheng Lian, Bin Liu, Jianhua Tao, DECN: Dialogical Emotion Correction Network for Conversational Emotion Recognition, Neurocomputing,Vol 454,483-495,2021
[17] Mingyue Niu, Bin Liu, Jianhua Tao, Qifei Li,A time-frequency channel attention and vectorization network for automatic depression level prediction, Neurocomputing, Vol 450,208-218,2021
[18] Feihu Che, Guohua Yang, Dawei Zhang, Jianhua Tao, Tong Liu,Self-supervised graph representation learning via bootstrapping, Neurocomputing, Vol 456,88-96,2021
[19] Fan Cunhang; Yi Jiangyan; Tao Jianhua; Tian Zhengkun; Liu Bin; Wen Zhengqi,Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition, IEEE/ACM Transactions on Audio, Speech, and Language Processing,Vol 29, 198-209,2021
[20] Xiao Sun,Jia Li,Xing Wei, Changliang Li , Jianhua Tao ,Emotional Conversation Generation Based on a Bayesian Deep Neural Network,ACM TRANSACTIONS ON INFORMATION SYSTEMS,VOL.38,NO.1,2020
[21] Xiao Sun , Jia Li, Xing Wei , Changliang Li, Jianhua Tao,Emotional editing constraint conversation content generation based on reinforcement learning, Information Fusion,VOL.56,70-80,2020
[22] Ziping Zhao,Zhongtian Bao,Zixing Zhang,Nicholas Cummins,Haishuai Wang,Jianhua Tao, Björn Schuller,Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders, IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING,VOL.14, NO.2,2020
[23] Fan, Cunhang; Tao, Jianhua; Liu, Bin; Yi, Jiangyan; Wen, Zhengqi; Liu, Xuefei,End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING,VOL.28, 1303-1314,2020
[24] Bocheng Zhao, Jianhua Tao, Minghao Yang, Zhengkun Tian,Cunhang Fan,Ye Bai,Deep imitator: handwriting calligraphy imitation via deep attention networks, Pattern Recognition,VOL.104,107080-107080,2020
[25]Zheng Lian, Ya Li, Jianhua Tao, Jian Huang ,Mingyue Niu,Expression Analysis Based on Face Regions in Read-world Conditions, International Journal of Automation and Computing,Vol 17,96-107,2020
[26]Yibin Zheng , Jianhua Tao, Zhengqi Wen, Jiangyan Yi ,Forward–Backward Decoding Sequence for Regularizing End-to-End TTS, IEEE/ACM Trans. Audio, Speech & Language Processing,Vol 12,2067-2079,2019
[27] Jianhua Tao, Jian Huang, Ya Li, Zheng Lian, Mingyue Niu, Semi-supervised Ladder Networks for Speech Emotion Recognition, International Journal of Automation and Computing,Vol.16 No.4, 437-448,2019
[28] Jiangyan Yi,Jianhua Tao,Zhengqi Wen, Ye Bai, Language-Adversarial Transfer Learning for Low-Resource Speech Recognition, IEEE/ACM Trans. Audio, Speech & Language Processing,Vol 3(27), 621-630,2019
[29] Ya Li, Jianhua Tao, Wei Lai, Xiaoying Xu,Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis, Speech Communication,Volume 89, PP:92-102,2017
[30] Ya Li, Jianhua Tao, Linlin Chao,Wei Bao, Yazhu Liu, CHEAVD: a Chinese natural emotional audio–visual database, Journal of Ambient Intelligence and Humanized Computing,Vol 8(6),913-924,2016
[31] Su-Jing Wang,Wen-Jing Yan,Xiaobai Li,Guoying Zhao,Chun-Guang Zhou,Xiaolan Fu,Minghao Yang,Jianhua Ta, Micro-Expression Recognition Using Color Spaces,IEEE Transactions on Image Processing,Vol.24, No.12, 6034-6047,2015
[32] Wang Xiaoyan,Yang Minghao,Xia Ming,Zhan Yongsong,Shi Lihui,Ma,Chuanyan Tao,Jianhua,Chen Shengyong, Fast unsupervised texture segmentation using Texel similarity map, Journal Of Modern Optics,Vol. 62, 1211-1222,2015
[33] Minghao Yang,Jianhua Tao,Linlin Chao,Hao Li,Dawei Zhang,Hao Che,Tingli Gao,Bin Liu, User behavior fusion in dialog management with multi-modal history cues,Multimedia Tools and Applications,Volume 74, Issue 22, 10025-10051,2015
[34] J Tao,K Hirose,K Tokuda,AW Black,Introduction to the Issue on Statistical Parametric Speech Synthesis,IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 8 (2) :170-172,2014
[35] Zhengqi Wen,Jianhua Tao,Shifeng Pan,Yang Wang,Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis, Journal of Signal Processing Systems,Vol74,423-435,2014
[36] Minghao Yang, Jianhua Tao, Kaihui Mu, Ya Li, Jianfeng Che,A Multimodal Approach of Generating 3D Human-like Talking Agent,Journal on Multimodal User Interfaces,Vol.5(1-2),61-68,2012
[37] Jianhua Tao, Shifeng Pan, Minghao Yang, Ya Li, Kaihui Mu and Jianfeng Che,Utterance independent bimodal emotion recognition in spontaneous communication,Journal on Advances in Signal Processing,Vol.4,1-11,2011
[38] Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang, Supervisory Data Alignment for Text-independent Voice Conversion, IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, No. 5, 932-943,2010
[39] Jianhua Tao, Le Xin, Panrong Yin, Realistic Visual Speech Synthesis based on Hybrid Concatenation Method, IEEE Transactions on Audio, Speech and Language Processing, Vol. 17, No. 3, 469-477,2009
[40] Jianhua Tao, Yongguo Kang, Aijun Li,Prosody conversion from neutral speech to emotional speech, IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14, No. 4,1145-1154,2006
[41] Ren-Hua Wang, Sin-Horng Chen, Jianhua Tao, Min Chu,MANDARIN TEXT-TO-SPEECH SYNTHESIS,Advanced Chinese Spoken Language Processing,Vol.5,2006