2000年9月–2004年7月 西北工业大学,航海工程学院,学士
2004年9月–2007年3月 北京邮电大学,自动化系,硕士
2007年9月–2012年1月 清华大学,自动化系,博士
2012年2月–2015年5月 航天五院,北京控制工程研究所,主管设计师;
2015年6月–2018年7月 清华大学,深圳研究生院信息学部,博士后/助理研究员;
2018年8月–2023年1月 清华大学,自动化系,助理研究员;
2023年1月–至今 清华大学,自动化系,副研究员
担任NeurIPS, ICLR, IJCAI, AAMAS等国际学术会议和MSSP, RAL, AST等国际期刊审稿人
人工智能学会智能决策专委会(筹) 秘书长
智能无人系统建模仿真专委会 委员
强化学习、多智能体协同决策、智能控制
1. 国家科技部科技创新2030“脑科学与类脑研究”重大项目:面向类脑芯片的深度增强学习方法,2022.01-2027.12,子课题负责人
2. 国家实验室课题,智能博弈对抗策略的数字建模工具协作研究与开发,2022.1-2022.12,项目负责人
3. 国防创新特区重点项目,2020.6-2022.11,项目负责人
4. 航天二院横向课题,2020.6-2021.6,项目负责人
5. 国防创新特区重点项目,2019.12-2022.6,项目负责人
6. 国防创新特区重点项目,2019.08-2020.7,项目负责人
军队科技进步一等奖
论文:
[1] Kailin Zeng, QiYuan Zhang, Bin Chen, Bin Liang, and Jun Yang*. APD: Learning Diverse Behaviors for Reinforcement Learning Through Unsupervised Active Pre-Training, IEEE Robotics and Automation Letters, 2022.
[2] Shu Leng, Xianglong Li, Meng Yu, Jun Yang*, Bin Liang. Flexible online planning based residual space object de-spinning for dual-arm space-borne maintenance, Aerospace Science and Technology, 2022, 130:1-13.
[3] Xiaoteng Ma, Yiqin Yang&, Hao Hu, Qihan Liu, Jun Yang*, Chongjie Zhang, Qianchuan Zhao, Bin Liang. Offline Reinforcement Learning with Value-based Episodic Memory, Tenth International Conference on Learning Representations (ICLR), 2022.
[4] Jun Yang, Bin Chen, Yanan Wang, Chunzhu Wang. Crack detection in carbide anvil using acoustic signal and deep learning with particle swarm optimization, Measurement, 2021.
[5] Duo Wang, Ming Zhang, Yuchun Xu, Weining Lu, Jun Yang*, Tao Zhang, Metric-based Meta-learning Model for Few-shot Fault Diagnosis under Multiple Limited Data Conditions, Mechanical Systems and Signal Processing, 2021.
[6] Qiyuan Zhang, Xiaoteng Ma, Yiqin Yang, Chenghao Li, Jun Yang*, Yu Liu, Bin Liang, Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning, IEEE Robotics and Automation Letters, 2021.
[7] Yiqin Yang , Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang*, Qianchuan Zhao, Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning, 35th Conference on Neural Information Processing Systems (NeurIPS), 2021.
[8] Chenghao Li, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang∗, Chongjie Zhang. Celebrating Diversity in Shared Multi-Agent Reinforcement Learning, 35th Conference on Neural Information Processing Systems (NeurIPS), 2021.
[9] Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang*, Qianchuan Zhao. Average-Reward Reinforcement Learning with Trust Region Methods, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI), 2021.
[10] Xiaoteng Ma, Yiqin Yang, Chenghao Li, Qianchuan Zhao, Jun Yang, Yiwen Lu. Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning, 20th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS), 2021.
[11] Xiaoyan Hu; Li Xia, Jun, Yang, Qianchuan Zhao. A Fast-Convergence Method of Monte Carlo Counterfactual Regret Minimization for Imperfect Information Dynamic Games, IEEE 9th Data Driven Control and Learning Systems Conference, 2020.
[12] Chenghao Li; Xiaoteng Ma; Li Xia; Qianchuan Zhao; Jun Yang. Fairness Control of Traffic Light via Deep Reinforcement Learning, 16th IEEE International Conference on Automation Science and Engineering (CASE), 2020.
发明专利:
授权国家发明专利20余项