师资队伍

师资队伍

杨君

副研究员
导航与控制研究所


教育背景


2000年9月–2004年7月 西北工业大学,航海工程学院,学士

2004年9月–2007年3月 北京邮电大学,自动化系,硕士

2007年9月–2012年1月 清华大学,自动化系,博士


工作履历


2012年2月–2015年5月 航天五院,北京控制工程研究所,主管设计师;

2015年6月–2018年7月 清华大学,深圳研究生院信息学部,博士后/助理研究员;

2018年8月–2023年1月 清华大学,自动化系,助理研究员;

2023年1月–至今 清华大学,自动化系,副研究员


学术兼职


担任NeurIPS, ICLR, IJCAI, AAMAS等国际学术会议和MSSP, RAL, AST等国际期刊审稿人


社会兼职


人工智能学会智能决策专委会(筹) 秘书长

智能无人系统建模仿真专委会 委员


研究领域


强化学习、多智能体协同决策、智能控制


研究概况


1. 国家科技部科技创新2030“脑科学与类脑研究”重大项目:面向类脑芯片的深度增强学习方法,2022.01-2027.12,子课题负责人

2. 国家实验室课题,智能博弈对抗策略的数字建模工具协作研究与开发,2022.1-2022.12,项目负责人

3. 国防创新特区重点项目,2020.6-2022.11,项目负责人

4. 航天二院横向课题,2020.6-2021.6,项目负责人

5. 国防创新特区重点项目,2019.12-2022.6,项目负责人

6. 国防创新特区重点项目,2019.08-2020.7,项目负责人


奖励与荣誉


军队科技进步一等奖


学术成果

论文:

[1] Kailin Zeng, QiYuan Zhang, Bin Chen, Bin Liang, and Jun Yang*. APD: Learning Diverse Behaviors for Reinforcement Learning Through Unsupervised Active Pre-Training, IEEE Robotics and Automation Letters, 2022.

[2] Shu Leng, Xianglong Li, Meng Yu, Jun Yang*, Bin Liang. Flexible online planning based residual space object de-spinning for dual-arm space-borne maintenance, Aerospace Science and Technology, 2022, 130:1-13.

[3] Xiaoteng Ma, Yiqin Yang&, Hao Hu, Qihan Liu, Jun Yang*, Chongjie Zhang, Qianchuan Zhao, Bin Liang. Offline Reinforcement Learning with Value-based Episodic Memory, Tenth International Conference on Learning Representations (ICLR), 2022.

[4] Jun Yang, Bin Chen, Yanan Wang, Chunzhu Wang. Crack detection in carbide anvil using acoustic signal and deep learning with particle swarm optimization, Measurement, 2021.

[5] Duo Wang, Ming Zhang, Yuchun Xu, Weining Lu, Jun Yang*, Tao Zhang, Metric-based Meta-learning Model for Few-shot Fault Diagnosis under Multiple Limited Data Conditions, Mechanical Systems and Signal Processing, 2021.

[6] Qiyuan Zhang, Xiaoteng Ma, Yiqin Yang, Chenghao Li, Jun Yang*, Yu Liu, Bin Liang, Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning, IEEE Robotics and Automation Letters, 2021.

[7] Yiqin Yang , Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang*, Qianchuan Zhao, Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning, 35th Conference on Neural Information Processing Systems (NeurIPS), 2021.

[8] Chenghao Li, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang∗, Chongjie Zhang. Celebrating Diversity in Shared Multi-Agent Reinforcement Learning, 35th Conference on Neural Information Processing Systems (NeurIPS), 2021.

[9] Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang*, Qianchuan Zhao. Average-Reward Reinforcement Learning with Trust Region Methods, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI), 2021.

[10] Xiaoteng Ma, Yiqin Yang, Chenghao Li, Qianchuan Zhao, Jun Yang, Yiwen Lu. Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning, 20th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS), 2021.

[11] Xiaoyan Hu; Li Xia, Jun, Yang, Qianchuan Zhao. A Fast-Convergence Method of Monte Carlo Counterfactual Regret Minimization for Imperfect Information Dynamic Games, IEEE 9th Data Driven Control and Learning Systems Conference, 2020.

[12] Chenghao Li; Xiaoteng Ma; Li Xia; Qianchuan Zhao; Jun Yang. Fairness Control of Traffic Light via Deep Reinforcement Learning, 16th IEEE International Conference on Automation Science and Engineering (CASE), 2020.


发明专利:

授权国家发明专利20余项