Hello! Welcome to the website of Zhang Yang (张杨).

I am now a Postdoctoral Fellow at Hong Kong Polytechnic University, under the supervision of Prof. Edward Chung. I obtained my B.S. and Ph.D. degrees from Xi’an Jiaotong University advised by Prof. Qingyu Yang,杨清宇.

I am deeply interested in modern AI, particularly reinforcement learning and large language models, and hope to use them to tackle the challenging control and optimization problems in power dispatch at the nexus of smart grids and intelligent transportation systems.

I have published several papers, including nine as first author, in top international AI conferences such as NeurIPS, ICML, KDD, IJCAI, and AAAI, as well as in leading journals including IEEE Transactions and Applied Energy. I have served as a reviewer for ICLR, AAAI, IJCAI, SIGIR, and various IEEE Transactions journals. Moreover, I am the first inventor on two granted patents.

I maintain a close collaboration with the startup MemOS. More details can be found in my Chinese CV.

🔥 News

  • 2026.01:  🎉🎉 One paper is accepted by TNNLS.
  • 2026.01:  🙌🙌 I give a talk in Tongji University, School of Electronic and Information Engineering.
  • 2025.11:  🎉🎉 My doctoral desseration was awarded as “Outstanding Doctoral Dissertations of Shaanxi Province (陕西省优秀博士论文)”
  • 2025.11:  🙌🙌 I give a talk in Shanghai Jiaotong University, Paris Elite Institute of Technology.
  • 2025.05:  🎉🎉 One paper is accepted by KDD 2025.
  • 2025.05:  🎉🎉 One paper is accepted by ICML 2025.
  • 2025.04:  🎉🎉 One paper is accepted by IJCAI 2025.
  • 2025.02:  🙌🙌 I recevided The Hong Kong Polytechnic University Postdoctoral Matching Fund.

📝 Publications

  • Yang Zhang, Yunjian Xu, Chengwei Zhang, Chao Wang, et al. Rethinking the Utilization of Individual Rewards in Multi-Agent Reinforcement Learning with Sparse Team Rewards [J]. TNNLS, 2026.

  • Yang Zhang, Yu Yu, Bo Tang, Yu Zhu, et al. Token-level Accept or Reject: A micro alignment approach for Large Language Models [C]. IJCAI, 2025.

  • Zhihe Yang, Yunjian Xu, Yang Zhang. Q-Supervised Contrastive Representation: A State Decoupling Framework for Safe Offline Reinforcement Learning [C]. ICML, 2025.

  • Lindong Xie, Yang Zhang, Zhixian Tang, Edward Chung, et al. Co-Evolution of Large Language Models and Configuration Strategies to Enhance Surrogate-Assisted Evolutionary Algorithm [C]. KDD, 2025.

  • Yang Zhang, Qingyu Yang, Dou An, Donehe Li, et al. Coordination Multistep Multiagent Reinforcement Learning for Optimal Energy Schedule Strategy of Charging Stations in Smart Grid [J]. IEEE Transactions on Cybernetics, 2023.

  • Yang Zhang, Qingyu Yang, Donghe Li, Dou An. A Reinforcement and Imitation Learning Method for Pricing Strategy of Electricity Retailer with Customers’ Flexibility [J]. Applied Energy, 2022.

  • Yang Zhang, Bo Tang, Qingyu Yang, Dou An, et al. BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market [C]. NeurIPS, 2021.

  • Yang Zhang, Qingyu Yang, Dou An, Chengwei Zhang. Coordination Between Individual Agents in Multi-Agent Reinforcement Learning [C]. AAAI, 2021.

  • Donghe Li, Qingyu Yang, Wei Yu, Dou An, Yang Zhang. Towards differential privacy-based online double auction for smart grid [J]. IEEE Transactions on Information Forensics and Security, 2020.

  • Yang Zhang, Zhengfeng Zhang, Qingyu Yang, Dou An, et al. EV charging bidding by Multi-DQN reinforcement learning in electricity auction Market [J]. Neurocomputing, 2020.

  • Yang Zhang, Qingyu Yang, Dou An, Donghe Li, et al. An Online Continuous Progressive Second Price Auction for Electric Vehicle Charging [J]. IEEE Internet of Things Journal, 2019.

  • Yang Zhang, Qingyu Yang, Dou An, Donghe Li. A blockchain based peer-to-peer electricity trading mechanism in residential electricity market [C]. Chinese Automation Congress, 2018.

🎖 Honors and Awards

  • 2025.11 Outstanding Doctoral Dissertations of Shaanxi Province (陕西省优秀博士论文).
  • 2025.05 Outstanding Doctoral Dissertations of Xi’an Jiaotong Univerisity (西安交通大学优秀博士论文).
  • 2025.02 Hong Kong Polytechnic University Postdoctoral Matching Fund.
  • 2024.04 First Prize of the Shaanxi Higher Education Science and Technology Award, Ranked 4th (陕西高等学校科学技术奖一等奖).
  • 2022.10 Top 15 Doctoral candidate in Xi’an Jiaotong University (西安交通大学优秀研究生标兵).
  • 2022.10 National Scholarship (国家奖学金).
  • 2017.10 National Encouragement Scholarship (国家励志奖学金).

📖 Educations

  • 2018.09 - 2023.03, Xi’an Jiaotong Univerisity, Ph.D.
  • 2014.09 - 2018.06, Xi’an Jiaotong Univerisity, Bachelor.

🏬 Work Experiences

  • 2024.09 - now, Hong Kong Polytechnic University, Postdoctoral Fellow.
  • 2024.04 - 2024.08, Chinese University of Hong Kong, Research Associate.
  • 2023.04 - 2024.03, Zhejiang Lab, Assistant Researcher.
  • 2020.12 - 2021.03, Alibaba Group, Research Intern.

💬 Invited Talks

  • 2026.01, Tongji University, School of Electronic and Information Engineering. Intelligent Decision-Making Algorithm Driven by Reinforcement Learning and Large Language Models and Its Application in Electric Vehicle Charging Scheduling.
  • 2025.12, Shanghai Jiaotong University, Paris Elite Institute of Technology. Large Language Model and Reinforcement Learning Hybrid-Driven Data-Efficient Control.
  • 2025.05, Hong Kong Polytechnic University, Stackelberg Equilibrium-based Multi-agent Reinforcement Learning for Electric Vehicles Charging Scheduling.
  • 2023.03, Xi’an Jiaotong University, The Student Speaker of Graduation Ceremony.
  • 2022.05, RL China (Online). Offline Reinforcement Learning-based Coupons Allocation in E-commerce Market.
  • 2021.01, Alibaba Group. Agent Correlation-based Multi-agent Reinforcement Learning.

Flag Counter