Zhiwei Xu (徐志伟)
Ph.D Candidate

Institute of Automation, Chinese Academy of Sciences
School of Artificial Intelligence, University of Chinese Academy of Sciences

Location: 95 Zhongguancun East Road, BEIJING, CHINA
News | Research Interest | Education | Publications | Services | Awards

Email: xuzhiwei2019@ia.ac.cn (prior);      diligencexu@gmail.com
[GitHub] [DBLP] [Semantic Scholar] [Google Scholar] [Wechat]

News


Research Interest

My research interests include reinforcement learning, game theory and multi-agent systems. Currently, I focus on the following research topics:

Education


Publications

Conferences:

    [19] Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
    Forty-first International Conference on Machine Learning(ICML), in Vienna, Austria, 2024.
    Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, and Guoliang Fan
    [Arxiv]

    [18] PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
    International Conference on Autonomous Agents and Multi-Agent Systems(AAMAS), in Auckland, New Zealand, 2024. (Full Paper)
    Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, and Jiangjin Yin
    [Arxiv][Code]

    [17] From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL
    International Conference on Autonomous Agents and Multi-Agent Systems(AAMAS), in Auckland, New Zealand, 2024. (Extended Abstract)
    Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
    [Arxiv]

    [16] Adaptive Parameter Sharing for Multi-Agent Reinforcement Learning
    IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), in Seoul, Korea, 2024.
    Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, and Guoliang Fan
    [Arxiv]

    [15] Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL
    Thirty-seventh Conference on Neural Information Processing Systems(NeurIPS), in New Orleans, USA, 2023. (Poster)
    Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, and Guoliang Fan
    [Arxiv]

    [14] Mastering Complex Coordination through Attention-based Dynamic Graph
    International Conference on Neural Information Processing(ICONIP), in Changsha, China, 2023.
    Guangchong Zhou, Zhiwei Xu, Zeren Zhang, and Guoliang Fan
    [Arxiv]

    [13] SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism
    International Conference on Neural Information Processing(ICONIP), in Changsha, China, 2023.
    Guangchong Zhou, Zhiwei Xu, Zeren Zhang, and Guoliang Fan

    [12] Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning
    32nd International Joint Conference on Artificial Intelligence(IJCAI), in Macao, S.A.R, China, 2023.
    Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, and Guoliang Fan
    [Arxiv]

    [11] SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning
    International Joint Conference on Neural Networks(IJCNN), in Queensland, Australia, 2023.
    Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
    [Arxiv]

    [10] Hierarchical Multi-Agent Reinforcement Learning with Intrinsic Reward Rectification
    IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), in Rhodes island, Greece, 2023. (Poster)
    Zhihao Liu, Zhiwei Xu, and Guoliang Fan

    [9] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Thirty-Seventh AAAI Conference on Artificial Intelligence(AAAI), in Washington, DC, USA, 2023. (Oral)
    Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, and Guoliang Fan
    [Arxiv][Code]

    [8] HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
    Thirty-Seventh AAAI Conference on Artificial Intelligence(AAAI), in Washington, DC, USA, 2023. (Oral)
    Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, and Guoliang Fan
    [Arxiv][Code]

    [7] Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
    Thirty-sixth Conference on Neural Information Processing Systems(NeurIPS), in New Orleans, USA, 2022. (Spotlight)
    Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, and Guoliang Fan
    [Arxiv]

    [6] Multi-Agent Hyper-Attention Policy Optimization
    International Conference on Neural Information Processing(ICONIP), in New Delhi, India, 2022.
    Bin Zhang*, Zhiwei Xu*, Yiqun Chen*, Dapeng Li, Yunpeng Bai, Guoliang Fan, and Lijuan Li

    [5] Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network
    International Conference on Neural Information Processing(ICONIP), in New Delhi, India, 2022.
    Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, and Guoliang Fan
    [Arxiv]

    [4] Learn Effective Representation for Deep Reinforcement Learning
    IEEE International Conference on Multimedia and Expo(ICME), in Taipei, 2022. (Oral)
    Yuan Zhan, Zhiwei Xu, and Guoliang Fan

    [3] SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
    International Conference on Autonomous Agents and Multi-Agent Systems(AAMAS), in Auckland, New Zealand, 2022. (Full Paper)
    Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, and Guoliang Fan
    [Arxiv][Code]

    [2] Learning to Coordinate via Multiple Graph Neural Networks
    International Conference on Neural Information Processing(ICONIP), in BALI, Indonesia, 2021.
    Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, and Guoliang Fan
    [Arxiv][Code]

    [1] MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning
    International Joint Conference on Neural Networks(IJCNN), in Shenzhen, China, 2021. (Poster)
    Zhiwei Xu, Dapeng Li, Yunpeng Bai, and Guoliang Fan
    [Arxiv]


Pre-prints:

    [3] Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
    Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, and Guoliang Fan
    [Arxiv]

    [2] TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
    Jingqing Ruan*, Yihong Chen*, Bin Zhang*, Zhiwei Xu*, Tianpeng Bao*, Guoqing Du*, Shiwei Shi*, Hangyu Mao*, Ziyue Li, Xingyu Zeng, and Rui Zhao
    [Arxiv]

    [1] Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning
    Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, and Guoliang Fan
    [Arxiv]


Services

PC Member or Reviewer for:

  • Neural Information Processing Systems (NeurIPS: 2022, 2023)
  • International Conference on Learning Representations (ICLR: 2024)
  • International Conference on Machine Learning (ICML: 2024)
  • International Joint Conference on Artificial Intelligence (IJCAI: 2024)

Awards

  • 2023, National Scholarship for doctoral students, Ministry of Education
  • 2022, Merit Student, University of Chinese Academy of Sciences
  • 2019, Outstanding Undergraduate, Sichuan University
  • 2016, National Scholarship for undergraduate students, Ministry of Education