Welcome to Zhiwei Xu’s Homepage!

I am currently an assistant professor at the School of Artificial Intelligence, Shandong University. I received my Ph.D. from the Institute of Automation, Chinese Academy of Sciences, advised by Prof. Guoliang Fan (范国梁). My research interests include Reinforcement Learning, Multi-Agent Systems, and Large Language Model (LLM) Agents.

I am looking for collaboration opportunities. If you are interested in my experience or research, please feel free to contact me via WeChat or email (zhiwei_xu@sdu.edu.cn).

🔥 News

  • 2024.12:   One paper is accepted by AAMAS 2025!
  • 2024.12:   Two papers are accepted by AAAI 2025!
  • 2024.09:   One paper is accepted by ICONIP 2024!
  • 2024.05:   One paper is accepted by ICML 2024!
  • 2023.12:   Two papers are accepted by AAMAS 2024!
  • 2023.12:   One paper is accepted by ICASSP 2024!
  • 2023.12:  🎉🎉 Awarded the National Scholarship for Doctoral Students!
  • 2023.09:   One first-author paper is accepted by NeurIPS 2023!
  • 2023.07:   Two papers are accepted by ICONIP 2023!
  • 2023.04:   One paper is accepted by IJCAI 2023!

📖 Experience

📝 Publications

Conferences:

  • Unveiling Decision Intention for Cooperative Multi-Agent Reinforcement Learning
    International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Detroit, Michigan, USA, 2025.
    Zeren Zhang, Zhiwei Xu, Guangchong Zhou, Dapeng Li, Bin Zhang, and Guoliang Fan

  • Efficient Communication in Multi-Agent Reinforcement Learning with Implicit Consensus Generation
    The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), in Philadelphia, Pennsylvania, USA, 2025.
    Dapeng Li, Na Lou, Zhiwei Xu, Bin Zhang, and Guoliang Fan

  • Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition
    The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), in Philadelphia, Pennsylvania, USA, 2025.
    Changwei Wang, Shunpeng Chen, Yukun Song, Rongtao Xu, Zherui Zhang, Jiguang Zhang, Haoran Yang, Yu Zhang, Kexue Fu, Shide Du, Zhiwei Xu, Longxiang Gao, Li Guo, and Shibiao Xu

  • Decentralized Extension for Centralized Multi-Agent Reinforcement Learning via Online Distillation
    International Conference on Neural Information Processing (ICONIP), in Auckland, New Zealand, 2024.
    Zeren Zhang, Bin Zhang, Guangchong Zhou, Dapeng Li, Zhiwei Xu, and Guoliang Fan

  • Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
    Forty-first International Conference on Machine Learning (ICML), in Vienna, Austria, 2024.
    Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, and Guoliang Fan
    [Arxiv]

  • PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
    International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Auckland, New Zealand, 2024. (Full Paper)
    Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, and Jiangjin Yin
    [Arxiv][Code]

  • From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL
    International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Auckland, New Zealand, 2024. (Extended Abstract)
    Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
    [Arxiv]

  • Adaptive Parameter Sharing for Multi-Agent Reinforcement Learning
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), in Seoul, Korea, 2024.
    Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, and Guoliang Fan
    [Arxiv]

  • Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL
    Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), in New Orleans, USA, 2023. (Poster)
    Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, and Guoliang Fan
    [Arxiv]

  • Mastering Complex Coordination through Attention-based Dynamic Graph
    International Conference on Neural Information Processing (ICONIP), in Changsha, China, 2023.
    Guangchong Zhou, Zhiwei Xu, Zeren Zhang, and Guoliang Fan
    [Arxiv]

  • SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism
    International Conference on Neural Information Processing (ICONIP), in Changsha, China, 2023.
    Guangchong Zhou, Zhiwei Xu, Zeren Zhang, and Guoliang Fan

  • Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning
    32nd International Joint Conference on Artificial Intelligence (IJCAI), in Macao SAR, China, 2023.
    Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, and Guoliang Fan
    [Arxiv]

  • SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning
    International Joint Conference on Neural Networks (IJCNN), in Queensland, Australia, 2023.
    Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
    [Arxiv]

  • Hierarchical Multi-Agent Reinforcement Learning with Intrinsic Reward Rectification
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), in Rhodes Island, Greece, 2023. (Poster)
    Zhihao Liu, Zhiwei Xu, and Guoliang Fan

  • Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), in Washington, DC, USA, 2023. (Oral)
    Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, and Guoliang Fan
    [Arxiv][Code]

  • HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
    Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), in Washington, DC, USA, 2023. (Oral)
    Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, and Guoliang Fan
    [Arxiv][Code]

  • Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
    Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), in New Orleans, USA, 2022. (Spotlight)
    Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, and Guoliang Fan
    [Arxiv]

  • Multi-Agent Hyper-Attention Policy Optimization
    International Conference on Neural Information Processing (ICONIP), in New Delhi, India, 2022.
    Bin Zhang*, Zhiwei Xu*, Yiqun Chen*, Dapeng Li, Yunpeng Bai, Guoliang Fan, and Lijuan Li

  • Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network
    International Conference on Neural Information Processing (ICONIP), in New Delhi, India, 2022.
    Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, and Guoliang Fan
    [Arxiv]

  • Learn Effective Representation for Deep Reinforcement Learning
    IEEE International Conference on Multimedia and Expo (ICME), in Taipei, 2022. (Oral)
    Yuan Zhan, Zhiwei Xu, and Guoliang Fan

  • SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
    International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Auckland, New Zealand, 2022. (Full Paper)
    Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, and Guoliang Fan
    [Arxiv][Code]

  • Learning to Coordinate via Multiple Graph Neural Networks
    International Conference on Neural Information Processing (ICONIP), in Bali, Indonesia, 2021.
    Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, and Guoliang Fan
    [Arxiv][Code]

  • MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning
    International Joint Conference on Neural Networks (IJCNN), in Shenzhen, China, 2021. (Poster)
    Zhiwei Xu, Dapeng Li, Yunpeng Bai, and Guoliang Fan
    [Arxiv]

Pre-prints:

  • Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
    Zhiwei Xu, Hangyu Mao, Nianmin Zhang, Xin Xin, Pengjie Ren, Dapeng Li, Bin Zhang, Guoliang Fan, Zhumin Chen, Changwei Wang, and Jiangjin Yin
    [Arxiv]

  • Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning
    Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, and Guoliang Fan
    [Arxiv]

  • Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
    Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, and Guoliang Fan
    [Arxiv]

  • TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
    Jingqing Ruan*, Yihong Chen*, Bin Zhang*, Zhiwei Xu*, Tianpeng Bao*, Guoqing Du*, Shiwei Shi*, Hangyu Mao*, Ziyue Li, Xingyu Zeng, and Rui Zhao
    [Arxiv]

  • Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning
    Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, and Guoliang Fan
    [Arxiv]

💻 Services

Program Committee Member or Reviewer:

  • Neural Information Processing Systems (NeurIPS)
  • International Conference on Learning Representations (ICLR)
  • International Conference on Machine Learning (ICML)
  • AAAI Conference on Artificial Intelligence (AAAI)
  • International Joint Conference on Artificial Intelligence (IJCAI)
  • International Conference on Autonomous Agents and Multiagent Systems (AAMAS)

🎖 Honors and Awards

  • 2023   National Scholarship for Doctoral Students, Ministry of Education
  • 2022   Merit Student, University of Chinese Academy of Sciences
  • 2019   Outstanding Undergraduate, Sichuan University
  • 2016   National Scholarship for Undergraduate Students, Ministry of Education