Zhiwei Xu (徐志伟)
Conferences:
[19] Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Forty-first International Conference on Machine Learning (ICML), in Vienna, Austria, 2024.
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, and Guoliang Fan
[Arxiv]
[18] PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Auckland, New Zealand, 2024. (Full Paper)
Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, and Jiangjin Yin
[Arxiv][Code]
[17] From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL
International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Auckland, New Zealand, 2024. (Extended Abstract)
Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
[Arxiv]
[16] Adaptive Parameter Sharing for Multi-Agent Reinforcement Learning
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), in Seoul, Korea, 2024.
Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, and Guoliang Fan
[Arxiv]
[15] Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), in New Orleans, USA, 2023. (Poster)
Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, and Guoliang Fan
[Arxiv]
[14] Mastering Complex Coordination through Attention-based Dynamic Graph
International Conference on Neural Information Processing (ICONIP), in Changsha, China, 2023.
Guangchong Zhou, Zhiwei Xu, Zeren Zhang, and Guoliang Fan
[Arxiv]
[13] SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism
International Conference on Neural Information Processing (ICONIP), in Changsha, China, 2023.
Guangchong Zhou, Zhiwei Xu, Zeren Zhang, and Guoliang Fan
[12] Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning
32nd International Joint Conference on Artificial Intelligence (IJCAI), in Macao, S.A.R, China, 2023.
Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, and Guoliang Fan
[Arxiv]
[11] SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning
International Joint Conference on Neural Networks (IJCNN), in Queensland, Australia, 2023.
Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
[Arxiv]
[10] Hierarchical Multi-Agent Reinforcement Learning with Intrinsic Reward Rectification
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), in Rhodes Island, Greece, 2023. (Poster)
Zhihao Liu, Zhiwei Xu, and Guoliang Fan
[9] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), in Washington, DC, USA, 2023. (Oral)
Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, and Guoliang Fan
[Arxiv][Code]
[8] HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), in Washington, DC, USA, 2023. (Oral)
Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, and Guoliang Fan
[Arxiv][Code]
[7] Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), in New Orleans, USA, 2022. (Spotlight)
Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, and Guoliang Fan
[Arxiv]
[6] Multi-Agent Hyper-Attention Policy Optimization
International Conference on Neural Information Processing (ICONIP), in New Delhi, India, 2022.
Bin Zhang*, Zhiwei Xu*, Yiqun Chen*, Dapeng Li, Yunpeng Bai, Guoliang Fan, and Lijuan Li
[5] Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network
International Conference on Neural Information Processing (ICONIP), in New Delhi, India, 2022.
Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, and Guoliang Fan
[Arxiv]
[4] Learn Effective Representation for Deep Reinforcement Learning
IEEE International Conference on Multimedia and Expo (ICME), in Taipei, 2022. (Oral)
Yuan Zhan, Zhiwei Xu, and Guoliang Fan
[3] SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), in Auckland, New Zealand, 2022. (Full Paper)
Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, and Guoliang Fan
[Arxiv][Code]
[2] Learning to Coordinate via Multiple Graph Neural Networks
International Conference on Neural Information Processing (ICONIP), in Bali, Indonesia, 2021.
Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, and Guoliang Fan
[Arxiv][Code]
[1] MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning
International Joint Conference on Neural Networks (IJCNN), in Shenzhen, China, 2021. (Poster)
Zhiwei Xu, Dapeng Li, Yunpeng Bai, and Guoliang Fan
[Arxiv]
Pre-prints:
[3] Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, and Guoliang Fan
[Arxiv]
[2] TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
Jingqing Ruan*, Yihong Chen*, Bin Zhang*, Zhiwei Xu*, Tianpeng Bao*, Guoqing Du*, Shiwei Shi*, Hangyu Mao*, Ziyue Li, Xingyu Zeng, and Rui Zhao
[Arxiv]
[1] Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning
Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, and Guoliang Fan
[Arxiv]
PC Member or Reviewer for: