default search action

combined dblp search
author search
venue search
publication search

ask others

Zhiwei Xu 0005

徐志伟

> Home > Persons

Person information

unicode name: 徐志伟
affiliation: Chinese Academy of Sciences, Institute of Automation, Beijing, China
affiliation: University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/MaoZ00CCZXZY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/MaoZ00CCZXZY24
Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin:
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024: 1363-1371
[c15]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/0052M0000F24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0052M0000F24
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan:
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach. ICML 2024
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-17780
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-17780
Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan:
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning. CoRR abs/2404.17780 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-09501
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-09501
Zhiwei Xu, Hangyu Mao, Nianmin Zhang, Xin Xin, Pengjie Ren, Dapeng Li, Bin Zhang, Guoliang Fan, Zhumin Chen, Changwei Wang, Jiangjin Yin:
Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2408.09501 (2024)
2023
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/0005Z0ZZCF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/0005Z0ZZCF23
Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang Fan:
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning. AAAI 2023: 11726-11734
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XuBZ0F23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XuBZ0F23
Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan:
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism. AAAI 2023: 11735-11743
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/ZhouXZF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/ZhouXZF23
Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan:
Mastering Complex Coordination Through Attention-Based Dynamic Graph. ICONIP (1) 2023: 305-318
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/ZhouXZF23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/ZhouXZF23a
Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan:
SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism. ICONIP (1) 2023: 319-331
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/0052000F23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/0052000F23
Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan:
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning. IJCAI 2023: 353-361
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/LiXZF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/LiXZF23
Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan:
SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning. IJCNN 2023: 1-8
[c8]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/XuZ0ZZF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuZ0ZZF23
Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan:
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL. NeurIPS 2023
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-02180
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-02180
Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan:
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2302.02180 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-11716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-11716
Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, Guoliang Fan:
Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning. CoRR abs/2303.11716 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-10351
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-10351
Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan:
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning. CoRR abs/2304.10351 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-12532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-12532
Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan:
SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning. CoRR abs/2304.12532 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-14656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-14656
Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan:
From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL. CoRR abs/2304.14656 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-07856
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-07856
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan:
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems. CoRR abs/2305.07856 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-03427
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-03427
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao:
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents. CoRR abs/2308.03427 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-13884
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-13884
Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan:
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach. CoRR abs/2311.13884 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04245
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04245
Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan:
Mastering Complex Coordination through Attention-based Dynamic Graph. CoRR abs/2312.04245 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-15863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15863
Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin:
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. CoRR abs/2312.15863 (2023)
2022
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/XuBLZF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/XuBLZF22
Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, Guoliang Fan:
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning. AAMAS 2022: 1400-1408
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/ZhanXF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/ZhanXF22
Yuan Zhan, Zhiwei Xu, Guoliang Fan:
Learn Effective Representation for Deep Reinforcement Learning. ICME 2022: 1-6
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/Zhang0C0BFL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/Zhang0C0BFL22
Bin Zhang, Zhiwei Xu, Yiqun Chen, Dapeng Li, Yunpeng Bai, Guoliang Fan, Lijuan Li:
Multi-Agent Hyper-Attention Policy Optimization. ICONIP (1) 2022: 76-87
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/ZhangB0LF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/ZhangB0LF22
Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, Guoliang Fan:
Efficient Policy Generation in Multi-agent Systems via Hypergraph Neural Network. ICONIP (2) 2022: 219-230
[c3]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/00050ZZBF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/00050ZZBF22
Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan:
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning. NeurIPS 2022
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-03265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-03265
Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, Guoliang Fan:
Efficient Cooperation Strategy Generation in Multi-Agent Video Games via Hypergraph Neural Network. CoRR abs/2203.03265 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-09418
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-09418
Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan:
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2204.09418 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02583
Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Guoliang Fan:
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2206.02583 (2022)
2021
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/XuZBLF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/XuZBLF21
Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, Guoliang Fan:
Learning to Coordinate via Multiple Graph Neural Networks. ICONIP (3) 2021: 52-63
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/XuLBF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/XuLBF21
Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan:
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. IJCNN 2021: 1-7
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-03503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-03503
Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, Guoliang Fan:
Learning to Coordinate via Multiple Graph Neural Networks. CoRR abs/2104.03503 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2105-06228
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-06228
Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, Guoliang Fan:
SIDE: I Infer the State I Want to Learn. CoRR abs/2105.06228 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-11652
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-11652
Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan:
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2106.11652 (2021)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-07246
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07246
Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan:
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism. CoRR abs/2110.07246 (2021)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.