Zhiwei Xu 0005
Person information
- unicode name: 徐志伟
- affiliation: Chinese Academy of Sciences, Institute of Automation, Beijing, China
- affiliation: University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China
Other persons with the same name
- Zhiwei Xu (aka: Zhi-Wei Xu, Zhi Wei Xu) — disambiguation page
- Zhiwei Xu 0001 (aka: Zhiwei (Tommy) Xu) — University of Michigan-Dearborn, MI, USA (and 1 more)
- Zhiwei Xu 0002 — Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
- Zhiwei Xu 0003 — Zhejiang University, Institute of Marine Electronic and Intelligent System, Ocean College, Zhoushan, China (and 1 more)
- Zhiwei Xu 0004 (aka: Zhi-Wei Xu 0004) — Wuhan University of Science and Technology, School of Computer Science and Technology, China
2020 – today
- 2024
- [c16] Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin:
  PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024: 1363-1371
- [c15] Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan:
  Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach. ICML 2024
- [i19] Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan:
  Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning. CoRR abs/2404.17780 (2024)
- [i18] Zhiwei Xu, Hangyu Mao, Nianmin Zhang, Xin Xin, Pengjie Ren, Dapeng Li, Bin Zhang, Guoliang Fan, Zhumin Chen, Changwei Wang, Jiangjin Yin:
  Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2408.09501 (2024)
- 2023
- [c14] Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang Fan:
  Consensus Learning for Cooperative Multi-Agent Reinforcement Learning. AAAI 2023: 11726-11734
- [c13] Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan:
  HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism. AAAI 2023: 11735-11743
- [c12] Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan:
  Mastering Complex Coordination Through Attention-Based Dynamic Graph. ICONIP (1) 2023: 305-318
- [c11] Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan:
  SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism. ICONIP (1) 2023: 319-331
- [c10] Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan:
  Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning. IJCAI 2023: 353-361
- [c9] Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan:
  SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning. IJCNN 2023: 1-8
- [c8] Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan:
  Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL. NeurIPS 2023
- [i17] Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan:
  Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2302.02180 (2023)
- [i16] Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, Guoliang Fan:
  Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning. CoRR abs/2303.11716 (2023)
- [i15] Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan:
  Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning. CoRR abs/2304.10351 (2023)
- [i14] Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan:
  SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning. CoRR abs/2304.12532 (2023)
- [i13] Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan:
  From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL. CoRR abs/2304.14656 (2023)
- [i12] Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan:
  Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems. CoRR abs/2305.07856 (2023)
- [i11] Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao:
  TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents. CoRR abs/2308.03427 (2023)
- [i10] Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan:
  Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach. CoRR abs/2311.13884 (2023)
- [i9] Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan:
  Mastering Complex Coordination through Attention-based Dynamic Graph. CoRR abs/2312.04245 (2023)
- [i8] Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin:
  PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. CoRR abs/2312.15863 (2023)
- 2022
- [c7] Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, Guoliang Fan:
  SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning. AAMAS 2022: 1400-1408
- [c6] Yuan Zhan, Zhiwei Xu, Guoliang Fan:
  Learn Effective Representation for Deep Reinforcement Learning. ICME 2022: 1-6
- [c5] Bin Zhang, Zhiwei Xu, Yiqun Chen, Dapeng Li, Yunpeng Bai, Guoliang Fan, Lijuan Li:
  Multi-Agent Hyper-Attention Policy Optimization. ICONIP (1) 2022: 76-87
- [c4] Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, Guoliang Fan:
  Efficient Policy Generation in Multi-agent Systems via Hypergraph Neural Network. ICONIP (2) 2022: 219-230
- [c3] Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan:
  Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning. NeurIPS 2022
- [i7] Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, Guoliang Fan:
  Efficient Cooperation Strategy Generation in Multi-Agent Video Games via Hypergraph Neural Network. CoRR abs/2203.03265 (2022)
- [i6] Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan:
  Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2204.09418 (2022)
- [i5] Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Guoliang Fan:
  Consensus Learning for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2206.02583 (2022)
- 2021
- [c2] Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, Guoliang Fan:
  Learning to Coordinate via Multiple Graph Neural Networks. ICONIP (3) 2021: 52-63
- [c1] Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan:
  MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. IJCNN 2021: 1-7
- [i4] Zhiwei Xu, Bin Zhang, Yunpeng Bai, Dapeng Li, Guoliang Fan:
  Learning to Coordinate via Multiple Graph Neural Networks. CoRR abs/2104.03503 (2021)
- [i3] Zhiwei Xu, Yunpeng Bai, Dapeng Li, Bin Zhang, Guoliang Fan:
  SIDE: I Infer the State I Want to Learn. CoRR abs/2105.06228 (2021)
- [i2] Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan:
  MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2106.11652 (2021)
- [i1] Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan:
  HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism. CoRR abs/2110.07246 (2021)
last updated on 2024-10-23 20:33 CEST by the dblp team
all metadata released as open data under CC0 1.0 license