A Modern Self-Referential Weight Matrix That Learns to Modify Itself

Irie, Kazuki; Schlag, Imanol; Csordás, Róbert; Schmidhuber, Jürgen


arXiv:2202.05780v1 (cs)

[Submitted on 11 Feb 2022 (this version), latest version 17 Jun 2022 (v2)]

Abstract: The weight matrix (WM) of a neural network (NN) is its program. The programs of many traditional NNs are learned through gradient descent in some error function, then remain fixed. The WM of a self-referential NN, however, can keep rapidly modifying all of itself during runtime. In principle, such NNs can meta-learn to learn, and meta-meta-learn to meta-learn to learn, and so on, in the sense of recursive self-improvement. While NN architectures potentially capable of implementing such behavior have been proposed since the '90s, there have been few if any practical studies. Here we revisit such NNs, building upon recent successes of fast weight programmers and closely related linear Transformers. We propose a scalable self-referential WM (SRWM) that uses outer products and the delta update rule to modify itself. We evaluate our SRWM in supervised few-shot learning and in multi-task reinforcement learning with procedurally generated game environments. Our experiments demonstrate both practical applicability and competitive performance of the proposed SRWM. Our code is public.
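To make "outer products and the delta update rule" concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. The row layout of W (output, key, query, learning rate), the softmax on keys and queries, the sigmoid on the learning rate, and the names `delta_update` and `srwm_step` are assumptions made for illustration; the abstract itself only states that the matrix modifies itself with outer products and the delta rule.

```python
import math

def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(z):
    m = max(z)
    e = [math.exp(zi - m) for zi in z]
    s = sum(e)
    return [ei / s for ei in e]

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def delta_update(W, key, value, lr):
    """Delta rule: return W + lr * outer(value - W @ key, key).

    The correction is a rank-one outer product that moves the value
    currently stored under `key` toward `value`. W is not mutated.
    """
    old = matvec(W, key)
    return [[w + lr * (v - o) * kj for w, kj in zip(row, key)]
            for row, v, o in zip(W, value, old)]

def srwm_step(W, x, d_out):
    """One self-referential step (assumed layout, for illustration only).

    W has d_out + 2 * len(x) + 1 rows. From the input x, W produces its
    own output y, a key, a query, and a learning rate, then applies the
    delta rule to itself, using its response to the query as the target.
    """
    d_in = len(x)
    out = matvec(W, x)
    y = out[:d_out]
    key = softmax(out[d_out:d_out + d_in])
    query = softmax(out[d_out + d_in:d_out + 2 * d_in])
    lr = sigmoid(out[d_out + 2 * d_in])
    target = matvec(W, query)
    return y, delta_update(W, key, target, lr)

# Plain delta rule: store the value [3, 4] under the key [1, 0].
W = delta_update([[0.0, 0.0], [0.0, 0.0]], key=[1.0, 0.0], value=[3.0, 4.0], lr=1.0)

# Self-referential step on a small 6 x 2 matrix (d_out = 1, d_in = 2).
W_self = [[0.1, -0.2], [0.3, 0.0], [0.0, 0.5], [-0.4, 0.2], [0.1, 0.1], [0.2, -0.3]]
y, W_next = srwm_step(W_self, [1.0, -1.0], d_out=1)
```

With a unit-norm key and a learning rate of 1, `delta_update` overwrites the stored value exactly. In `srwm_step` the key, the target value, and the learning rate all come from the matrix being updated, which is the sense in which the matrix is self-referential in this sketch.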

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2202.05780 [cs.LG]
	(or arXiv:2202.05780v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2202.05780

Submission history

From: Kazuki Irie
[v1] Fri, 11 Feb 2022 17:24:31 UTC (9,234 KB)
[v2] Fri, 17 Jun 2022 12:54:20 UTC (9,401 KB)
