Paper Notes on Pretrained Language Models with Factual Knowledge

Contents

  • Introduction
  • Keywords Convention
  • Papers
  • Model Editing vs. Continual Learning

Introduction

This is a paper list about pre-trained language models (PLMs) and factual knowledge.

Keywords Convention

Each paper is tagged with badges indicating its abbreviation, key features, main task, and (where relevant) editing info.

Papers

Survey

  1. A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models [pdf]

    Summarizes the current progress of pre-trained language model based knowledge-enhanced models (PLMKEs) by dissecting their three vital elements: knowledge sources, knowledge-intensive NLP tasks, and knowledge fusion methods. Main encyclopedic knowledge-intensive NLP tasks:

    • Open-domain QA
      • Natural Questions
      • HotpotQA
    • Fact Verification
      • FEVER
      • BOOLQ
    • Entity Linking
      • ACE2004
      • AIDA CoNLL-YAGO
      • WNWI
      • WNCW

Probing

  1. LAMA: Language Models as Knowledge Bases? (EMNLP 2019)
  2. ParaRel: Measuring and Improving Consistency in Pretrained Language Models (TACL 2021)
  3. BERT is Not a Knowledge Base (Yet): Factual Knowledge vs. Name-Based Reasoning in Unsupervised QA (preprint)
  4. How Can We Know What Language Models Know? (TACL 2020)
  5. How Much Knowledge Can You Pack Into the Parameters of a Language Model? (EMNLP 2020)
  6. Transformer Feed-Forward Layers Are Key-Value Memories (EMNLP 2021)
  7. Knowledge Neurons in Pretrained Transformers (ACL 2022)
  8. Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space (preprint)
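
The cloze-style probing popularized by LAMA queries a masked language model with fill-in-the-blank statements and checks whether the correct entity is ranked highly. A minimal sketch of such a probe, assuming the Hugging Face transformers library (the model name and prompt are illustrative):

```python
# LAMA-style cloze probe with an off-the-shelf masked LM.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="bert-base-uncased")

# Query a relational fact as a cloze statement and inspect the top predictions.
for pred in unmasker("The capital of France is [MASK]."):
    print(f'{pred["token_str"]:>10}  {pred["score"]:.3f}')
```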

Knowledge Retrieving

  1. Generalization through Memorization: Nearest Neighbor Language Models (ICLR 2020)
  2. REALM: Retrieval-Augmented Language Model Pre-Training (ICML 2020)
  3. Adaptive Semiparametric Language Models (TACL 2021)
  4. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (NeurIPS 2020)
  5. Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering (EACL 2021 short)
  6. End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering (NeurIPS 2021)
  7. WebGPT: Browser-assisted question-answering with human feedback (preprint)
  8. Improving language models by retrieving from trillions of tokens (preprint)
  9. Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data (preprint)
  10. A Survey on Retrieval-Augmented Text Generation (preprint)

Knowledge Injection

  1. Entities as Experts: Sparse Memory Access with Entity Supervision (EMNLP 2020)
  2. Facts as Experts: Adaptable and Interpretable Neural Memory over Symbolic Knowledge (preprint)
  3. K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters (Findings of ACL 2021)
  4. Reasoning Over Virtual Knowledge Bases With Open Predicate Relations (ICML 2021)
  5. AdapterFusion: Non-Destructive Task Composition for Transfer Learning (EACL 2021)
  6. Kformer: Knowledge Injection in Transformer Feed-Forward Layers (preprint)

Model Editing

This section contains pilot works that may have contributed to the prevalence of the model editing paradigm.

  1. Editable Neural Networks ICLR 2020.

    Anton Sinitsin, Vsevolod Plokhotnyuk, Dmitriy Pyrkin, Sergei Popov, Artem Babenko. [pdf], [project], 2020.7

    Related work:

    • re-train on the original dataset augmented with new samples (expensive)
    • use a manual cache (e.g. a lookup table) that overrules model predictions on problematic samples (cannot generalize to semantically equivalent inputs)

    Method: Editable Training (employs meta-learning techniques to ensure that mistakes can be corrected without harming overall performance); a minimal sketch of the resulting edit procedure follows below.
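
A minimal sketch of the edit procedure and the two desiderata it is judged by (the mistake gets fixed, accuracy elsewhere barely moves). Editable Training meta-learns the model so that this plain fine-tuning edit behaves well; the meta-learning outer loop is omitted and all helper names are illustrative:

```python
import copy
import torch
import torch.nn.functional as F

def edit(model, x_edit, y_edit, lr=1e-3, max_steps=10):
    """Fix a single mistake with a few SGD steps on a copy of the model."""
    edited = copy.deepcopy(model)
    opt = torch.optim.SGD(edited.parameters(), lr=lr)
    for _ in range(max_steps):
        if edited(x_edit).argmax(-1).item() == y_edit.item():
            break  # edit succeeded
        opt.zero_grad()
        F.cross_entropy(edited(x_edit), y_edit.view(1)).backward()
        opt.step()
    return edited

@torch.no_grad()
def drawdown(base_model, edited_model, loader):
    """Accuracy lost on held-out data because of the edit (locality)."""
    correct_base = correct_edited = total = 0
    for x, y in loader:
        correct_base += (base_model(x).argmax(-1) == y).sum().item()
        correct_edited += (edited_model(x).argmax(-1) == y).sum().item()
        total += y.numel()
    return (correct_base - correct_edited) / total
```
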
  2. Fact-based Text Editing ACL 2020.

    Hayate Iso, Chao Qiao, Hang Li. [pdf], [project], 2020.7

    Main contributions:

    • proposes a new dataset (and task): transforming a draft text into a revised text based on given triples
    • proposes a new model, FACTEDITOR, which consists of three components: a buffer for storing the draft text and its representations, a stream for storing the revised text and its representations, and a memory for storing the triples and their representations
  3. Modifying Memories in Transformer Models

    Explicitly modifying specific factual knowledge in Transformer models while ensuring the model performance does not degrade on the unmodified facts.

    • Motivation
      • updating stale knowledge
      • protecting privacy
      • eliminating unintended biases (debias)
    • Baselines
      • Retraining the model on the modified training set
      • Fine-tuning on the modified facts
      • Fine-tuning on a mixture of modified and unmodified batches
    • Method
      • Constrained fine-tuning on the supporting evidence for the modified facts (a minimal sketch follows below)
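
A minimal sketch of the constrained fine-tuning idea, assuming a PyTorch model: take gradient steps on the modified facts, then project the weights back into a small norm ball around the original weights so that the unmodified facts are largely left intact. The L-infinity ball and the radius delta are illustrative choices:

```python
import torch

def constrained_finetune(model, modified_batches, loss_fn,
                         lr=1e-5, delta=1e-3, epochs=1):
    """Fine-tune on modified facts while keeping weights close to the originals."""
    original = {n: p.detach().clone() for n, p in model.named_parameters()}
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for batch in modified_batches:
            opt.zero_grad()
            loss_fn(model, batch).backward()
            opt.step()
            # Projection: clamp each parameter into an L-infinity ball of
            # radius delta around its original value.
            with torch.no_grad():
                for n, p in model.named_parameters():
                    p.clamp_(original[n] - delta, original[n] + delta)
    return model
```
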
  4. Editing Factual Knowledge in Language Models (EMNLP 2021)

    • Evaluation (see the sketch below)
      • success rate: the edit updates the target prediction
      • retain accuracy: predictions on unrelated inputs are retained
      • equivalence accuracy: semantically equivalent inputs receive the same updated prediction
      • performance deterioration: how much overall test performance degrades after editing
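
A minimal sketch of how these four quantities can be computed from predictions gathered before and after a single edit; all data structures and names here are illustrative:

```python
def editing_metrics(pred_before, pred_after, new_target,
                    edit_inputs, equivalent_inputs, unrelated_inputs,
                    test_acc_before, test_acc_after):
    """pred_before / pred_after map each input to the model's prediction."""
    frac = lambda hits: sum(hits) / max(len(hits), 1)
    return {
        # the edit updates the target prediction
        "success_rate": frac([pred_after[x] == new_target for x in edit_inputs]),
        # semantically equivalent inputs receive the same updated prediction
        "equivalence_accuracy": frac([pred_after[x] == new_target for x in equivalent_inputs]),
        # predictions on unrelated inputs are unchanged
        "retain_accuracy": frac([pred_after[x] == pred_before[x] for x in unrelated_inputs]),
        # how much overall test performance degrades after editing
        "performance_deterioration": test_acc_before - test_acc_after,
    }
```
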
  5. Fast Model Editing at Scale (ICLR 2022)

    A model editor is efficient if the time and memory requirements for computing φ and evaluating E are small. Gradients are high-dimensional objects 👉 use a low-rank decomposition of the gradient (checked in the sketch below).
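
The structure being exploited can be checked directly: for a single example, the gradient of the loss with respect to a linear layer's weight is the outer product of the output gradient and the layer input, i.e. rank one. A quick self-contained check with illustrative shapes and loss:

```python
import torch

x = torch.randn(64)                          # layer input
W = torch.randn(32, 64, requires_grad=True)  # linear layer weight
y = W @ x                                    # layer output
loss = y.pow(2).sum()                        # any scalar loss
loss.backward()

delta = (2 * y).detach()                     # dloss/dy for this particular loss
print(torch.allclose(W.grad, torch.outer(delta, x)))  # True: grad = delta x^T
print(torch.linalg.matrix_rank(W.grad))               # tensor(1)
```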

  6. Locating and Editing Factual Knowledge in GPT (preprint)

Continual Pretraining

  1. Don't Stop Pretraining: Adapt Language Models to Domains and Tasks (ACL 2020)
  2. Mind the Gap: Assessing Temporal Generalization in Neural Language Models (NeurIPS 2021)
  3. Towards Continual Knowledge Learning of Language Models (preprint)
  4. DEMix Layers: Disentangling Domains for Modular Language Modeling (preprint)
  5. Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora (preprint)
  6. Time-Aware Language Models as Temporal Knowledge Bases (preprint)
  7. Dynamic Language Models for Continuously Evolving Content (KDD 2021)
  8. ELLE: Efficient Lifelong Pre-training for Emerging Data (Findings of ACL 2022)

Model Editing vs. Continual Learning

  • Common: both assimilate or update a model's behavior without catastrophic forgetting

  • Continual learning: learns new tasks while preserving performance on previous tasks; typically targets wholly new behaviors or datasets and long sequences of model updates

  • Model editing: explicitly modifies specific factual knowledge while ensuring that performance does not degrade on the unmodified facts; typically considers an edit or a batch of edits applied all at once, and requires the model to memorize new facts that conflict with previously learned ones
