
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering

This repo provides the source code & data of our paper: JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering (NAACL 2022).

For convenience, all data, checkpoints, and code can be downloaded from my Baidu Netdisk.

1. Dependencies

Run the following commands to create a conda environment (assuming CUDA 11):

conda create -n jointlk python=3.7
source activate jointlk
pip install torch==1.7.1+cu110 -f https://download.pytorch.org/whl/torch_stable.html
pip install transformers==3.2.0
pip install nltk spacy==2.1.6
python -m spacy download en
# for torch-geometric
pip install torch-cluster==1.5.9 -f https://pytorch-geometric.com/whl/torch-1.7.1+cu110.html
pip install torch-spline-conv==1.2.1 -f https://pytorch-geometric.com/whl/torch-1.7.1+cu110.html
pip install torch-scatter==2.0.6 -f https://pytorch-geometric.com/whl/torch-1.7.1+cu110.html
pip install torch-sparse==0.6.9 -f https://pytorch-geometric.com/whl/torch-1.7.1+cu110.html
pip install torch-geometric==1.6.3 -f https://pytorch-geometric.com/whl/torch-1.7.1+cu110.html

See the file env.yaml for all environment dependencies.
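Before moving on, it can help to confirm that the GPU build of PyTorch and the torch-geometric stack were installed consistently. A minimal sanity check (not part of the original setup) is:

python -c "import torch; print(torch.__version__, torch.version.cuda, torch.cuda.is_available())"
python -c "import torch_geometric, torch_scatter, torch_sparse; print(torch_geometric.__version__)"

Both commands should run without import or ABI errors, and the first should report that CUDA is available.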

2. Download Data

We use preprocessed data from the QA-GNN repository, which can also be downloaded from my Baidu Netdisk.

After downloading, the data file structure should look like this:

.
├── data/
    ├── cpnet/                 (preprocessed ConceptNet)
    ├── csqa/
        ├── train_rand_split.jsonl
        ├── dev_rand_split.jsonl
        ├── test_rand_split_no_answers.jsonl
        ├── statement/             (converted statements)
        ├── grounded/              (grounded entities)
        ├── graphs/                (extracted subgraphs)
        ├── ...
    ├── obqa/
    ├── medqa_usmle/
    └── ddb/
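After extracting the archive, a quick way to confirm the layout matches the tree above (a small sketch, assuming the data was unpacked into ./data) is:

for d in data/cpnet data/csqa data/obqa data/medqa_usmle data/ddb; do
  [ -d "$d" ] && echo "ok      $d" || echo "missing $d"
done
head -n 1 data/csqa/train_rand_split.jsonl   # one raw CommonsenseQA example per line (JSON)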

3. Training JointLK

(Assuming the Slurm job scheduling system)

For CommonsenseQA, run

sbatch sbatch_run_jointlk__csqa.sh

For OpenBookQA, run

sbatch sbatch_run_jointlk__obqa.sh
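If no Slurm scheduler is available, the same scripts can usually be launched directly with bash, since the #SBATCH directives are plain shell comments; this is an assumption about the local setup, so check the GPU and path variables inside the scripts first:

bash sbatch_run_jointlk__csqa.sh
bash sbatch_run_jointlk__obqa.sh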

4. Pretrained model checkpoints

CommonsenseQA

Trained model                     In-house Dev acc.   In-house Test acc.
RoBERTa-large + JointLK [link]    77.6                75.3
RoBERTa-large + JointLK [link]    78.4                74.2

OpenBookQA

Trained model                          Dev acc.   Test acc.
RoBERTa-large + JointLK [link]         68.8       70.4
AristoRoBERTa-large + JointLK [link]   79.2       85.6

5. Evaluating a pretrained model checkpoint

For CommonsenseQA, run

sbatch sbatch_run_jointlk__csqa_test.sh

For OpenBookQA, run

sbatch sbatch_run_jointlk__obqa_test.sh

6. Acknowledgment

This repo is built upon the following work:

QA-GNN: Question Answering using Language Models and Knowledge Graphs
https://github.com/michiyasunaga/qagnn

Many thanks to the authors and developers!

Others

We noticed that the QA-GNN repository added test results on the MedQA dataset. To make it easier for future researchers to compare models, we also report the performance of JointLK on MedQA.

For training on MedQA, run

sbatch sbatch_run_jointlk__medqa_usmle.sh

For testing on MedQA, run

sbatch sbatch_run_jointlk__medqa_usmle_test.sh

A pretrained model checkpoint

Trained model                   Dev acc.   Test acc.
SapBERT-base + JointLK [link]   38.0       39.8
