SciFact Pre-training

This work further investigates whether pre-training on related scientific 'fact verification' tasks might improve performance for the Evidence Inference BERT-to-BERT pipeline model. Specifically, we use the SciFact claim verification corpus for such pre-training.

See README.evidence_inference.md for the original Evidence Inference README.

Colab Notebooks

The following experiments were run using Colab Pro.

PICO Extraction with bi-LSTM-CRF
PICO Extraction with SciBERT
SciFact Claim Prediction Analysis and Preprocessing
BERT Pipeline Hyperparameter Tuning on SciFact
BERT Pipeline Evidence Inference Abstract-Only
BERT Pipeline Evidence Inference with SciFact Pretraining

Experiment Design

The following steps were performed to evaluate the effectiveness of SciFact pre-training for the Evidence Inference BERT-to-BERT pipeline model:

Extract and preprocess PICO spans from SciFact claims.
Adapt SciFact data into the format expected by the BERT pipeline and define corresponding samplers.
Train the pipeline on SciFact and save the model weights. We optionally converted RoBERTa to SciBERT due to memory constraints.
Train the pipeline on Evidence Inference using the pre-trained weights.

Note that this experiment does not change the model architecture and instead forces the SciFact dataset into the same format as the Evidence Inference data via PICO extraction of SciFact claims. The following options may also be considered:

Add module to model that learns some representation for prompts and claims first before feeding it to the model.
Apply linearization to both the claims and the prompts.

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
annotations		annotations
evidence_inference		evidence_inference
outputs		outputs
params		params
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.annotation_process.md		README.annotation_process.md
README.evidence_inference.md		README.evidence_inference.md
README.md		README.md
SETUP.md		SETUP.md
evidence_inference_intro.ipynb		evidence_inference_intro.ipynb
requirements.txt		requirements.txt
verify_span_quality.py		verify_span_quality.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SciFact Pre-training

Colab Notebooks

Experiment Design

About

Releases

Packages

Languages

License

wangah/evidence-inference

Folders and files

Latest commit

History

Repository files navigation

SciFact Pre-training

Colab Notebooks

Experiment Design

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages