DPAM

Code for "Modeling Dynamic Pairwise Attention for Crime Classification over Legal Articles", SIGIR 2018 [pdf]

Bibtex

@inproceedings{Wang:2018:MDP:3209978.3210057,
 author = {Wang, Pengfei and Yang, Ze and Niu, Shuzi and Zhang, Yongfeng and Zhang, Lei and Niu, ShaoZhang},
 title = {Modeling Dynamic Pairwise Attention for Crime Classification over Legal Articles},
 booktitle = {The 41st International ACM SIGIR Conference on Research \& Development in Information Retrieval},
 series = {SIGIR '18},
 year = {2018},
 isbn = {978-1-4503-5657-2},
 location = {Ann Arbor, MI, USA},
 pages = {485--494},
 numpages = {10},
 url = {https://doi.acm.org/10.1145/3209978.3210057},
 doi = {10.1145/3209978.3210057},
 acmid = {3210057},
 publisher = {ACM},
 address = {New York, NY, USA},
 keywords = {dynamic threshold predictor, multi-label classification, pairwise attention model},
}

Description

In the juridical field, judges usually need to consult several relevant cases to determine which specific articles the evidence violates, a task that is time-consuming and requires extensive professional knowledge. We treat the evidence as documents and the articles as labels, so the conviction process can be cast as a multi-label classification problem.

The DPAM model predicts articles from the evidence. It adopts multi-task learning to learn the multi-label classifier and the threshold predictor jointly.
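
To illustrate why a threshold predictor is useful in multi-label classification, here is a minimal sketch. It is not the DPAM implementation: the function name `predict_labels`, the scores, and the threshold values are all made up for illustration, and it assumes the threshold predictor yields one cut-off per evidence document.

```python
from typing import List, Sequence


def predict_labels(scores: Sequence[float], threshold: float) -> List[int]:
    """Return the indices of labels whose score exceeds the threshold."""
    return [i for i, s in enumerate(scores) if s > threshold]


# Hypothetical classifier scores over four articles for two evidence documents.
scores_a = [0.9, 0.2, 0.6, 0.1]
scores_b = [0.4, 0.35, 0.1, 0.05]

# A fixed cut-off of 0.5 selects no article at all for the second document.
print(predict_labels(scores_b, threshold=0.5))  # []

# Per-document thresholds (hard-coded here as an assumption; a learned
# predictor would supply them) adapt the cut-off to each score distribution.
print(predict_labels(scores_a, threshold=0.5))  # [0, 2]
print(predict_labels(scores_b, threshold=0.3))  # [0, 1]
```

Learning the classifier and the threshold jointly means the cut-off is fitted together with the scores rather than tuned by hand afterwards.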

Getting Started

Clone the repository.
Download the dataset.
Move the dataset files into the code directory.
Run the script: run.sh

Working with the code

To work with DPAM, simply run the script run.sh. You can also modify the parameters in DPAM_train.py and then run: python DPAM_train.py

Download

The dataset contains 4 files: one_hot_vocab_70.txt, src_rule_data.txt, src_train_dev_data.txt, and stop_words.txt.

Download URL: https://drive.google.com/open?id=1TCcTzte2cQ2wxWw-kXsZdUApCsNejdn8

one_hot_vocab_70.txt

This file contains the mapping from the top 70 articles (labels) to their one-hot representations.
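
As a rough illustration of what such a mapping is used for, the sketch below turns a set of article labels into a 0/1 target vector. The on-disk layout of one_hot_vocab_70.txt is not described here, so the vocabulary is built in memory; the helper names `build_vocab` and `encode` and the article numbers are hypothetical.

```python
from typing import Dict, Iterable, List


def build_vocab(articles: Iterable[str]) -> Dict[str, int]:
    """Assign each distinct article a vector position, in first-seen order."""
    vocab: Dict[str, int] = {}
    for article in articles:
        vocab.setdefault(article, len(vocab))
    return vocab


def encode(labels: Iterable[str], vocab: Dict[str, int]) -> List[int]:
    """Set a 1 at the position of every known label; unknown labels are skipped."""
    vector = [0] * len(vocab)
    for label in labels:
        if label in vocab:
            vector[vocab[label]] = 1
    return vector


# Made-up article numbers, used only to show the shape of the output.
vocab = build_vocab(["264", "234", "133"])
print(encode(["234"], vocab))         # [0, 1, 0]
print(encode(["264", "133"], vocab))  # [1, 0, 1]
```

Because one piece of evidence may violate several articles, the target vector can contain more than one 1.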

src_rule_data.txt

This file has two columns: the first is the article number and the second is the description of the article (label).

src_train_dev_data.txt

This file contains 17160 samples, the source data for the train and dev sets. It has two columns: the first is the evidence description and the second is the labels (articles).
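
A two-column line of this kind could be parsed roughly as follows. The delimiters are assumptions, since the file format is not specified here: the sketch guesses a tab between the columns and a comma between multiple labels, and `parse_sample` is a hypothetical helper, not a function from this repository.

```python
from typing import List, Tuple


def parse_sample(
    line: str, col_sep: str = "\t", label_sep: str = ","
) -> Tuple[str, List[str]]:
    """Split one line into (evidence text, list of article labels).

    col_sep and label_sep are assumed defaults; adjust them to the real file.
    """
    evidence, labels = line.rstrip("\n").split(col_sep, 1)
    return evidence, [label for label in labels.split(label_sep) if label]


# Made-up sample line, not taken from the dataset.
evidence, labels = parse_sample("the defendant took a bicycle\t264,234\n")
print(evidence)  # the defendant took a bicycle
print(labels)    # ['264', '234']
```

If the real file uses other separators, only the two default arguments need to change.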

stop_words.txt

This file contains stop words, which are ignored in our train and dev samples.
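
Ignoring stop words amounts to filtering them out of each tokenized sample. The sketch below shows the idea on made-up tokens; it assumes the text is already tokenized and the stop words are already loaded into a set, and `remove_stop_words` is a hypothetical helper rather than the repository's own preprocessing.

```python
from typing import Iterable, List, Set


def remove_stop_words(tokens: Iterable[str], stop_words: Set[str]) -> List[str]:
    """Keep only the tokens that are not in the stop-word set, preserving order."""
    return [token for token in tokens if token not in stop_words]


# Illustrative stop words and tokens, not taken from stop_words.txt.
stop_words = {"the", "a", "of"}
tokens = ["the", "defendant", "took", "a", "bicycle"]
print(remove_stop_words(tokens, stop_words))  # ['defendant', 'took', 'bicycle']
```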

