Skip to content

borgr/GEC_UD_divergences

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GEC_UD_divergences

Create a matrix

See Using_m2, currently the file creates both matrices and m2 in the main function, comment out if unneeded (matrix is get_confusion_matrix + extract_matrices) python using_m2/GEC_UD_divergences_m2.py source.conllu reference.conllu m2file.m2

Create an m2 file

See Using_m2, currently the file creates both matrices and m2 in the main function, comment out if unneeded (m2 is in syntactic_m2 function) python using_m2/GEC_UD_divergences_m2.py source.conllu reference.conllu m2file.m2

Other implementation:

There is also an end to end implementation for English with or without combining with ERRANT (adaptations to support other languages or morphology when POS is not changing are welcome)

Utility: convert m2 to source and reference text

python using_m2 preprocessing.py m2file.m2

Cite

If you have found this work useful please cite the CoNLL paper.

    title = "Classifying Syntactic Errors in Learner Language",
    author = "Choshen, Leshem  and
      Nikolaev, Dmitry  and
      Berzak, Yevgeni  and
      Abend, Omri",
    booktitle = "Proceedings of the 24th Conference on Computational Natural Language Learning",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.conll-1.7",
    pages = "97--107",
    }

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published