LLM_metaphor_detection

This repository contains the materials for the paper "Literary Metaphor Detection with LLM Fine-Tuning and Few-Shot Learning".

Project description

The paper explores the task of metaphor detection using Large Language Models (LLMs) from a Digital Humanities perspective, focusing on the detection of literary metaphors. For this task, the Transformer-based model DisitlBERT and the Sentence Transformer-based model all-MiniLM-L6-v2 (using the SetFit framework) are fine-tuned on four datasets. The results suggest significant performance improvements for fine-tuning over baseline methods and for SetFit over traditional fine-tuning. However, since this improvement was not observed for all datasets, it is challenging to formulate generalised statements on the superiority of fine-tuning over traditional machine learning approaches for the MD task.

Used datasets

The datasets for this project can either be found in the folder raw_datasets or downloaded from the links below:

PoFo dataset by Kesarwani et al.
TroFi dataset by Birke and Sarkar.
MOH dataset by Mohammad et al.

Code

The code folder contains notebooks for preprocessing, fine-tuning Transformers and SetFit and evaluation. The notebooks provide detailed explanations and instructions on the inputs and outputs for this project.

Fine-tuned models

The resulting fine-tuned models are publicly available on Zenodo.

References

Birke, Julia, and Anoop Sarkar. "A clustering approach for nearly unsupervised recognition of nonliteral language." 11th Conference of the European chapter of the association for computational linguistics, 2006, p. 329-336, aclanthology.org/E06-1042.

Kesarwani, Vaibhav, et al. “Metaphor Detection in a Poetry Corpus.” Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, edited by Beatrice Alex et al. Association for Computational Linguistics, 2017, pp. 1–9, https://doi.org/10.18653/v1/W17-2201.

Mohammad, Saif, et al. “Metaphor as a Medium for Emotion: An Empirical Study.” Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics, 2016, pp. 23–33, https://doi.org/10.18653/v1/S16-2003.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
code		code
preprocessed_datasets		preprocessed_datasets
raw_datasets		raw_datasets
setfit_training_output		setfit_training_output
transformers_training_output		transformers_training_output
visualisations		visualisations
LICENSE		LICENSE
Literary_Metaphor_Detection_Spielberg.pdf		Literary_Metaphor_Detection_Spielberg.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM_metaphor_detection

Project description

Used datasets

Code

Fine-tuned models

References

About

Releases

Packages

Languages

License

ma-spie/LLM_metaphor_detection

Folders and files

Latest commit

History

Repository files navigation

LLM_metaphor_detection

Project description

Used datasets

Code

Fine-tuned models

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages