This repository contains a dataset for hate speech detection on social media platforms.

intelligence-csd-auth-gr/Ethos-Hate-Speech-Dataset

ETHOS Hate Speech Dataset

ETHOS: multi-labEl haTe speecH detectiOn dataSet. This repository contains a dataset for hate speech detection on social media platforms, called Ethos. There are two variations of the dataset:

  • Ethos_Dataset_Binary.csv contains 998 comments, each with a label indicating the presence or absence of hate speech. 565 of them do not contain hate speech, while the remaining 433 do.
  • Ethos_Dataset_Multi_Label.csv contains 8 labels for the 433 comments with hate speech content. These labels are violence (whether the comment incites violence (1) or not (0)), directed_vs_general (whether it is directed at a person (1) or a group (0)), and 6 labels for the category of hate speech: gender, race, national_origin, disability, religion and sexual_orientation.
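
As a rough illustration of how the two files might be summarized, here is a minimal Python sketch using only the standard library. The helper names (`read_rows`, `split_binary`, `label_totals`), the `label` column name, the comma delimiter, and the 0.5 threshold are all assumptions made for this example; the README does not state the CSV header names, delimiter, or label encoding, so check the files before relying on any of them. The sample rows are placeholders, not real dataset entries.

```python
import csv
import io


def read_rows(source, delimiter=","):
    """Parse CSV text from a file-like object into a list of dicts."""
    return list(csv.DictReader(source, delimiter=delimiter))


def split_binary(rows, label_column, threshold=0.5):
    """Return (hate, no_hate) counts; a row counts as hate at or above the threshold."""
    hate = sum(1 for row in rows if float(row[label_column]) >= threshold)
    return hate, len(rows) - hate


def label_totals(rows, label_columns, threshold=0.5):
    """For each label column, count the rows whose value reaches the threshold."""
    return {
        name: sum(1 for row in rows if float(row[name]) >= threshold)
        for name in label_columns
    }


# Tiny in-memory stand-ins with placeholder text, NOT real dataset rows.
# Header names here are hypothetical and may differ from the actual files.
SAMPLE_BINARY = "comment,label\nplaceholder a,1\nplaceholder b,0\nplaceholder c,0\n"
SAMPLE_MULTI = (
    "comment,violence,directed_vs_general,gender,race\n"
    "placeholder a,1,0,1,0\n"
    "placeholder d,0,1,0,1\n"
)

binary_rows = read_rows(io.StringIO(SAMPLE_BINARY))
hate_count, clean_count = split_binary(binary_rows, "label")

multi_rows = read_rows(io.StringIO(SAMPLE_MULTI))
totals = label_totals(multi_rows, ["violence", "directed_vs_general", "gender", "race"])

print(hate_count, clean_count)
print(totals)
```

To run this against the real files, open them with `open("Ethos_Dataset_Binary.csv", newline="", encoding="utf-8")` and pass the file object to `read_rows`, adjusting `delimiter` and the column names to match the actual header. If the labels are encoded as described above, the binary split should come out as 433 comments with hate speech and 565 without.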

Ethos /ˈiːθɒs/ is a Greek word meaning “character” that is used to describe the guiding beliefs or ideals that characterize a community, nation, or ideology. The Greeks also used this word to refer to the power of music to influence emotions, behaviors, and even morals.

Please check our older dataset as well: https://intelligence.csd.auth.gr/topics/hate-speech-detection/

Reference

If you use this dataset in your research, please cite our paper: ETHOS: a multi-label hate speech detection dataset

@article{mollas_ethos_2022,
	title = {{ETHOS}: a multi-label hate speech detection dataset},
	issn = {2198-6053},
	url = {https://doi.org/10.1007/s40747-021-00608-2},
	doi = {10.1007/s40747-021-00608-2},
	journal = {Complex \& Intelligent Systems},
	author = {Mollas, Ioannis and Chrysopoulou, Zoe and Karlos, Stamatis and Tsoumakas, Grigorios},
	month = jan,
	year = {2022},
}

Contributors on Ethos

Name Email
Grigorios Tsoumakas [email protected]
Ioannis Mollas [email protected]
Zoe Chrysopoulou [email protected]
Stamatis Karlos [email protected]

License

GNU GPLv3
