Bias and Fairness in Large Language Models: A Survey

Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, and Nesreen K. Ahmed

To enable easy use of bias evaluation datasets, we compile publicly-available ones and provide access here. We provide links to the original data sources below. We do not modify any of the datasets, but do remove unrelated material from the original repositories. Please refer to the original works for more detailed documentation.

Dataset	Link
BBQ	https://github.com/nyu-mll/BBQ
BEC-Pro	https://github.com/marionbartl/gender-bias-BERT
Bias NLI	https://github.com/sunipa/On-Measuring-and-Mitigating-Biased-Inferences-of-Word-Embeddings
BOLD	https://github.com/amazon-science/bold
BUG	https://github.com/SLAB-NLP/BUG
CrowS-Pairs	https://github.com/nyu-mll/crows-pairs/
Equity Evaluation Corpus	https://saifmohammad.com/WebPages/Biases-SA.html
GAP	https://github.com/google-research-datasets/gap-coreference
Grep-BiasIR	https://github.com/KlaraKrieg/GrepBiasIR
HolisticBias	https://github.com/facebookresearch/ResponsibleNLP
HONEST	https://github.com/MilaNLProc/honest
PANDA	https://github.com/facebookresearch/ResponsibleNLP
RealToxicityPrompts	https://toxicdegeneration.allenai.org
RedditBias	https://github.com/umanlp/RedditBias
StereoSet	https://github.com/McGill-NLP/bias-bench, https://github.com/moinnadeem/stereoset
TrustGPT	https://github.com/HowieHwong/TrustGPT
UnQover	https://github.com/allenai/unqover
WinoBias	https://github.com/uclanlp/corefBias
WinoBias+	https://github.com/vnmssnhv/NeuTralRewriter
WinoGender	https://github.com/rudinger/winogender-schemas
WinoQueer	https://github.com/katyfelkner/winoqueer

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bias and Fairness in Large Language Models: A Survey

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
BBQ		BBQ
BEC-Pro		BEC-Pro
BOLD		BOLD
BUG		BUG
Bias-NLI		Bias-NLI
CrowS-Pairs		CrowS-Pairs
Equity-Evaluation-Corpus		Equity-Evaluation-Corpus
GAP		GAP
Grep-BiasIR		Grep-BiasIR
HONEST		HONEST
HolisticBias		HolisticBias
PANDA		PANDA
RealToxicityPrompts		RealToxicityPrompts
RedditBias		RedditBias
StereoSet		StereoSet
TrustGPT		TrustGPT
UnQover		UnQover
WinoBias+		WinoBias+
WinoBias		WinoBias
WinoQueer		WinoQueer
Winogender		Winogender
.gitignore		.gitignore
README.md		README.md

Franck-Dernoncourt/Fair-LLM-Benchmark

Folders and files

Latest commit

History

Repository files navigation

Bias and Fairness in Large Language Models: A Survey

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages