- ictnlp/TruthX (★ 86, Python, updated Mar 26, 2024) — Code for the ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space". Topics: safety, llama, representation, language-model, mistral, explainable-ai, hallucination, baichuan, hallucinations, gpt-4, truthfulness, llm, llms, chatgpt, chatglm, llm-inference, llama2, llama3
- OpenMOSS/Say-I-Dont-Know (★ 56, Python, updated Feb 5, 2024) — [ICML 2024] Can AI Assistants Know What They Don't Know? Topics: alignment, truthfulness, large-language-models
- thu-ml/MMTrustEval (★ 48, Python, updated Jul 22, 2024) — A toolbox for benchmarking the trustworthiness of multimodal large language models (MultiTrust). Topics: benchmark, privacy, toolbox, safety, multi-modal, fairness, robustness, claude, gpt-4, trustworthy-ai, truthfulness, mllm
- alexisrozhkov/llm-calib (★ 0, Python, updated Jun 9, 2024) — Improving LLM truthfulness via reporting confidence. Topics: alignment, truthfulness, llm, rlhf