Skip to content

Commit

Permalink
updating README
Browse files Browse the repository at this point in the history
  • Loading branch information
mmazeika committed Feb 7, 2024
1 parent 7193689 commit 40b0a4a
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -60,7 +60,7 @@ Automated red teaming holds substantial promise for uncovering and mitigating th

There are two primary ways to use HarmBench: (1) evaluating red teaming methods against a set of LLMs, and (2) evaluating LLMs against a set of red teaming methods. These use cases are both supported by the same evaluation pipeline, illustrated above. The process for running this evaluation pipeline is described below in the Quick Start section.

HarmBench has out-of-the-box support for transformers-compatible LLMs, numerous closed-source APIs, and several multimodal models. To add new red teaming methods and models that aren't compatible with transformers, it may help to learn more about how the codebase is structured. We provide a set of docs and *(coming soon)* tutorials to help with more advanced usage.
HarmBench has out-of-the-box support for transformers-compatible LLMs, numerous closed-source APIs, and several multimodal models. To add new red teaming methods and models that aren't compatible with transformers, it may help to learn more about how the codebase is structured. We provide a set of docs to help with more advanced usage.


## ☕ Quick Start ☕
Expand Down Expand Up @@ -249,6 +249,7 @@ We plan to update to HarmBench over the next few weeks with the following:
- Support for system prompt attacks
- Experiment code from the paper, including adversarial training code
- Validation classifiers
- Tutorials for more in-depth usage

## 🙏 Acknowledgements and Citation 🙏

Expand Down

0 comments on commit 40b0a4a

Please sign in to comment.