Skip to content

Commit

Permalink
upd: freadme
Browse files Browse the repository at this point in the history
  • Loading branch information
jindongwang committed Mar 13, 2024
1 parent e856221 commit 0403a12
Showing 1 changed file with 25 additions and 21 deletions.
46 changes: 25 additions & 21 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -174,28 +174,32 @@ PromptBench currently supports different datasets, models, prompt engineering me

### Datasets

- GLUE: SST-2, CoLA, QQP, MRPC, MNLI, QNLI, RTE, WNLI
- MMLU
- BIG-Bench Hard (Bool logic, valid parentheses, date...)
- Math
- GSM8K
- SQuAD V2
- IWSLT 2017
- UN Multi
- CSQA (CommonSense QA)
- Numersense
- QASC
- Last Letter Concatenate
- VQAv2
- NoCaps
- MMMU
- MathVista
- AI2D
- ChartQA
- ScienceQA
- Language datasets:
- GLUE: SST-2, CoLA, QQP, MRPC, MNLI, QNLI, RTE, WNLI
- MMLU
- BIG-Bench Hard (Bool logic, valid parentheses, date...)
- Math
- GSM8K
- SQuAD V2
- IWSLT 2017
- UN Multi
- CSQA (CommonSense QA)
- Numersense
- QASC
- Last Letter Concatenate
- Multi-modal datasets:
- VQAv2
- NoCaps
- MMMU
- MathVista
- AI2D
- ChartQA
- ScienceQA

### Models

Language models:

- Open-source models:
- google/flan-t5-large
- databricks/dolly-v1-6b
Expand All @@ -211,7 +215,7 @@ PromptBench currently supports different datasets, models, prompt engineering me
- GPT-4
- Gemini Pro

### Models (Multi-Modal)
Multi-modal models:

- Open-source models:
- BLIP2
Expand All @@ -220,7 +224,7 @@ PromptBench currently supports different datasets, models, prompt engineering me
- InternLM-XComposer2-VL
- Proprietary models
- GPT-4v
- GeminiProVision
- Gemini Pro Vision
- Qwen-VL-Max, Qwen-VL-Plus

### Prompt Engineering
Expand Down

0 comments on commit 0403a12

Please sign in to comment.