
The prompt engineering leaderboard is cut off in half... #51

Closed
zhimin-z opened this issue Feb 21, 2024 · 5 comments


zhimin-z (Contributor) commented Feb 21, 2024

[Screenshot: the second leaderboard table, truncated]
The second leaderboard seems to be part of the first one (since it has no name at the top like the first one does), but it is cut off mysteriously...
Check https://llm-eval.github.io/pages/leaderboard/pe.html

zhimin-z (Contributor, Author) commented Feb 21, 2024

It seems the unified leaderboard would look like the one below:
[Screenshot: a single merged leaderboard table]
However, many evaluation results are missing from the table in this case...
@madhavMathur @jindongwang @msftgits @dnfclas

zhimin-z (Contributor, Author) commented Feb 21, 2024

Additionally, would you consider merging the initial cells in the first row into a single cell? Currently, the segmented display detracts from the overall readability and aesthetic appeal.
[Screenshot: leaderboard header row with repeated, unmerged cells]
I created a PR accordingly and hope you can take a look :)
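
For illustration, the merge amounts to a colspan in the rendered HTML header. Here is a minimal sketch of the idea (the dataset names below are hypothetical placeholders, and the site's pages may well be generated differently): pandas renders a MultiIndex column header with exactly this kind of merged cell.

```python
import pandas as pd

# Hypothetical leaderboard fragment; column/dataset names are placeholders,
# not the actual leaderboard's. Scores are left empty on purpose.
columns = pd.MultiIndex.from_tuples([
    ("Method", ""),
    ("Commonsense Reasoning", "Dataset A"),
    ("Commonsense Reasoning", "Dataset B"),
])
df = pd.DataFrame(
    [["CoT", None, None], ["Least to Most", None, None]],
    columns=columns,
)

# to_html() emits the top header row with <th colspan="2">Commonsense
# Reasoning</th> -- i.e. the merged cell suggested above.
print(df.to_html())
```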

icecream-and-tea (Contributor) commented

Thank you for your attention and advice!
The first and second issues stem from a common cause: different prompt engineering methods are applicable to different tasks.
The 'Least to Most' method aims to help the LLM solve complex problems through decomposition and subproblem solving.
[Screenshot: diagram of the Least-to-Most decomposition pipeline]
So compared with the methods in the first table, 'Least to Most' is more applicable to math and symbolic tasks than to commonsense reasoning, which led this method to use different datasets from those in the first table. That is why we presented the results in two tables.
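
For readers unfamiliar with the method, the two-stage loop roughly looks like the sketch below. This is only an illustration: `complete` is a hypothetical wrapper around any LLM completion API, not code from this repository.

```python
from typing import Callable

def least_to_most(question: str, complete: Callable[[str], str]) -> str:
    """Sketch of two-stage Least-to-Most prompting; `complete` wraps any LLM API."""
    # Stage 1: decomposition -- ask the model to break the problem into
    # simpler subquestions, one per line, easiest first.
    decomposition = complete(
        "Break the following problem into a numbered list of simpler "
        f"subquestions, easiest first:\n{question}"
    )
    subquestions = [s for s in decomposition.splitlines() if s.strip()]

    # Stage 2: sequential subproblem solving -- answer each subquestion,
    # feeding earlier answers back into the context so later (harder)
    # subproblems can build on them.
    context = f"Problem: {question}\n"
    for sub in subquestions:
        answer = complete(f"{context}\nQuestion: {sub}\nAnswer:")
        context += f"\nQ: {sub}\nA: {answer}\n"

    # Final answer, conditioned on all of the solved subproblems.
    return complete(
        f"{context}\nTherefore, the final answer to the original problem is:"
    )
```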

zhimin-z (Contributor, Author) commented Feb 21, 2024


OK, that makes more sense. As I said, the current table looks split, and I hope there could be a better display for demonstration purposes. Would you mind taking a look at my PR llm-eval/llm-eval.github.io#4?

jindongwang (Collaborator) commented

@zhimin-z Merged your PR
