Furigana Annotators

This repository evaluates several models on furigana annotation of book titles from the National Diet Library (https://huggingface.co/datasets/AlienKevin/ndlbib-furigana).

The evaluation set is 1,000 titles randomly sampled from the dataset.
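A fixed-seed sample keeps the evaluation set identical across runs and models. The sketch below shows one way to draw such a sample using only the standard library; the function name `sample_titles`, the seed value, and the in-memory list of rows are assumptions for illustration and are not taken from gen_dataset.py.

```python
import random


def sample_titles(rows, k=1000, seed=0):
    """Return k rows drawn without replacement, reproducibly for a given seed.

    `rows` is assumed to be a list already loaded in memory (hypothetical);
    the seed value is arbitrary and only needs to stay fixed between runs.
    """
    if k > len(rows):
        raise ValueError(f"cannot sample {k} rows from {len(rows)}")
    rng = random.Random(seed)  # private generator, leaves global state alone
    return rng.sample(rows, k)
```

Using a private `random.Random` instance rather than the module-level functions means other code that touches the global generator cannot change which rows are selected.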

Model	Character Error Rate (CER)
gpt-4o-2024-05-13	2.17%
pykakasi 2.2.1	3.71%
deepseek-v2-chat	10.95%
qwen2-7b-instruct	55.97%

Evaluation

Create a models.json file with the base URL and API key for each model:

{
    "deepseek-chat": {
        "base_url": "https://api.deepseek.com",
        "api_key": "sk-xxxx"
    },
    "gpt-4o": {
        "base_url": "https://api.openai.com/v1",
        "api_key": "sk-xxxx"
    }
}
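As a rough illustration of how a script could consume this file, the sketch below looks up one entry by model name and passes it to an OpenAI-compatible client. The helper names `load_model_config` and `make_client` are hypothetical, and run_llm.py may read the file differently.

```python
import json
from pathlib import Path


def load_model_config(name, path="models.json"):
    """Return the {"base_url", "api_key"} entry for one model name."""
    with Path(path).open(encoding="utf-8") as f:
        models = json.load(f)
    if name not in models:
        raise KeyError(f"{name!r} not found in {path}; available: {sorted(models)}")
    entry = models[name]
    missing = {"base_url", "api_key"} - entry.keys()
    if missing:
        raise ValueError(f"{name!r} is missing fields: {sorted(missing)}")
    return entry


def make_client(name, path="models.json"):
    """Build an OpenAI-compatible client for the named model (needs `openai`)."""
    from openai import OpenAI  # imported lazily so the loader works without it

    cfg = load_model_config(name, path)
    return OpenAI(base_url=cfg["base_url"], api_key=cfg["api_key"])
```

Because models.json holds secrets, it is worth keeping it out of version control, for example by listing it in .gitignore.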

Install dependencies:

conda install openai polars tqdm pykakasi pyarrow

Run models to get furigana predictions:

python run_kakasi.py
python run_llm.py deepseek-chat
python run_llm.py gpt-4o

Evaluate the predictions. The CER is printed to the console:

python eval.py kakasi
python eval.py deepseek-chat
python eval.py gpt-4o
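For reference, character error rate is conventionally the character-level edit distance between prediction and reference, divided by the reference length. The sketch below is a minimal pure-Python version; the names `edit_distance` and `cer` are illustrative, and the corpus-level aggregation (total edits over total reference characters) is an assumption, since eval.py may average per title instead.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings, computed one row at a time."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,              # delete a reference character
                curr[j - 1] + 1,          # insert a hypothesis character
                prev[j - 1] + (r != h),   # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def cer(references, predictions):
    """Corpus-level CER: total edits divided by total reference characters."""
    if len(references) != len(predictions):
        raise ValueError("references and predictions differ in length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, p) for r, p in zip(references, predictions))
    return total_edits / total_chars


# One wrong kana out of five reference characters.
print(cer(["とうきょう"], ["とうきよう"]))  # 0.2
```

Corpus-level aggregation weights long titles more heavily than short ones; a per-title average would treat every title equally and can give a noticeably different number on the same predictions.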
