initial_prompt influences transcription result wrongly #10

chesha1 · 2023-02-04T10:15:00Z

When using an initial prompt, the result of batch-whisper differs from that of official whisper, and it also loses the first few words of the result.
That is:

1. whisper without initial prompt:
   result["text"] = 'abcdefg'
   result["segments"][0]["text"] = 'abcdefg'
2. whisper with initial prompt:
   result["text"] = 'abcdefg'
   result["segments"][0]["text"] = 'abc, defg'
3. batch-whisper without initial prompt:
   same as case 1
4. batch-whisper with initial prompt:
   result["text"] = 'bcdefg'
   result["segments"][0]["text"] = 'abcdefg'
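
To make the last case easier to check automatically, here is a minimal sketch of a helper that compares `result["text"]` against the concatenated segment texts. The name `dropped_prefix` is hypothetical and not part of whisper or batch-whisper; it only assumes the result dictionary has the `"text"` and `"segments"` keys shown in the cases listed here, and the sample values are the placeholders from those cases, not real model output.

```python
def dropped_prefix(result):
    """Return the leading text found in the segments but missing from result["text"].

    Returns "" when both agree, and None when they differ in some other way
    (for example, only in punctuation).
    """
    joined = "".join(seg["text"] for seg in result["segments"]).strip()
    text = result["text"].strip()
    if joined == text:
        return ""
    # The top-level text is a proper suffix of the segment text: the start was lost.
    if text and joined.endswith(text):
        return joined[: len(joined) - len(text)]
    return None


# Placeholder values copied from the batch-whisper case with an initial prompt.
batch_with_prompt = {"text": "bcdefg", "segments": [{"text": "abcdefg"}]}
print(dropped_prefix(batch_with_prompt))  # prints "a"
```

A non-empty return value means the start of the transcription was dropped from `result["text"]`, which is the behaviour reported for batch-whisper with an initial prompt.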

Part of the code is below:

    # Load one batch of audio files, resampled to 16 kHz.
    audio_list = [load_audio(path + file_list[idx + k], sr=16000)
                  for k in range(batch_size)]
    # One initial prompt per audio; it means "The following are sentences in Mandarin."
    prompt = ["以下是普通话的句子。"] * batch_size
    result = model.transcribe(audio_list, language='zh', task='transcribe', fp16=False,
                              initial_prompt=prompt)

chesha1 closed this as not planned on Aug 21, 2024
