Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Similar to Whisper PR#228, this adds -ocsv, aka --output-csv, writing CSV file containing millisecond timestamps #340

Merged
merged 1 commit into from
Dec 29, 2022

Conversation

NielsMayer
Copy link
Contributor

For efficiency in storage, parsing and accuracy from representing timestamps in milliseconds (rather than floating-point numbers), this PR, like openai/whisper#228 adds feature to output CSV file containing lines formatted like:
<startTime-in-integer-milliseconds>, <endTime-in-integer-milliseconds>, "<transcript-line-including-commas>"

The CSV file is created when option -ocsv, or --output-csv is provided to the 'main' command-line.

Similar to the output file naming conventions for --output_vtt or --output_srt, the resulting CSV file for an input file named <mediaBaseName>.wav is named <mediaBaseName>.wav.csv

…feature to examples/main, which outputs a CSV file containing lines formatted as follows <startTime-in-integer-milliseconds>, <endTime-in-integer-milliseconds>, "<transcript-line-including-commas>".
@NielsMayer
Copy link
Contributor Author

One of the big advantages of whisper.cpp is that when you're processing a 31 HOUR video file, it doesn't just die with "KILLED" and no other debug info, like whisper does on the exact same 16k .wav file (alongside medium model, 16 threads, 4.45Ghz 8 core AMD 4750g CPU):

image

I also like how whisper.cpp correctly transcribes a C++ header file as "child.h" and not, say "child dot h" . (original source video: https://www.youtube.com/watch?v=8jLOx1hD3_o ) :-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants