whisper-transcriber-telegram-bot

A Python-based Whisper AI transcriber bot for Telegram.

About

This is a Whisper AI-based transcriber bot for Telegram, written in Python, that transcribes audio from media sources supported by yt-dlp. While it initially focused on YouTube, the bot now supports the broad range of sites that yt-dlp lists as supported. It runs OpenAI's Whisper model locally to process the audio and returns the transcription in multiple formats.

Features

Processes media URLs from a variety of sources supported by yt-dlp.
Downloads audio using yt-dlp from supported sites including but not limited to YouTube.
Uses a local model from the openai-whisper package for transcription.
Transcribes audio using OpenAI's Whisper model
- (see openai/whisper for more info)
Returns transcription in text, SRT, and VTT formats.
Handles concurrent transcription requests through a job queue.
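
To illustrate the output formats listed above, here is a minimal sketch of how timed segments could be rendered as SRT and VTT. It assumes segments shaped like the entries in the `segments` list returned by openai-whisper's `transcribe()` (dicts with `start`, `end`, and `text`). The function names are hypothetical and are not taken from this repository's source.

```python
def format_timestamp(seconds: float, decimal_marker: str) -> str:
    """Render seconds as HH:MM:SS<marker>mmm (SRT uses ',', VTT uses '.')."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{decimal_marker}{millis:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text: numbered cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"], ",")
        end = format_timestamp(seg["end"], ",")
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def segments_to_vtt(segments) -> str:
    """Build WebVTT text: a 'WEBVTT' header followed by unnumbered cues."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        start = format_timestamp(seg["start"], ".")
        end = format_timestamp(seg["end"], ".")
        lines.append(f"{start} --> {end}")
        lines.append(seg["text"].strip())
        lines.append("")
    return "\n".join(lines)
```

The two formats differ mainly in the header line, the cue numbering, and the decimal marker in timestamps, which is why a single timestamp helper can serve both.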

Installation

To set up the Whisper Transcriber Telegram Bot, follow these steps:

Clone the repository:

```
git clone https://github.com/FlyingFathead/whisper-transcriber-telegram-bot.git
cd whisper-transcriber-telegram-bot
```

Install the required Python packages:
```
pip install -r requirements.txt
```
Set up your Telegram bot token either in config/bot_token.txt or as an environment variable TELEGRAM_BOT_TOKEN.
Run the bot:
```
python src/main.py
```
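
For the token step above, the following is a rough sketch of how a bot might look up its token from the two documented locations. The order of precedence (environment variable first, then the file) is an assumption, and `load_bot_token` is a hypothetical helper name; the actual logic in `src/main.py` may differ.

```python
import os
from pathlib import Path


def load_bot_token(token_file: str = "config/bot_token.txt",
                   env_var: str = "TELEGRAM_BOT_TOKEN") -> str:
    """Return the bot token from the environment or from a token file.

    Assumption: the environment variable takes precedence over the file.
    """
    token = os.environ.get(env_var, "").strip()
    if token:
        return token

    path = Path(token_file)
    if path.is_file():
        # Strip whitespace so a trailing newline in the file is harmless.
        token = path.read_text(encoding="utf-8").strip()
        if token:
            return token

    raise RuntimeError(
        f"No bot token found: set {env_var} or put the token in {token_file}"
    )
```

Failing early with a clear error keeps a missing token from surfacing later as an opaque connection failure.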

Usage

After launching the bot, you can interact with it via Telegram:

Send a media URL (for example, a YouTube video link) to the bot.
The bot will acknowledge the request and begin processing.
Once processing is complete, the bot will send the transcription files to you.
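
Behind the steps above, the processing amounts to downloading the audio and passing it to Whisper. The sketch below shows one way this could look using the `yt_dlp` and `whisper` Python packages; it is an illustration under stated assumptions, not the bot's actual code. The helper names, the `audio` output directory, the MP3 target format, and the `base` model default are all hypothetical choices. Audio extraction also requires ffmpeg to be installed.

```python
from pathlib import Path


def build_ydl_options(output_dir: str) -> dict:
    """Assemble yt-dlp options that fetch the best audio stream as MP3."""
    return {
        "format": "bestaudio/best",
        "outtmpl": str(Path(output_dir) / "%(id)s.%(ext)s"),
        "noplaylist": True,  # handle a single item, not a whole playlist
        "quiet": True,
        "postprocessors": [
            {"key": "FFmpegExtractAudio", "preferredcodec": "mp3"}
        ],
    }


def download_audio(url: str, output_dir: str = "audio") -> Path:
    """Download the audio track for a URL and return the MP3 path."""
    import yt_dlp  # imported lazily so the helpers above stay importable

    with yt_dlp.YoutubeDL(build_ydl_options(output_dir)) as ydl:
        info = ydl.extract_info(url, download=True)
    # The post-processor rewrites the extension to .mp3.
    return Path(output_dir) / f"{info['id']}.mp3"


def transcribe(audio_path: Path, model_name: str = "base") -> dict:
    """Run a locally loaded Whisper model on the audio file."""
    import whisper

    model = whisper.load_model(model_name)
    # The result dict includes "text" and a list of timed "segments".
    return model.transcribe(str(audio_path))
```

A caller would chain the two steps, for example `transcribe(download_audio(url))["text"]`, and then write the result out in the desired formats.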

Changes

v0.10 - /help & /about commands added for further assistance
- config.ini now has a list of supported models that can be changed as needed
v0.09 - users can now change the Whisper model with the /model command
v0.08 - auto-retry TG connection on start-up connection failure
- can be set in config.ini with RestartOnConnectionFailure
v0.07.7 - log output from whisper to logging
v0.07.6 - update interval for logging yt-dlp downloads now configurable from config.ini
v0.07.5 - 10-second interval update for yt-dlp logging
v0.07.4 - fixes for non-YouTube URLs
v0.07.2 - job queues fine-tuned to be more informative
v0.07.1 - job queues introduced
v0.07 - transcript queuing, more precise transcript time estimates
v0.06 - better handling of details for all video sources, transcription time estimates
v0.05 - universal video description parsing (platform-agnostic)
v0.04.1 - version number printouts and added utils
v0.04 - expanded support for various media sources via yt-dlp (see yt-dlp's list of supported sites)
v0.03 - better logging to console, Whisper model + keep audio y/n can now be set in config.ini
v0.02 - add video information to the transcript text file
- (see: config.ini => IncludeHeaderInTranscription = True)
v0.01 - initial commit

Contributing

Contributions are welcome! If you have suggestions for improvements or bug fixes, please open an issue or submit a pull request.

Credits

FlyingFathead - Project creator
ChaosWhisperer - Contributions to the Whisper integration and documentation
