- Sanchay is a Sanskrit word that means
collection
oraccumulation
. - We think that It is a fitting name for this project because it captures the
essence
of what it does - it takes a video and collects its key elements (transcription, subtitles, and video chapters
) in an organized and easily accessible manner. - It's like creating a collection of information from a single source, making it more useful and convenient to work with. The name Sanchaya also has a nice ring to it.
- Sanchay-AI internally uses
Whisper
andOpenAI
to achieve this feat.
Pre-requisite: Python
pip install -U openai-whisper
brew install ffmpeg
pip install setuptools-rust
pip install --upgrade tiktoken
- Ensure that your video is stored in
videos directory
. - Edit Env in
./run.sh
, most importantly, theINPUT_FILE_PATH
property & theOPENAI_API_KEY
property
chmod +x run.sh
./run.sh
Key | Description |
---|---|
INPUT_FILE_PATH | Video file path |
OPENAI_API_KEY | https://platform.openai.com/api-keys |
OPENAI_MODEL | https://platform.openai.com/docs/models |
MAX_TOKENS | This is used to split the subtitles into chunks of MAX_TOKENS. MAX_TOKENS should be 4/5th of your OPENAI_MODEL's max tokens |
OUTPUT_TYPE | options [txt, json, vtt, srt, tsv, all], logic supports vtt files only for now |
OUTPUT_DIR | Directory where output will be stored |