Skip to content

Takes your video and generates video title, description, hashtags, transcription, subtitles and more.

Notifications You must be signed in to change notification settings

aadeshkulkarni/sanchay-ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sanchay AI

  • Sanchay is a Sanskrit word that means collection or accumulation.
  • We think that It is a fitting name for this project because it captures the essence of what it does - it takes a video and collects its key elements (transcription, subtitles, and video chapters) in an organized and easily accessible manner.
  • It's like creating a collection of information from a single source, making it more useful and convenient to work with. The name Sanchaya also has a nice ring to it.
  • Sanchay-AI internally uses Whisper and OpenAI to achieve this feat.

Project Setup

Pre-requisite: Python

Install openai-Whisper

pip install -U openai-whisper

Install ffmpeg

brew install ffmpeg

Install setuptools-rust

pip install setuptools-rust

Install tiktoken

pip install --upgrade tiktoken

Note:

  • Ensure that your video is stored in videos directory.
  • Edit Env in ./run.sh, most importantly, the INPUT_FILE_PATH property & the OPENAI_API_KEY property

Run the script

chmod +x run.sh
./run.sh

Environment Variables

Key Description
INPUT_FILE_PATH Video file path
OPENAI_API_KEY https://platform.openai.com/api-keys
OPENAI_MODEL https://platform.openai.com/docs/models
MAX_TOKENS This is used to split the subtitles into chunks of MAX_TOKENS. MAX_TOKENS should be 4/5th of your OPENAI_MODEL's max tokens
OUTPUT_TYPE options [txt, json, vtt, srt, tsv, all], logic supports vtt files only for now
OUTPUT_DIR Directory where output will be stored

About

Takes your video and generates video title, description, hashtags, transcription, subtitles and more.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages