Skip to content

TagGPT: A simple ChatGPT based multimodal dialog generation engine that can "see/draw" and "hear/speak"

Notifications You must be signed in to change notification settings

haodong2000/TagGPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TagGPT

Demos | Prompt

Configuration

  1. Add your API_KEY for OPENAI, REPLICATE, and AAI into the environment variables
export OPENAI_API_KEY='...'
export REPLICATE_API_TOKEN='...'
export AAI_API_KEY='...'
  1. Install required packages
openai
replicate
gtts
requests
datetime
pillow
time

Demos

Usage

  • Images are expressed as [[[./path/to/image]]]
  • Audios are expressed as <<<./path/to/audio>>>
  • END is the end-of-input marker
  • Examples:
You: Here is a picture [[[./images/in/dogpizza.jpg]]], replace the dogs with white cute cats.
END

ChatGPT: Here is your updated picture: [[[./images/out/sd2_2023-03-25_12-49-24.png]]]. Enjoy!
You: <<<./audios/in/assemblyai.mp3>>>
please help me writa a short introduction about AssemblyAI END

ChatGPT: AssemblyAI is a deep learning company that ...

Project Structure

TagGPT
├── AIGC
│   ├── clip_interrogator.py
│   ├── dall_e2.py
│   ├── gtts_t2a.py
│   ├── stable_diffusion_2.py
│   └── transcribe.py
├── audios
│   ├── in
|   └── out
├── images
│   ├── in
|   └── out
└── run_gpt_3.5.py

To-do

  • Build a UI (maybe)

Tips

  1. if you meet with requests.exceptions.ConnectionError, change http to https in ./AIGC/utils.py might help :)