vision

vision is a simple OpenAI CLI and GPTScript Tool for interacting with vision models.

Prerequisites

NodeJS
OpenAI API key

Installation

Clone this repository or download the source code:

git clone [email protected]:gptscript-ai/vision.git
cd vision

Install the npm dependencies
```
npm install 
```

Usage

Command help

$ node index.js --help
Usage: index [options] <prompt> <images...>

Utility for processing images with the OpenAI API

Arguments:
  prompt                      Prompt to send to the vision model
  images                      List of image URIs to process. Supports file:https:// and https:// protocols. Images must be jpeg or png.

Options:
  --openai-api-key <key>      OpenAI API Key (env: OPENAI_API_KEY)
  --openai-base-url <string>  OpenAI base URL (env: OPENAI_BASE_URL)
  --openai-org-id <string>    OpenAI Org ID to use (env: OPENAI_ORG_ID)
  --max-tokens <number>       Max tokens to use (default: 2048, env: MAX_TOKENS)
  --model <model>             Model to process images with (choices: "gpt-4-vision-preview", default: "gpt-4-vision-preview", env: MODEL)
  --detail <detail>           Fidelity to use when processing images (choices: "low", "high", "auto", default: "auto", env: DETAIL)
  -h, --help                  display help for command

Ask a question about an image in a local file

node index.js 'Describe the picture' 'file:https://examples/eiffel-tower.png'

Ask a question about an image at a remote URL

node index.js 'Describe the picture' 'https://github.com/gptscript-ai/vision/blob/main/examples/eiffel-tower.png?raw=true'

Ask a question related to multiple images

node index.js 'Do you think these two portraits are by the same artist?' 'https://github.com/gptscript-ai/vision/blob/main/examples/eiffel-tower.png?raw=true' 'file:https://examples/eiffel-tower.png'

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
examples		examples
.gitignore		.gitignore
README.md		README.md
bootstrap.gpt		bootstrap.gpt
index.js		index.js
package.json		package.json
tool.gpt		tool.gpt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

vision

Prerequisites

Installation

Usage

Command help

Ask a question about an image in a local file

Ask a question about an image at a remote URL

Ask a question related to multiple images

About

Releases

Packages

Languages

ibuildthecloud/vision

Folders and files

Latest commit

History

Repository files navigation

vision

Prerequisites

Installation

Usage

Command help

Ask a question about an image in a local file

Ask a question about an image at a remote URL

Ask a question related to multiple images

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages