Nitro - Accelerated AI Inference Engine

Accelerated AI Inference Server written in C++. Runs on consumer-grade hardware to datacenter-grade GPUs. OpenAI compatible API. StableDiffusion compatible API.

Getting Started - Docs - Changelog - Bug reports - Discord

⚠️ Nitro is currently in development: expect breaking changes and bugs!

Features

Supported features

  • Simple HTTP web server for running inference on Triton (without the Triton client)
  • Upload inference results to S3 (txt2img)

TODO:

  • Local file server
  • Cache
  • GGML inference support (llama.cpp, etc.)
  • Plugins support

Nitro Endpoints

- /inferences/llm_models: OpenAI-compatible (streaming)
- /inferences/txt2img: POST, JSON body
- /inferences/img2img: POST, multipart form data
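
For a quick sanity check against a running server, the requests below sketch how these endpoints might be called. This is a minimal sketch only: the port (3000, as published in the Docker example below), the chat-style body for /inferences/llm_models, and the prompt field for /inferences/txt2img are assumptions, not documented request schemas.

    # Assumed OpenAI-style streaming request; the exact body schema is not documented here.
    curl http://localhost:3000/inferences/llm_models \
      -H "Content-Type: application/json" \
      -d '{"messages": [{"role": "user", "content": "Hello"}], "stream": true}'

    # Assumed JSON body for txt2img; the field name is illustrative only.
    curl http://localhost:3000/inferences/txt2img \
      -H "Content-Type: application/json" \
      -d '{"prompt": "a photo of an astronaut riding a horse"}'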

Documentation

Installation

Using Docker (Recommended)

  1. Prerequisites: Ensure you have a base Docker image with Triton Client installed.

    • Currently, only compatible with nvcr.io/nvidia/tritonserver:23.06-py3-sdk.
  2. Build Docker Image:

    docker build . -t jan_infer
  3. Configuration:

    • Download and modify the example config file from here.
    • Make sure to rename it by removing "example." from the filename.
    custom_config:
      s3_public_endpoint:  <your s3 endpoint>
      triton_endpoint: <your triton ip:port>
      s3_bucket: <your s3 bucket name>
      drogon_port: <backend deployment port>
  4. Run Docker Container:

    • Replace the placeholders with your specific configurations.
    docker run \
      -v /path/to/your/config.yaml:/workspace/workdir/janinfer_backend/config.yaml \
      -p 3000:3000 \
      -e AWS_ACCESS_KEY_ID=<your_access_key> \
      -e AWS_SECRET_ACCESS_KEY=<your_secret_key> \
      -e AWS_DEFAULT_REGION=<your_region> \
      jan_infer

Note: /path/to/your/config.yaml is the config file you created in step 3. You can place it anywhere on the host, as long as you mount it into the container as shown above.
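
For reference, a filled-in config might look like the snippet below. These values are illustrative assumptions only; substitute your own endpoints and bucket, and note that drogon_port should match the container port published with -p (3000 in the example above).

    custom_config:
      s3_public_endpoint: https://s3.us-east-1.amazonaws.com   # illustrative S3 endpoint
      triton_endpoint: 10.0.0.5:8001                           # illustrative Triton ip:port
      s3_bucket: my-inference-results                          # illustrative bucket name
      drogon_port: 3000                                        # must match the published port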

That's it! You should now have the inference backend up and running.

Information about how some parts of the backend are implemented can be found in the Developer Documentation.

About Nitro

Repo Structure

.
|-- core
|   |-- inference_backend
|   |   |-- controllers
|   |   |   |-- img2img
|   |   |   |-- llm_models
|   |   |   `-- txt2img
|   |   |-- include
|   |   |-- schemas
|   |   `-- test
|   |-- models
|   `-- scripts
`-- docs
    |-- development
    `-- openapi

Architecture

Current architecture

Contributing

Contributions are welcome! Please read the CONTRIBUTING.md file for guidelines on how to contribute to this project.

Please note that Jan intends to build a sustainable business that can provide high quality jobs to its contributors. If you are excited about our mission and vision, please contact us to explore opportunities.

Contact

  • For support: please file a GitHub issue
  • For questions: join our Discord here
  • For long form inquiries: please email [email protected]
