
Serverless AI Inference with Gemma 2 using Mozilla's llamafile on AWS Lambda

https://www.unremarkable.ai/serverless-ai-inference-with-gemma-2-using-mozillas-llamafile-on-aws-lambda

[Image: Llamafile on AWS Lambda]

Setup

We assume you have the following configured or installed.

  1. An AWS account with credentials configured.
  2. The AWS SAM CLI installed for fast and easy serverless deployments.
  3. Docker installed for easy container builds and deployments.
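Before building, you can sanity-check these prerequisites from your shell. This is a quick verification sketch, assuming the standard aws, sam, and docker CLIs are on your PATH:

aws sts get-caller-identity   # prints your account identity if AWS credentials are configured
sam --version                 # prints the installed AWS SAM CLI version
docker --version              # prints the installed Docker version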

After you clone the repo, set up its dependencies with the following command:

npm install
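Put together, the clone and install steps look like this (assuming you are cloning metaskills/llamafile-on-lambda from GitHub):

git clone https://github.com/metaskills/llamafile-on-lambda.git
cd llamafile-on-lambda
npm install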

Usage

Now you can run the following commands from the root directory.

  • ./bin/build - To download and build a llamafile container for deployment.
  • ./bin/server - To run the downloaded llamafile server (from the build above) locally.
  • ./bin/deploy - To deploy to AWS Lambda. Also runs the build if needed.
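A typical local workflow is sketched below. The port is an assumption: 8080 is the llamafile server default, so use whatever address bin/server actually prints.

./bin/build
./bin/server
# In a second terminal, confirm the local server is responding:
curl http://localhost:8080/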

Chat

This project uses Inquirer.js to chat with the model over its OpenAI-compatible API. The model can be running locally via ./bin/server or deployed to Lambda via ./bin/deploy. Inquirer will ask for your local or function URL at the beginning of the chat session.
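For reference, you can also send a chat completion request directly with curl. This is a sketch, not the project's chat client: BASE_URL stands in for either your local server (for example http://localhost:8080) or the Function URL printed after ./bin/deploy, and the model name is a hypothetical placeholder that the llamafile server may ignore.

# Replace BASE_URL with your local server or Lambda Function URL.
curl "$BASE_URL/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{"model": "gemma-2", "messages": [{"role": "user", "content": "Hello!"}]}'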

[Image: Llamafile on AWS Lambda]