#

large-model-inference

Here are 2 public repositories matching this topic...

aws-samples / amazon-sagemaker-llama2-response-streaming-recipes

Amazon SageMaker Llama 2 Inference via Response Streaming

sagemaker sagemaker-endpoint response-streaming large-language-models text-generation-inference llama2 large-model-inference

Updated Jun 28, 2024
Jupyter Notebook

windson / inferentia-deployments

Deploy Large Models on AWS Inferentia (Inf2) instances.

aws lmi inf2 large-model llm inferentia large-language-model large-model-inference aws-inferentia inferentia-2

Updated Dec 28, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the large-model-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the large-model-inference topic, visit your repo's landing page and select "manage topics."