- New York
Block or Report
Block or report imran3180
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Fast inference from large lauguage models via speculative decoding
DSPy: The framework for programming—not prompting—foundation models
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
The simplest, fastest repository for training/finetuning medium-sized GPTs.
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform…
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Adding guardrails to large language models.
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Build ChatGPT over your data, all with natural language
SageMaker custom deployments made easy
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
A high-throughput and memory-efficient inference and serving engine for LLMs
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
Large Language Model Text Generation Inference
Large Language Model Hosting Container
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
A library for training and deploying machine learning models on Amazon SageMaker
A universal scalable machine learning model deployment solution
🦜🔗 Build context-aware reasoning applications
🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
A collection of services with great free tiers for developers on a budget. Sponsored by Mockoon, the best mock API tool. https://mockoon.com
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker