Skip to content
View imran3180's full-sized avatar
  • New York
Block or Report

Block or report imran3180

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast inference from large lauguage models via speculative decoding

Python 410 45 Updated Sep 22, 2023

Foundation Model Evaluations Library

Python 150 40 Updated Jun 27, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 13,692 1,050 Updated Jun 27, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 5,548 510 Updated Jun 27, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 34,268 5,254 Updated Jun 27, 2024

Structured Text Generation

Python 6,884 354 Updated Jun 25, 2024

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform…

Python 2,079 174 Updated Jun 8, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 1,848 125 Updated Jun 27, 2024

Adding guardrails to large language models.

Python 3,587 258 Updated Jun 27, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 10,287 1,463 Updated Jun 27, 2024
Jupyter Notebook 9 2 Updated Jun 17, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,324 793 Updated Jun 27, 2024

Build ChatGPT over your data, all with natural language

Python 6,039 610 Updated Apr 5, 2024

SageMaker custom deployments made easy

Jupyter Notebook 56 16 Updated Jan 3, 2024

Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.

Jupyter Notebook 176 51 Updated Jun 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,752 3,063 Updated Jun 27, 2024

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Python 10,079 1,119 Updated Jun 27, 2024

Large Language Model Text Generation Inference

Python 8,297 939 Updated Jun 27, 2024

Large Language Model Hosting Container

Dockerfile 72 22 Updated Jun 24, 2024

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Jupyter Notebook 9,778 6,660 Updated Jun 27, 2024

A library for training and deploying machine learning models on Amazon SageMaker

Python 2,065 1,114 Updated Jun 27, 2024

A universal scalable machine learning model deployment solution

Java 180 58 Updated Jun 27, 2024

🦜🔗 Build context-aware reasoning applications

Python 88,167 13,805 Updated Jun 27, 2024

🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞

Python 717 43 Updated Sep 13, 2023

A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin

Python 2,666 349 Updated May 25, 2024

A collection of services with great free tiers for developers on a budget. Sponsored by Mockoon, the best mock API tool. https://mockoon.com

12,017 524 Updated Jun 3, 2024

AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker

Jupyter Notebook 3,312 1,066 Updated Mar 12, 2024

The P programming language.

C# 2,952 173 Updated Jun 19, 2024

Machine learning glossary

Python 2,978 717 Updated Jan 28, 2024
Next