Stars
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Find the Best LLM for Your Needs through E2E Testing
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
winglian / deita
Forked from hkust-nlp/deitaDeita: Data-Efficient Instruction Tuning for Alignment
A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.
A Gradio web UI for Large Language Models.
Automate Creation of YouTube Shorts using MoviePy.
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
GGUF implementation in C as a library and a tools CLI program
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
LLaMA: Open and Efficient Foundation Language Models
Public reports detailing responses to sets of prompts by Large Language Models.
A curated list of awesome transformer models.
Run inference on replit-3B code instruct model using CPU
CodeUp: A Multilingual Code Generation Llama2 Model with Parameter-Efficient Instruction-Tuning on a Single RTX 3090
A high-throughput and memory-efficient inference and serving engine for LLMs
Curated list of AI-powered developer tools.
The code for the bark-voicecloning model. Training and inference.
Source code behind the python-patterns.guide site by Brandon Rhodes