Stars
10 Weeks, 20 Lessons, Data Science for All!
High-Resolution Image Synthesis with Latent Diffusion Models
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Modeling, training, eval, and inference code for OLMo
Transforms PDF, Documents and Images into Enriched Structured Data
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Official implementation of TransNormerLLM: A Faster and Better LLM
High quality resources & applications for LLMs, multi-modal models and VectorDBs
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
Superduper: Integrate AI models and machine learning workflows with your database to implement custom AI applications, without moving your data. Including streaming inference, scalable model hostin…
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Open source codebase powering the HuggingChat app
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Making large AI models cheaper, faster and more accessible
Robust recipes to align language models with human and AI preferences
The Official Python Client for Lamini's API