Stars
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Quickly make and deploy full-stack apps with database, auth, styling, storage etc. figured out for you. Add all primitives you want.
Simplifying reinforcement learning for complex game environments
Code for the paper Fine-Tuning Language Models from Human Preferences
Code for "Learning to summarize from human feedback"
A full-featured, hackable Next.js AI chatbot built by Vercel
Mastering Diverse Domains through World Models
High-quality single-file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Curated List of React Components & Libraries.
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
NVIDIA's Deep Imagination Team's PyTorch Library
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Bitwise is an educational project where we create the software/hardware stack for a computer from scratch.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Paired Open-Ended Trailblazer (POET) and Enhanced POET
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Segmentation models with pretrained backbones. Keras and TensorFlow Keras.
A collaboratively written review paper on deep learning, genomics, and precision medicine
ULMFiT for Genomic Sequence Data
Many studies have shown that the performance of deep learning is significantly affected by the volume of training data. The MedicalNet project provides a series of 3D-ResNet pre-trained models and rela…
Fit interpretable models. Explain blackbox machine learning.
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Rethinking the Value of Network Pruning (PyTorch) (ICLR 2019)