Skip to content
View ttumiel's full-sized avatar

Block or report ttumiel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Python 2,647 249 Updated Nov 17, 2024

Quickly make and deploy full-stack apps with database, auth, styling, storage etc. figured out for you. Add all primitives you want.

TypeScript 3,096 231 Updated Nov 5, 2024
Jupyter Notebook 90 9 Updated Jun 27, 2024

Simplifying reinforcement learning for complex game environments

Python 1,260 59 Updated Nov 16, 2024

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,230 164 Updated Jul 25, 2023

Code for "Learning to summarize from human feedback"

Python 992 144 Updated Sep 5, 2023

A full-featured, hackable Next.js AI chatbot built by Vercel

TypeScript 9,480 2,407 Updated Nov 15, 2024

Mastering Diverse Domains through World Models

Python 1,378 233 Updated Jul 29, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,703 643 Updated Nov 14, 2024

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 635 80 Updated Nov 15, 2024

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 735 90 Updated Nov 18, 2024

Curated List of React Components & Libraries.

42,688 3,506 Updated Aug 12, 2024

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python 2,631 414 Updated Nov 12, 2024

NVIDIA's Deep Imagination Team's PyTorch Library

Python 4,015 451 Updated Nov 29, 2022

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Jupyter Notebook 10,142 6,779 Updated Nov 15, 2024

Bitwise is an educational project where we create the software/hardware stack for a computer from scratch.

C 5,135 212 Updated Mar 7, 2019

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,405 3,386 Updated Nov 18, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 34,037 5,782 Updated Nov 17, 2024

Evolution Strategies Tool

Jupyter Notebook 936 163 Updated Dec 8, 2022

Paired Open-Ended Trailblazer (POET) and Enhanced POET

Python 242 53 Updated Mar 23, 2022

Python implementation of CMA-ES

Jupyter Notebook 1,109 179 Updated Oct 9, 2024

Python programs, usually short, of considerable difficulty, to perfect particular skills.

Jupyter Notebook 23,151 2,439 Updated Oct 28, 2024

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 9,697 1,678 Updated Nov 18, 2024

Segmentation models with pretrained backbones. Keras and TensorFlow Keras.

Python 4,762 1,033 Updated Aug 21, 2024

A collaboratively written review paper on deep learning, genomics, and precision medicine

HTML 1,251 271 Updated Dec 25, 2022

ULMFiT for Genomic Sequence Data

Jupyter Notebook 284 55 Updated Nov 15, 2019

Many studies have shown that the performance on deep learning is significantly affected by volume of training data. The MedicalNet project provides a series of 3D-ResNet pre-trained models and rela…

Python 1,952 416 Updated Jul 6, 2023

Fit interpretable models. Explain blackbox machine learning.

C++ 6,297 731 Updated Nov 18, 2024

A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)

Jupyter Notebook 7,259 1,209 Updated Oct 4, 2024

Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)

Python 1,510 293 Updated Jun 7, 2020
Next