MizuleGPT

MizuleGPT

Stars

DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 38 2 Updated Oct 24, 2024

[NeurIPS 2024] Can LLMs Learn by Teaching? A Preliminary Study

Python 32 3 Updated Oct 9, 2024

DQNSuite is a revolutionary tool that brings the power of Reinforcement Learning models into the palm of the user's hand.

Python 3 1 Updated Oct 13, 2024

Create an open source toy dataset for finetuning LLMs with reasoning abilities

Keeping my personal experiments separate from the main repo

Python 63 6 Updated Sep 18, 2024

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.