Skip to content
View MizuleGPT's full-sized avatar

Block or report MizuleGPT

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 38 2 Updated Oct 24, 2024

[NeurIPS 2024] Can LLMs Learn by Teaching? A Preliminary Study

Python 32 3 Updated Oct 9, 2024

DQNSuite is a revolutionary tool that brings the power of Reinforcement Learning models into the palm of the user's hand.

Python 3 1 Updated Oct 13, 2024

Create an open source toy dataset for finetuning LLMs with reasoning abilities

379 27 Updated Sep 17, 2024

Keeping my personal experiments separate from the main repo

Python 63 6 Updated Sep 18, 2024

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

299 11 Updated Apr 18, 2024