Expert Specialized Fine-Tuning

Python 148 13 Updated Sep 22, 2024

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 124 5 Updated Nov 20, 2024

DeepSeek LLM: Let there be answers

Makefile 1,472 94 Updated Feb 4, 2024

MOSS-RLHF

Python 1,294 101 Updated Mar 3, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,481 212 Updated Nov 12, 2024

A curated list of open-source projects related to DeepSeek Coder

268 26 Updated Apr 3, 2024

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Python 2,043 88 Updated Aug 21, 2024

DeepSeek Coder: Let the Code Write Itself

Python 6,887 477 Updated May 21, 2024

A high-performance deep learning training platform with task-level time-shared scheduling of GPU compute

Python 312 39 Updated Oct 24, 2023

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,649 1,197 Updated Jul 25, 2024

AI magics meet Infinite draw board.

Jupyter Notebook 2,133 215 Updated May 9, 2024

The test of different distributed-training methods on High-Flyer AIHPC

Python 21 3 Updated Oct 18, 2022

FireFlyer Record file format, writer and reader for DL training samples.

Python 116 8 Updated Dec 1, 2022
Python 36 6 Updated Jun 10, 2022

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

Python 134 13 Updated Jun 10, 2022