powergiant

powergiant

1 follower · 0 following

Block or Report

Block or report powergiant

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (10)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,890 180 Updated Aug 9, 2024

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,453 492 Updated Jul 16, 2024

mst272 / LLM-Dojo

欢迎来到 LLM-Dojo，这里是一个开源大模型学习场所，使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 162 11 Updated Aug 8, 2024

jianzhnie / LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

Python 554 62 Updated Jul 10, 2024

liaokongVFX / LangChain-Chinese-Getting-Started-Guide

LangChain 的中文入门教程

7,219 576 Updated Jul 7, 2023

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,332 373 Updated Aug 8, 2024

apple / ml-mgie

Python 3,817 249 Updated Mar 15, 2024

tyxsspa / AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,108 272 Updated Jun 21, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,519 338 Updated Aug 7, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,639 2,044 Updated Jul 31, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 30,570 3,519 Updated Aug 10, 2024

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 13,839 1,368 Updated Aug 9, 2024

Li-TianCheng / TinyDL

基于Eigen运算库的深度学习框架(支持CUDA加速)

C++ 15 4 Updated Jan 12, 2022

ShigekiKarita / grain

autograd mir and CUDA library for dynamic neural networks in D.

D 66 7 Updated May 15, 2021

raskr / rust-autograd

Tensors and differentiable operations (like TensorFlow) in Rust

Rust 484 36 Updated Feb 11, 2023

ikostrikov / pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Python 310 56 Updated Aug 27, 2018

ymcui / Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,031 573 Updated Apr 30, 2024

DLLXW / baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,385 292 Updated May 21, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,157 2,014 Updated Aug 9, 2024

peter1591 / hearthstone-ai

A Hearthstone AI based on Monte Carlo tree search and neural nets written in modern C++.

C++ 295 49 Updated Mar 1, 2018

TimeBreaker / Multi-Agent-Reinforcement-Learning-papers

Multi-Agent Reinforcement Learning (MARL) papers

192 33 Updated Sep 19, 2022

Allenpandas / Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL)，including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

279 32 Updated May 30, 2024

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 3,797 837 Updated Mar 24, 2023

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 13,391 4,805 Updated Aug 9, 2024

hiyouga / LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,741 3,526 Updated Aug 10, 2024

liutiedong / goat

a Fine-tuned LLaMA that is Good at Arithmetic Tasks

Jupyter Notebook 173 17 Updated Sep 15, 2023

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—foundation models

Python 15,373 1,189 Updated Aug 10, 2024

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 132 15 Updated May 29, 2024

jafioti / luminal

Deep learning at the speed of light.

Rust 1,418 88 Updated Aug 4, 2024

hsiehjackson / Deep-Reinforcement-Learning-on-Atari-Games

Python 12 5 Updated Nov 22, 2022

powergiant

Block or report powergiant

Lists (10)

CV

DL system

DL theory

image generation

LLM

meta learning

PL

programming projects

RL

voice generation

Stars