
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,890 180 Updated Aug 9, 2024

Firefly: a large-model training tool that supports training Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 5,453 492 Updated Jul 16, 2024

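Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), RLHF frameworks (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓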

Python 162 11 Updated Aug 8, 2024

Easy and efficient fine-tuning of LLMs (supports LLaMA, LLaMA2, LLaMA3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.

Python 554 62 Updated Jul 10, 2024

A Chinese-language introductory tutorial for LangChain

7,219 576 Updated Jul 7, 2023

Robust recipes to align language models with human and AI preferences

Python 4,332 373 Updated Aug 8, 2024
Python 3,817 249 Updated Mar 15, 2024

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,108 272 Updated Jun 21, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,519 338 Updated Aug 7, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,639 2,044 Updated Jul 31, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 30,570 3,519 Updated Aug 10, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 13,839 1,368 Updated Aug 9, 2024

A deep learning framework based on the Eigen library (with CUDA acceleration support)

C++ 15 4 Updated Jan 12, 2022

autograd mir and CUDA library for dynamic neural networks in D.

D 66 7 Updated May 15, 2021

Tensors and differentiable operations (like TensorFlow) in Rust

Rust 484 36 Updated Feb 11, 2023

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Python 310 56 Updated Aug 27, 2018

Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project) with 64K long-context models

Python 7,031 573 Updated Apr 30, 2024

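A repository for pretraining from scratch and SFT of a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to produce a chat-llama2 with basic Chinese question-answering ability.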

Python 2,385 292 Updated May 21, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,157 2,014 Updated Aug 9, 2024

A Hearthstone AI based on Monte Carlo tree search and neural nets written in modern C++.

C++ 295 49 Updated Mar 1, 2018

Multi-Agent Reinforcement Learning (MARL) papers

192 33 Updated Sep 19, 2022

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

279 32 Updated May 30, 2024

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 3,797 837 Updated Mar 24, 2023

Python Implementation of Reinforcement Learning: An Introduction

Python 13,391 4,805 Updated Aug 9, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,741 3,526 Updated Aug 10, 2024

A fine-tuned LLaMA that is good at arithmetic tasks

Jupyter Notebook 173 17 Updated Sep 15, 2023

DSPy: The framework for programming—not prompting—foundation models

Python 15,373 1,189 Updated Aug 10, 2024

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 132 15 Updated May 29, 2024

Deep learning at the speed of light.

Rust 1,418 88 Updated Aug 4, 2024