Skip to content
View mhqmhy's full-sized avatar
  • University of Science and Technology of China
  • Hefei, China

Block or report mhqmhy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

Python 637 32 Updated Aug 17, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,302 211 Updated Mar 20, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,154 190 Updated Mar 1, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,479 4,025 Updated Aug 26, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,183 201 Updated Jul 21, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,281 117 Updated Jun 13, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,923 187 Updated Aug 20, 2024

Utilities intended for use with Llama models.

Python 3,538 597 Updated Aug 26, 2024

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 8,730 853 Updated Aug 25, 2024

Resource-adaptive cluster scheduler for deep learning training.

Python 416 75 Updated Mar 5, 2023
Jupyter Notebook 99 41 Updated Aug 14, 2022

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

14,421 1,324 Updated Jul 21, 2024

🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Cuda 1,064 101 Updated Aug 26, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,804 400 Updated Jul 15, 2024

[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"

Python 516 39 Updated Jun 25, 2024

how to optimize some algorithm in cuda.

Cuda 1,361 114 Updated Aug 26, 2024

Awesome-LLM: a curated list of Large Language Model

16,987 1,367 Updated Aug 19, 2024

Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs

Python 45 15 Updated May 21, 2023

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 22,971 4,245 Updated Aug 18, 2024

Implementation of NeurIPS 2021 paper "On Joint Learning for Solving Placement and Routing in Chip Design" & NeurIPS 2022 paper "The Policy-gradient Placement and Generative Routing Neural Networks …

Prolog 191 44 Updated Jun 11, 2024

Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic…

Python 35,661 3,461 Updated Aug 26, 2024

2024届互联网校招信息汇总

958 73 Updated Mar 23, 2024

Eigen library with c++ testing

C++ 7 1 Updated Dec 24, 2019

based on C++11 , a mini threadpool , accept variable number of parameters 基于C++11的线程池,简洁且可以带任意多的参数

C++ 988 344 Updated Feb 15, 2023

workspace是基于C++11的轻量级异步执行框架,支持:通用任务异步并发执行、优先级任务调度、自适应动态线程池、高效静态线程池、异常处理机制等。

C++ 969 145 Updated Jun 11, 2024

BS::thread_pool: a fast, lightweight, and easy-to-use C++17 thread pool library

C++ 2,085 241 Updated May 11, 2024

A reading list for deep graph learning acceleration.

214 16 Updated Jul 23, 2024

A simple, delicate, and modern theme for the static site generator Hexo.

JavaScript 6,329 1,543 Updated Aug 26, 2024

A priority queue implemented in terms of a radix sort

C++ 3 1 Updated Sep 14, 2011
Next