Skip to content
View gsc579's full-sized avatar
Block or Report

Block or report gsc579

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 13,020 1,286 Updated Aug 1, 2024

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 8,261 803 Updated Aug 1, 2024

PyTorch Implementation of MADDPG (Lowe et. al. 2017)

Python 545 127 Updated Nov 26, 2019

the resources about the application based on LLM with RAG pattern

627 42 Updated Jul 20, 2024

Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Transformer输入到输出整个流程。

Python 182 50 Updated Apr 24, 2024

🦄️ 🎃 👻 Clash Premium 规则集(RULE-SET),兼容 ClashX Pro、Clash for Windows 等基于 Clash Premium 内核的客户端。

17,761 1,579 Updated Jul 31, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 31,324 2,334 Updated Aug 1, 2024

中文法律对话语言模型

Python 1,001 114 Updated May 13, 2024

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,753 529 Updated Jun 11, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,663 1,851 Updated Jun 27, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,231 3,458 Updated Aug 1, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 90,084 14,247 Updated Aug 1, 2024

RL in AutoPilot 自动驾驶强化学习:效果展示,框架设计、算法和训练经验文档等(部分开源,update from private repo: egocar)

Python 97 34 Updated Jul 12, 2019

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 30,313 5,315 Updated Aug 1, 2024

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

6,717 534 Updated Jun 4, 2024

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,070 369 Updated Sep 29, 2023

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,408 470 Updated Jan 8, 2024

Unified Reinforcement Learning Framework

Python 605 60 Updated Jun 6, 2024

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 8,790 1,779 Updated Jul 25, 2024

Really Fast End-to-End Jax RL Implementations

Python 645 54 Updated Jul 29, 2024

🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org

Python 5,064 615 Updated Aug 1, 2024

Train transformer language models with reinforcement learning.

Python 8,919 1,097 Updated Aug 1, 2024
Python 53 26 Updated Dec 27, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,348 684 Updated Jul 11, 2024

basic algorithms of reinforcement learning

Jupyter Notebook 1 Updated Jan 15, 2023

basic algorithms of reinforcement learning

Jupyter Notebook 185 53 Updated Aug 23, 2023

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,027 118 Updated Aug 3, 2023

An index of algorithms for offline reinforcement learning (offline-rl)

891 86 Updated May 23, 2024

A collection of offline reinforcement learning algorithms.

Python 151 20 Updated Jun 6, 2024
Next