Starred repositories
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
PyTorch implementation of MADDPG (Lowe et al., 2017)
Resources on LLM-based applications that use the RAG pattern
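Transformer is the model used in Google's 2017 paper Attention Is All You Need. After years of extensive industrial use and validation in published research, it holds an important place in deep learning. BERT is a language model derived from Transformer. Using Chinese-to-English translation as an example, I explain the full Transformer pipeline from input to output.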
🦄️ 🎃 👻 Clash Premium rule sets (RULE-SET), compatible with ClashX Pro, Clash for Windows, and other clients based on the Clash Premium core.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese legal knowledge. A large language model based on Chinese legal knowledge.
ChatGLM2-6B: An Open Bilingual Chat LLM
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🦜🔗 Build context-aware reasoning applications
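RL in AutoPilot: reinforcement learning for autonomous driving, with result demos and documentation on framework design, algorithms, and training experience (partially open source, update from private repo: egocar)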
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
ChatLaw: A powerful LLM tailored for the Chinese legal domain.
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at https://datawhalechina.github.io/easy-rl/
Really Fast End-to-End Jax RL Implementations
🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
Train transformer language models with reinforcement learning.
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
gsc579 / rl-tutorials
Forked from johnjim0816/rl-tutorials
Basic algorithms of reinforcement learning
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
An index of algorithms for offline reinforcement learning (offline-rl)
A collection of offline reinforcement learning algorithms.