Highlights
- Pro
Block or Report
Block or report vo-lar
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
😎 Awesome lists about all kinds of interesting topics
Must-read papers on prompt-based tuning for pre-trained language models.
各种安全相关思维导图整理收集。渗透步骤,web安全,CTF,业务安全,人工智能,区块链安全,数据安全,安全开发,无线安全,社会工程学,二进制安全,移动安全,红蓝对抗,运维安全,风控安全,linux安全
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
PFRL: a PyTorch-based deep reinforcement learning library
中文整理的强化学习资料(Reinforcement Learning)
This is a PyTorch implementation of the paper "Reinforcement Learning-Based Black-Box Model Inversion Attacks" accepted by CVPR 2023.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
A curated list of reinforcement learning with human feedback resources (continually updated)
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
This is the offical repo for the paper "PARL: A Dialog System Framework with Prompts as Actions for Reinforcement Learning" at ICAART 2023. https://www.scitepress.org/PublicationsDetail.aspx?ID=nLE…
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
主要是我是日常看过的不错的文章的资源汇总,方便自己也分享给大家。有些我看过的,就会做简单的解读,没看过的,就先罗列一下,然后之后看了把解读更新上;涉及到搜索/推荐/自然语言处理。
深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解,以及如何实现深度学习推理框架实战。
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
A framework to evaluate the generalization capability of safety alignment for LLMs
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。
Adversarial attacks on Deep Reinforcement Learning (RL)
DeepSeek Coder: Let the Code Write Itself