Skip to content
View vo-lar's full-sized avatar
🎯
Focusing
🎯
Focusing
  • TJU->ZJU
  • shanghai
  • 12:40 (UTC +08:00)

Highlights

  • Pro
Block or Report

Block or report vo-lar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

😎 Awesome lists about all kinds of interesting topics

314,972 27,341 Updated Jul 28, 2024

Must-read papers on prompt-based tuning for pre-trained language models.

4,011 372 Updated Jul 17, 2023

各种安全相关思维导图整理收集。渗透步骤,web安全,CTF,业务安全,人工智能,区块链安全,数据安全,安全开发,无线安全,社会工程学,二进制安全,移动安全,红蓝对抗,运维安全,风控安全,linux安全

1,321 273 Updated Dec 4, 2023

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook 1,814 333 Updated Jan 19, 2024

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 1,921 498 Updated Jul 26, 2024

PFRL: a PyTorch-based deep reinforcement learning library

Python 1,166 158 Updated Jul 26, 2024

Reinforcement Learning in PyTorch

Python 2,212 323 Updated Jan 4, 2021

中文整理的强化学习资料(Reinforcement Learning)

1,901 352 Updated Apr 30, 2020

https://hrl.boyuai.com/

Jupyter Notebook 2,176 498 Updated Nov 22, 2022

This is a PyTorch implementation of the paper "Reinforcement Learning-Based Black-Box Model Inversion Attacks" accepted by CVPR 2023.

Python 33 3 Updated May 4, 2023

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,260 115 Updated Jun 13, 2024

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 534 64 Updated May 9, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,106 195 Updated Jul 21, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,655 672 Updated Jan 14, 2024

LLM inference in C/C++

C++ 62,644 8,987 Updated Jul 29, 2024

This is the offical repo for the paper "PARL: A Dialog System Framework with Prompts as Actions for Reinforcement Learning" at ICAART 2023. https://www.scitepress.org/PublicationsDetail.aspx?ID=nLE…

Python 7 Updated Jun 10, 2023

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

710 45 Updated Jul 28, 2024

主要是我是日常看过的不错的文章的资源汇总,方便自己也分享给大家。有些我看过的,就会做简单的解读,没看过的,就先罗列一下,然后之后看了把解读更新上;涉及到搜索/推荐/自然语言处理。

1,722 325 Updated Jun 3, 2021

深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解,以及如何实现深度学习推理框架实战。

Python 289 49 Updated Feb 2, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,443 413 Updated Jun 22, 2024

A framework to evaluate the generalization capability of safety alignment for LLMs

Python 550 61 Updated Jul 24, 2024

极简主义团队管理操作手册

553 27 Updated Apr 8, 2023

Accompanying repo for the RLPrompt paper

Python 287 52 Updated Jun 6, 2024

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

TypeScript 28,016 2,147 Updated Jul 28, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,417 165 Updated Jul 25, 2024

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 2,050 273 Updated Jul 28, 2024

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 90,112 11,356 Updated Jul 28, 2024

润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。

31,149 2,578 Updated Jan 2, 2024

Adversarial attacks on Deep Reinforcement Learning (RL)

Jupyter Notebook 74 12 Updated Feb 27, 2021

DeepSeek Coder: Let the Code Write Itself

Python 6,189 444 Updated May 21, 2024
Next