vo-lar

Follow

🎯

Focusing

Zzikang vo-lar

🎯

Focusing

Follow

Take it ez

4 followers · 30 following

TJU->ZJU
shanghai
12:40 (UTC +08:00)

Highlights

Pro

Block or Report

Block or report vo-lar

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Starred repositories

sindresorhus / awesome

😎 Awesome lists about all kinds of interesting topics

314,972 27,341 Updated Jul 28, 2024

thunlp / PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

4,011 372 Updated Jul 17, 2023

Ascotbe / HackerMind

各种安全相关思维导图整理收集。渗透步骤，web安全，CTF，业务安全，人工智能，区块链安全，数据安全，安全开发，无线安全，社会工程学，二进制安全，移动安全，红蓝对抗，运维安全，风控安全，linux安全

1,321 273 Updated Dec 4, 2023

Curt-Park / rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook 1,814 333 Updated Jan 19, 2024

DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 1,921 498 Updated Jul 26, 2024

pfnet / pfrl

PFRL: a PyTorch-based deep reinforcement learning library

Python 1,166 158 Updated Jul 26, 2024

astooke / rlpyt

Reinforcement Learning in PyTorch

Python 2,212 323 Updated Jan 4, 2021

wwxFromTju / awesome-reinforcement-learning-zh

中文整理的强化学习资料（Reinforcement Learning）

1,901 352 Updated Apr 30, 2020

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 2,176 498 Updated Nov 22, 2022

HanGyojin / RLB-MI

This is a PyTorch implementation of the paper "Reinforcement Learning-Based Black-Box Model Inversion Attacks" accepted by CVPR 2023.

Python 33 3 Updated May 4, 2023

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,260 115 Updated Jun 13, 2024

voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 534 64 Updated May 9, 2024

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,106 195 Updated Jul 21, 2024

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,655 672 Updated Jan 14, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 62,644 8,987 Updated Jul 29, 2024

TUM-NLPLab-2022 / PARL-A-Dialog-System-Framework-with-Prompts-as-Actions-for-Reinforcement-Learning

This is the offical repo for the paper "PARL: A Dialog System Framework with Prompts as Actions for Reinforcement Learning" at ICAART 2023. https://www.scitepress.org/PublicationsDetail.aspx?ID=nLE…

Python 7 Updated Jun 10, 2023

ydyjya / Awesome-LLM-Safety

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

710 45 Updated Jul 28, 2024

DA-southampton / Tech_Aarticle

主要是我是日常看过的不错的文章的资源汇总，方便自己也分享给大家。有些我看过的，就会做简单的解读，没看过的，就先罗列一下，然后之后看了把解读更新上；涉及到搜索/推荐/自然语言处理。

1,722 325 Updated Jun 3, 2021

HarleysZhang / dl_note

深度学习系统笔记，包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解，以及如何实现深度学习推理框架实战。

Python 289 49 Updated Feb 2, 2024

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,443 413 Updated Jun 22, 2024

RobustNLP / CipherChat

A framework to evaluate the generalization capability of safety alignment for LLMs

Python 550 61 Updated Jul 24, 2024

lazyparser / minimalist-team-leader

极简主义团队管理操作手册

553 27 Updated Apr 8, 2023

mingkaid / rl-prompt

Accompanying repo for the RLPrompt paper

Python 287 52 Updated Jun 6, 2024

clash-verge-rev / clash-verge-rev

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

TypeScript 28,016 2,147 Updated Jul 28, 2024

lm-sys / RouteLLM

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,417 165 Updated Jul 25, 2024

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 2,050 273 Updated Jul 28, 2024

krahets / hello-algo

《Hello 算法》：动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新，English version ongoing

Java 90,112 11,356 Updated Jul 28, 2024

The-Run-Philosophy-Organization / run

润学全球官方指定GITHUB，整理润学宗旨、纲领、理论和各类润之实例；解决为什么润，润去哪里，怎么润三大问题；并成为新中国人的核心宗教，核心信念。

31,149 2,578 Updated Jan 2, 2024

davide97l / rl-policies-attacks-defenses

Adversarial attacks on Deep Reinforcement Learning (RL)

Jupyter Notebook 74 12 Updated Feb 27, 2021

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Python 6,189 444 Updated May 21, 2024

Starred topics

Security

Docker