Stars
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Locating and editing factual associations in GPT (NeurIPS 2022)
[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds (CVPR 2023)
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Exercise answers for the book "Machine Learning" by Zhou Zhihua. Personal solutions to the end-of-chapter exercises; every algorithm is implemented with numpy and pandas.
BERT Chinese text classification model implemented in PyTorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically Chat…
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
This repository contains the jailbreaking process for GPT-3, GPT-4, GPT-3.5, ChatGPT, and ChatGPT Plus. By following the instructions in this repository, you will be able to gain access to the inne…
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.
A relabeled version of the HatEval (SemEval 2019) dataset used in "Practical Transformer-based Multilingual Text Classification" (to appear in proceedings of NAACL 2021).
CMPUT 497 Project: SemEval 2019 Task 5 Shared Task on Multilingual Detection of Hate